Weight Caching + GPU Snapshot Recipe for Sub-Second Cold Starts with vLLM + Modal Volume | DEV BAK - TECH BLOG