Browser AI Just Killed My Edge Functions Bill: A Field Report on Gemini Nano in Chrome
6d ago · 3 min read · A 71% drop in inference spend, with one caveat Last quarter our edge inference bill at Vercel hit $11,400 a month, mostly from a single feature: a writing-assist tool that runs a Llama 3.1 8B model behind a Cloudflare Workers AI endpoint. In April 20...
Join discussion















