© 2026 Hashnode
You're a web developer, constantly pushing the boundaries of what's possible in the browser and on the server. For years, integrating sophisticated Artificial Intelligence into web applications often meant hefty cloud bills, data privacy concerns, or...

You've got a 1.7 billion parameter model. You want it running locally. In a browser tab. No server, no API keys, no Docker containers. Sounds impossible, right? A few months ago, I would've agreed with you. But 1-bit quantized models like Bonsai 1.7B...

No external software, no cloud, no complexity—just install and start chatting. TL;DR: I built a Chrome extension that runs Llama, DeepSeek, Qwen, and other LLMs entirely in-browser. No server, no Ollama, no API keys. Here's the full story—why I built...
