GGUF, Quantization, and Pruning: The Three Keys to "Shrinking" an AI Brain
I used to think that "smaller model" just meant "worse model." But today I learned that there are two separate ways to make an AI fit on a phone: you can store its weights with less precision (Quantization), or you can cut away the connections it barely uses (Pruning).
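To make those two ideas concrete, here is a minimal toy sketch (my own illustration, not code from the article): symmetric int8 quantization, which maps float32 weights into the integer range [-127, 127] so each weight takes 1 byte instead of 4, and magnitude pruning, which zeroes out the smallest-magnitude weights.

```python
import numpy as np

def quantize_int8(weights):
    """Symmetric int8 quantization: scale floats into [-127, 127]."""
    scale = np.max(np.abs(weights)) / 127.0
    q = np.round(weights / scale).astype(np.int8)
    return q, scale

def dequantize(q, scale):
    """Recover approximate float weights from the int8 values."""
    return q.astype(np.float32) * scale

def magnitude_prune(weights, fraction=0.5):
    """Zero out the given fraction of weights with the smallest magnitude."""
    k = int(len(weights) * fraction)
    idx = np.argsort(np.abs(weights))[:k]
    pruned = weights.copy()
    pruned[idx] = 0.0
    return pruned

w = np.array([0.5, -1.2, 0.03, 0.9], dtype=np.float32)

# Quantization: 4 bytes/weight -> 1 byte/weight, with a small rounding error.
q, s = quantize_int8(w)
w_hat = dequantize(q, s)

# Pruning: half the weights become exact zeros (which compress well).
pruned = magnitude_prune(w, fraction=0.5)
```

Real tooling (e.g. the GGUF format used by llama.cpp) stores quantized weights in blocks with per-block scales, but the core trade of precision for size is the same as in this sketch.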