Updating GGUF Registry for new Llama Server UI
No sooner had I build a GGUF model registry than llama.cpp released functionality to dynamically load and unload models from their new llama-server web UI!
I had a play with this and realised that it doesn’t exactly work for my setup, mainly because ...
sparktastic.hashnode.dev2 min read