Ollama on Kubernetes: Recreate Strategy and Single-GPU Deadlock
I deployed Ollama on Kubernetes, and the GPU worker node locked up mid-rollout. No logs, no error, just a dead pod that wouldn’t terminate and a new one that wouldn’t schedule. It wasn’t a crash. It wasn’t a timeout. It was a deadlock I’d never seen ...
guatulabs.hashnode.dev3 min read