Running the Llama 3.1 8B Large Language Model Cheaply on Google Cloud Kubernetes
This is another short post about my pet project, Shortlist. The project's main aims were to help me pass my CKAD exam (which, luckily, I did) and to give me some exposure to operating LLMs in the cloud. I wanted to do this with as small an impact on my ba...
simoncrowe.hashnode.dev · 5 min read