Fundamentals of Large Language Models: Understanding LLM Architectures
What is an LLM?
An LLM (Large Language Model) is fundamentally a probabilistic model that predicts distributions over vocabulary tokens. At its core, an LLM understands a fixed set of words called a vocabulary and assigns probabilities to each word a...
ryan-giggs.hashnode.dev6 min read