Text Decoding Strategies: Greedy, Beam Search, and Sampling
TLDR: An LLM doesn't "write" text — it generates a probability distribution over all possible next tokens and then uses a decoding strategy to pick one. Greedy, Beam Search, and Sampling are different rules for that choice. Temperature controls the c...
abstractalgorithms.dev · 14 min read
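
To make the TLDR concrete, here is a minimal sketch of the "distribution, then a rule for choosing" idea. The vocabulary, the logit values, and the helper names (`softmax`, `greedy_pick`, `sample_pick`) are all hypothetical and chosen for illustration; no real model is involved. Temperature is assumed here to be the usual scaling factor applied to the logits before softmax. Beam search is left out to keep the sketch short.

```python
import math
import random

def softmax(logits, temperature=1.0):
    """Turn raw scores into probabilities; temperature divides the logits first."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def greedy_pick(probs):
    """Greedy decoding: always take the index of the most probable token."""
    return max(range(len(probs)), key=probs.__getitem__)

def sample_pick(probs, rng):
    """Sampling: draw one index at random, weighted by the probabilities."""
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]

# Hypothetical four-token vocabulary and made-up next-token logits.
vocab = ["cat", "dog", "fish", "bird"]
logits = [2.0, 1.0, 0.5, 0.1]

probs = softmax(logits)
greedy_token = vocab[greedy_pick(probs)]

rng = random.Random(0)  # seeded so repeated runs draw the same token
sampled_token = vocab[sample_pick(probs, rng)]

# Lower temperature concentrates mass on the top token; higher spreads it out.
sharp = softmax(logits, temperature=0.5)
flat = softmax(logits, temperature=2.0)

print("greedy:", greedy_token)
print("sampled:", sampled_token)
print("top-token probability at T=0.5, 1.0, 2.0:", sharp[0], probs[0], flat[0])
```

Greedy always returns the same token for the same distribution, while the sampled token can change with the random seed. Comparing the top-token probability across the three temperatures shows how the same logits can yield a sharper or flatter distribution to choose from.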