Let's Build GPT-2 from Scratch
Jan 1 · 23 min read

Large Language Models are absurdly good right now. Every week there’s a new release from a different lab pushing the state of the art: better reasoning, stronger coding, cleaner instruction-following, you name it. But under the hood, a huge portion of...