Transformers from scratch
Jan 7 · 13 min read · We will go through the paper: Attention is All You Need and build out a transformer model from scratch. I will try to explain everything in a bottom-up approach, moving from raw text to complete architectural blocks. When I train this transformer mod...
Join discussion

