RTRetzam Tarleinretzam.hashnode.dev·Apr 13 · 6 min readIntroduction to Generative Pretrained Transformers (GPT)Hello 👽, We are here now. It only took us 50 chapters to get to GPT! From our first chapter: AI Series, we had a clear roadmap, which you can check here 👉🏼 retzam-ai-roadmap. We've come a long way,10
RTRetzam Tarleinretzam.hashnode.dev·Apr 6 · 9 min readThe true cost of training and inference for state-of-the-art Large Language ModelsHello 👽, I was talking with a friend of mine recently, an AI researcher; He told me that a professor had told him how impressive training the first Google DeepMind, AlphaZero, in 2017 was to win the 00
RTRetzam Tarleinretzam.hashnode.dev·Mar 30 · 6 min readBERT, Dense Passage Retrieval, and how RAG was inventedHello 👽, Welcome to another mid-series special 🎉. I know I said we're done with BERT, but this is truly insightful 🤗, enjoy. While pretraining and fine-tuning the BERT (Bidirectional Encoder Repres00
RTRetzam Tarleinretzam.hashnode.dev·Mar 23 · 7 min readCode: 20 Million Parameter BERT Model For SQuAD TasksHello 👽, Finally, we'll pretrain and finetune a BERT (Bidirectional Encoder Representations from Transformers) model from scratch in this chapter, brace up 🦾 Before we proceed, I want to believe you00
RTRetzam Tarleinretzam.hashnode.dev·Mar 9 · 6 min readThe Distillation Dilemma: Anthropic, OpenAI, and the Security Predicament for Frontier AI LabsHello 👽, This is going to be maybe the first back-to-back mid-series special, enjoy it while it lasts 😁. Let's do what we've come here to do 🕶️ A few weeks ago, Anthropic made a post on Twitter (I 00