Tag feed

#pre-training

6 posts0 followers

Explore Hashnode

Alternatives

Trending tags this week

AAAbstract Algorithmsabstractalgorithms.hashnode.devMar 9 · 15 min read

A Guide to Pre-training Large Language Models

TLDR: Pre-training is the phase where an LLM learns "Language" and "World Knowledge" by reading petabytes of text. It uses Self-Supervised Learning to predict the next word in a sentence. This creates

0

RTRetzam Tarleretzam.hashnode.devFeb 9 · 6 min read

BERT and GLP

Hello 😊, We learned the core philosophy behind BERT in the introductory chapter of our previous series chapter. In this chapter, we’ll learn more about BERT and another pivotal process in Machine Learning known as General Language Pre-training, GLP....

0

KKashifblog.ifkash.devFeb 8 · 5 min read

I Pretrained a 360M LLaMA-Style Language Model from Scratch on 6B FineWeb Tokens (Single H100)

Pretraining an LLM from scratch usually sounds like “big-lab-only” territory. I wanted to test how far a smaller, practical setup can go while keeping the process transparent and reproducible. This post documents an end-to-end run of training a ~360M...

0

AHAnni Huanghuanganni.hashnode.devAug 13, 2025 · 2 min read

Beyond Pre-training: The Power of RLHF in LLM Alignment

Pre-training uses massive datasets and computational resources—often thousands of GPUs running for weeks or months—making it a domain dominated by top AI companies. Post-training is much lighter in cost and time (often days instead of months) and foc...

0

JJayasrij-jayasri.hashnode.devApr 16, 2025 · 6 min read

LLM Pre-Training in GenAI

In this series, we’ll explore everything about pre-training in the Generative AI (GenAI) pipeline — including how models are trained based on specific objectives or goals, and how to design an effective pre-training pipeline. But before diving into ...

0

RJRafal Jackiewiczjackiewicz.hashnode.devOct 18, 2024 · 10 min read

Training GPT-2 From Scratch: A Beginner-Friendly Step-by-Step Guide

Training a GPT-2 model from scratch is a rewarding experience, especially if you want to learn about natural language processing and get hands-on with machine learning models. This guide will walk you through the process step-by-step, with simplified...

0

#pre-training

Search Hashnode

#pre-training

Explore Hashnode

Trending tags this week

A Guide to Pre-training Large Language Models

BERT and GLP

I Pretrained a 360M LLaMA-Style Language Model from Scratch on 6B FineWeb Tokens (Single H100)

Beyond Pre-training: The Power of RLHF in LLM Alignment

LLM Pre-Training in GenAI

Training GPT-2 From Scratch: A Beginner-Friendly Step-by-Step Guide