Feed
Pro
Search

Author

Write
Drafts

Bug0 - The AI-native e2e QA regression testing Bug0 Browsers - Cloud Chromium on demand, per-minute, live preview Passmark - The open-source AI framework for regression testing Hackathons Changelog Brand @hashnode on X Hashnode on LinkedIn Support - hello+support@hashnode.com Code of Conduct Terms Privacy Sitemap
Sign in

Tag feed

#post-training

3 posts·0 followers

Articles

Trending tags this week

Search Hashnode

Search posts, tags, users, and pages

Tag feed

#post-training

3 posts·0 followers

Articles

Trending tags this week

Explore Hashnode

Alternatives

Hashnode vs Medium
Hashnode vs WordPress
Hashnode vs Ghost
Hashnode vs Substack
Hashnode vs Notion
Hashnode vs Dev.to
All alternatives

Changelog
Sitemap
Terms
Privacy

KKashifinblog.ifkash.dev·Feb 25 · 8 min read

Teaching Llama 3 to be Polite

The Objective The goal of this project was to take a powerful open-source Large Language Model (LLM) and instill a strict behavioral constraint: the model must politely decline to answer any request t

AHAnni Huanginhuanganni.hashnode.dev·Aug 13, 2025 · 2 min read

Beyond Pre-training: The Power of RLHF in LLM Alignment

Pre-training uses massive datasets and computational resources—often thousands of GPUs running for weeks or months—making it a domain dominated by top AI companies. Post-training is much lighter in cost and time (often days instead of months) and foc...

AHAnni Huanginhuanganni.hashnode.dev·Aug 14, 2025 · 2 min read

How to change a base model to a reasoning model?

Turning a base model into a reasoning model is essentially a post-training + data problem. The model’s architecture can stay the same — what changes is how it’s fine-tuned, what data it sees, and what training objectives you use. Here’s the typical p...