Tag feed

#finetuning

201 posts4 followers

Explore Hashnode

Alternatives

Trending tags this week

AJAman Jaincurious-pm.hashnode.devJul 21 · 7 min read

Comparing Fine-Tuning Strategies: Which Parts of a GPT Model Should We Update?

TL;DR I compared four ways of fine-tuning a small GPT model for SMS spam classification: Train only the classification head Train the final transformer block Train the final half of the transformer

0

Mmatthewtruong81matthew-codes.hashnode.devJul 8 · 6 min read

RAG or Fine-Tuning? An AI Development Services Comparison

Quick answer: RAG (Retrieval-Augmented Generation) enables the language model to retrieve the most up-to-date information directly from the live knowledge base at query time without retraining. Fine-t

0

ASAdi Shikopeningthehood.hashnode.devJul 7 · 8 min read

APIs Were the First Layer. Training Is the Next One

We all see the world changing. A few years ago, AI was not really accessible to most builders. The important research existed long before ChatGPT, including the 2017 paper Attention Is All You Need, w

0

NSNeeloppher Syedneeloppher.hashnode.devMay 30 · 5 min read

Fine-Tuning Qwen2.5-0.5B to Write SRE Post-Mortem Summaries

Writing post-mortem root-cause summaries is time-consuming and inconsistent. Junior SREs miss contributing factors. Senior SREs write summaries that vary in depth and structure. Zero-shot LLMs produce

0

HAHarsh Agaleharshagale.hashnode.devMay 22 · 5 min read

How I Fine-Tuned Llama 2 Using QLoRA on Free GPU Resources

Training and fine-tuning Large Language Models (LLMs) is often considered expensive and hardware-intensive. Most tutorials online assume access to powerful GPUs with large amounts of VRAM, which can b

0

AKAbijah Kajabikablog.abijah.meMay 9 · 9 min read

Teaching a Small LLM to Design Electronic Circuits: Fine-Tuning Qwen3-4B on 100K KiCad Netlists

How I built a 100K-example dataset of executable circuit netlists and fine-tuned a 4B parameter model that scores 88% on functional circuit generation, rivaling GPT-4o's published benchmarks. The Pro

0

SRStephane Royflexai.hashnode.devMay 8 · 15 min read

How to Use EasyR1 for Reinforcement Learning on FlexAI

EasyR1 is a reinforcement learning fine-tuning framework that supports GRPO, DAPO, and REINFORCE for reasoning-focused post-training. Use it when SFT starts plateauing on tasks like math, code, or log

0

HHemaNhema-sdet.hashnode.devApr 29 · 4 min read

Fine‑Tuning Isn’t Optional: How QA Engineers Make AI Models Production‑Ready

As part of my daily learning journey in AI, today I focused on fine‑tuning foundation models and quickly realized something important: Most AI models fail not because they are weak — but because they

0

RSRahul Sehrawatai-zero-to-hero.hashnode.devApr 12 · 11 min read

Finetune vs Prompt vs Retrieve — When to Use What

Here is the single most common mistake I see teams make when they start building with LLMs. Someone says "the model doesn't know about our stuff." A smart-sounding engineer nods and says "we should fine-tune it on our data." A project gets scoped. A ...

0

RSRahul Sehrawatai-zero-to-hero.hashnode.devApr 12 · 11 min read

Pretraining, Finetuning, RLHF — The Three-Act Training Story

Here is a question you have probably never thought to ask. When you talk to ChatGPT or Claude, and the thing on the other end is unfailingly polite, stays on task, refuses things that seem dangerous, and generally behaves like an assistant — where di...

0

#finetuning

Search Hashnode

#finetuning

Explore Hashnode

Trending tags this week

Comparing Fine-Tuning Strategies: Which Parts of a GPT Model Should We Update?

RAG or Fine-Tuning? An AI Development Services Comparison

APIs Were the First Layer. Training Is the Next One

Fine-Tuning Qwen2.5-0.5B to Write SRE Post-Mortem Summaries

How I Fine-Tuned Llama 2 Using QLoRA on Free GPU Resources

Teaching a Small LLM to Design Electronic Circuits: Fine-Tuning Qwen3-4B on 100K KiCad Netlists

How to Use EasyR1 for Reinforcement Learning on FlexAI

Fine‑Tuning Isn’t Optional: How QA Engineers Make AI Models Production‑Ready

Finetune vs Prompt vs Retrieve — When to Use What

Pretraining, Finetuning, RLHF — The Three-Act Training Story