Feed
Pro
Search

Sign in
FactoryKit - the AI software factory: tasks in, pull requests out Bug0 - The AI-native e2e QA regression testing The foreword by Hashnode - official blog from the Hashnode team Passmark - The open-source AI framework for regression testing Hashnode gql skill - let your AI agent publish to your Hashnode blog Hackathons Changelog Brand @hashnode on X Hashnode on LinkedIn Support - hello+support@hashnode.com Code of Conduct Terms Privacy Sitemap

Search Hashnode

Search posts, tags, users, and pages

FeedDiscussion

Stephane Roy

May 8

How to Use EasyR1 for Reinforcement Learning on FlexAI

EasyR1 is a reinforcement learning fine-tuning framework that supports GRPO, DAPO, and REINFORCE for reasoning-focused post-training. Use it when SFT starts plateauing on tasks like math, code, or log

flexai.hashnode.dev15 min read

#easyr1 #reinforcement-learning #finetuning #llm-fine-tuning

Responses

No responses yet.