© 2026 LinearBytes Inc.
Search posts, tags, users, and pages
Stephane Roy
EasyR1 is a reinforcement learning fine-tuning framework that supports GRPO, DAPO, and REINFORCE for reasoning-focused post-training. Use it when SFT starts plateauing on tasks like math, code, or log
No responses yet.