Supercharge Your AI Agents: Meet RLinf, the Next-Gen Infrastructure for Large-Scale RL
Quick Summary:
RLinf is an open-source infrastructure for post-training foundation models using reinforcement learning. It provides a flexible and scalable framework with features like macro-to-micro flow, flexible execution modes (collocated, dis...