Search Hashnode

Search posts, tags, users, and pages

Discussion on "On the Generalization of SFT: A Reinforcement Learning Perspective with RewardRectification" | Hashnode