TZTianyi Zhangintianyicode.hashnode.dev·Apr 20, 2025 · 8 min readScaling RL Environments for SWE AgentsTL;DR Automatically building SWE bench-like environments can unlock RL training, potentially very impactful. My initial experiments achieved a 85% success rate reconstructing instances from SWE bench repos with Claude 3.7. I am open-sourcing everyth...00