Scaling RL Environments for SWE Agents
TL;DR
Automatically building SWE-bench-like environments can unlock RL training, which could be very impactful. My initial experiments achieved an 85% success rate reconstructing instances from SWE-bench repos with Claude 3.7. I am open-sourcing everyth...
tianyicode.hashnode.dev · 8 min read