Contrastive RL: A Step-by-Step Guide to Learning Reachability
The paper "1000 Layer Networks for Self-Supervised RL" won Best Paper at NeurIPS 2025, and for good reason. It demonstrates that goal-conditioned RL can scale to 1000-layer networks—something previously thought impractical. But the real insight isn't...
proximal.hashnode.dev · 8 min read