Echo-2: Unlocking the Second Scaling Law
TL;DR:
Problem: High training costs stall research iteration. Post-training a 30B-parameter model costs thousands per run on standard cloud providers.
Solution: Echo-2 decouples centralized learning from distributed r
gradient.network · 7 min read