Building RunHarness — quality baseline harness for automated AI agent evaluation
Joined June 2026
About
Building RunHarness — an open evaluation harness for testing and benchmarking AI agents. Focused on reproducible quality baselines for automated LLM evaluation.