Eval Sets for PMs: the Artifact Engineering Cannot Own Alone
Apr 13 · 17 min read · Here is a pattern I have watched break half a dozen AI projects. The PM writes the PRD. It includes a line that says "quality bar: 85% pass rate on eval set." Engineering builds the feature. Engineering builds the eval set — because the PM is busy an...
Join discussion

















