AI Evaluation Basics: Why a 98% Score Doesn't Mean What You Think
If Netflix says a show is a 97% match for you, why do you still hate it 10 minutes in?
That's not a broken algorithm. That's a score, and scores don't always match reality.
AI has the same problem.
H
changeofbasis.hashnode.dev4 min read