The Illusory Confidence of AI
Imagine asking for someone's birthday.
If the model answers “September 10,” it has a 1/365 chance of getting it right.
If it answers “I don’t know,” the score is zero.
In large-scale testing, the model that dares to answer seems better than the cauti...
deepencanvas.hashnode.dev2 min read