We logged every rejected tool call for a month. A third were our validation being wrong, not the model.
TL;DR: Everyone logs tool calls that error or return junk. We started logging the calls our own validation REJECTED before they ever ran. Over a month, about 1 in 3 of those rejections were false: a v
jamesoconnor.hashnode.dev3 min read