Discussion on "One Token to Fool LLM-as-a-Judge" | Hashnode

Kiran Kumar

GEN AI

Oct 11, 2025

One Token to Fool LLM-as-a-Judge

exposes a major vulnerability in how large language models

1. Problem Background

Modern AI training often uses LLMs as judges: instead of humans evaluating model answers, another LLM assigns a score (reward). Example: “Given a question, a mod...

dlwithkiran.dev · 4 min read

Responses

No responses yet.