Discussion on "Smaller is Better: Replacing GPT-4o-mini with a 7B Local Judge" | Hashnode

Duane Nielsen

Obsessed with RL

Feb 5

Smaller is Better: Replacing GPT-4o-mini with a 7B Local Judge

I expected the 30B model to be the better judge. It wasn't. When I set out to replace OpenAI's GPT-4o-mini as the judge for the Oolong benchmark, my plan was simple: use the biggest local model I had. Qwen3-coder at 30B parameters seemed like the obv...

proximal.hashnode.dev (4 min read)

#ai #benchmarks #llm #machine-learning #ollama

Responses

No responses yet.