Mundher Al-Shabi, PhD

I design and build machine learning models and systems.

Jun 29, 2025

The Messy Reality of Evaluating GenAI Systems

For years, evaluating traditional machine learning models, while never simple, followed a well-trodden path. Your team knew the drill: assemble a labeled dataset, define success with metrics like precision and recall, and track performance. The core ...

mundher.com · 7 min read
