Joshua Marker

Oct 8, 2024

Insisting On Known Knowns: Using Evaluators to Drive Reliability

LLMs are heuristic and opaque. Without the ability to selectively measure elements of correctness, you can’t be sure your system is behaving. Every GenAI project starts with a phase of looking at indi

engineering.fractional.ai · 6 min read

#generative-ai #ai #llm
