© 2026 LinearBytes Inc.
Search posts, tags, users, and pages
David Hahn
10+ years in software engineering, with deep expertise in frontend. Now going deep on LLMs — streaming, RAG, tool use, and everything in between.
Automated evaluation using an LLM sounds like an elegant solution until you understand its failure modes. The model playing the role of a teacher grading work has four well-documented ways to get it w
No responses yet.