14h ago · 13 min read · In Part 1, we established that "AI" is a goal, not a method, and that Machine Learning is the dominant approach for reaching that goal today. We said ML systems learn from data instead of following han
16h ago · 4 min read · I built a public World Cup prediction arena for 12 AI models. The fun question is: which model predicts football best? The engineering question is better: which model stays calibrated under uncertaint
17h ago · 3 min read · I started this project thinking EDA would be the quick part. Open the dataset, run some plots, move on to modeling. That was the plan. Three days later I was still in EDA, had dropped 11 features, rew
1d ago · 3 min read · TL;DR: I compared the main LLM-as-judge tools (DeepEval's G-Eval, Confident AI, Evidently, Braintrust, Promptfoo, and MLflow) on the axis that actually decides whether the scores mean anything: how we