blog.anirudha.devSkills, Agents, and the Missing MiddleIf you’ve been building with LLMs for a while, you’ve probably built an agent at some point. You give the model a role. You add a few tools. You maybe throw in memory, some planning logic, and a carefully tuned system prompt. It works. Until it doesn...Jan 9·4 min read
blog.anirudha.devBuilding AI Tools You Can TrustYou want to build an AI application. Something useful. Something your users can rely on. But here's the problem: How do you know it's actually good? You can build an app that generates summaries, answers questions, or writes emails. It works. Your te...Nov 16, 2025·14 min read
blog.anirudha.devTeaching AI to Grade Other AIIf you’ve been following the world of AI development, you might’ve heard the phrase “LLM-as-Judge.”It sounds dramatic, like some sci-fi overlord where one AI passes judgment on another. But it’s actually one of the most important evolutions in evalua...Nov 9, 2025·10 min read
blog.anirudha.devUnit Tests for IntelligenceA few months ago, as I was exploring machine learning while working on a project, one of my models kept behaving in a weird way. I had built a classifier to detect cats in images. During training, accuracy was awesome, near 99%. But in production, it...Oct 26, 2025·5 min read
blog.anirudha.devRun your code anywhereLet me tell you about Vani. She's a web development instructor teaching Node.js to a class of 30 students. Every semester, she spends the first two weeks just getting everyone's environment set up. Some students have Windows, others have Mac, and a f...Oct 12, 2025·12 min read