Jul 24, 2025 · 14 min read · Introduction I have been tinkering with LLMs at work and outside now for quite a while and one of the most pressing issues compared to traditional machine learning is the unsolved problem of how to evaluate them. Evaluating LLM outputs is exponential...
SSebastian commented
Jul 13, 2025 · 4 min read · With the advent of LLMs in the recent few years, we stumbled upon something called Prompt Engineering - let’s speak a few words about it to set the context right and then let’s venture into DSPy What is Prompt Engineering? Prompt Engineering is the ...
Join discussion
Jul 2, 2025 · 10 min read · 💡 This blog post draws inspiration from Andrej Karpathy's insightful talk, "Software is Changing" which I highly recommend watching! It offers a fascinating perspective that will broaden your understanding of the topic. Those who watched Andrej's ...
Join discussion
Mar 27, 2025 · 5 min read · What is DSPy DSPy(Declarative Self-Improving Language Programs) is a framework for building AI applications. Unlike its counterparts i.e. Langchain, Llamaindex, it emphasizes programming the LLM model over prompt engineering. In this tutorial, you wi...
Join discussion
Mar 12, 2025 · 3 min read · 📝 Quick Summary: LangWatch is an LLM Ops platform designed to monitor, experiment with, and optimize LLM pipelines. It offers features such as a drag-and-drop optimization studio, quality assurance tools with evaluators and dataset management, and m...
Join discussionJan 20, 2025 · 3 min read · The Problem: Model Lock-In Is a Jail of Your Own Making After months of training, our fact-checking intern, Chatty, is now a star. But then your boss says: “Great! Now port this to Claude. And Llama. And the new Google model next week.” Manual Approa...
Join discussion
Jan 20, 2025 · 4 min read · The Problem: Language Models Are Like Overeager Interns Imagine you’ve hired a brilliant but chaotic intern named "Chatty" (your language model). You ask Chatty to draft an email. It writes a Shakespearean sonnet. You say, “No, just a meeting reminde...
Join discussion
Jan 20, 2025 · 4 min read · The Problem: AI’s Greatest Flaw (It Thinks It’s Aristotle) Our fact-checking intern, Chatty, has a fatal flaw: it never checks the bookshelf. Ask it “Did Napoleon own a pet kangaroo?” and it’ll confidently answer:“No, kangaroos are native to Australi...
Join discussion
Jan 20, 2025 · 3 min read · The Problem: AI’s Midlife Crisis Our fact-checking intern, Chatty, has grown reliable… until it meets ambiguity or trolls. Example 1: “Some say Earth is flat.”Chatty panics: “False! The Earth is round… but maybe those ‘some’ are onto something?” Exam...
Join discussion