May 21 · 9 min read · Spent the better part of the last quarter building out an Agentic RAG system on top of a production Python full stack AI app. Standard RAG was not cutting it for our use case. Too much hallucination o
DNDevang and 1 more commented
Apr 13 · 4 min read · I reviewed post-mortems from teams deploying Retrieval-Augmented Generation (RAG) in production and found a consistent failure mode: RAG handles simple questions well but breaks down on ambiguous quer
Join discussion
Apr 4 · 19 min read · Three months into a contract with a mid-sized insurance company, I was sitting across from their CTO watching their "AI knowledge base" answer questions about their own products. The system retrieved the right documents 90% of the time. But on anythi...
Join discussion
Mar 1 · 8 min read · Agentic RAG represents an evolutionary leap in how AI systems retrieve and generate information. Unlike conventional RAG systems that execute a single retrieval-generation cycle, agentic rag introduce
Join discussionSep 17, 2025 · 3 min read · Agent teams (planner, retriever, ranker, synthesizer) thrive when latency is predictable, privacy is strict, and costs are controlled. That’s why I deploy them on Amazon EKS with vLLM as the serving layer: you keep data in-VPC, pin workloads to GPUs,...
Join discussionJun 27, 2025 · 4 min read · Why Agentic RAG? And Why Now? Vanilla Retrieval-Augmented Generation (RAG) systems work like diligent librarians: you ask a question, they fetch documents from a knowledge base, and generate a response. But here’s the catch—they rarely ask follow-ups...
Join discussionJun 21, 2025 · 10 min read · Retrieval-Augmented Generation (RAG) is becoming the go-to pattern for building AI systems that can fetch real-time or domain-specific knowledge on demand. But RAG alone doesn’t make your chatbot smart. With LangGraph, you can build stateful, agent-l...
Join discussion
Feb 25, 2025 · 18 min read · 🗝 TLDR: This tutorial shows how to build AI agents that orchestrate multiple knowledge sources (SQL databases, vector database search, web search) to answer complex healthcare queries. We use Azure SQL, Azure AI Search, and Bing Grounding Tool via A...
Join discussion