Feb 8 · 4 min read · Voice AI is everywhere today, but most of it still feels… foreign. Foreign accents. Foreign assumptions. Foreign ways of explaining things. While building for the Build with Bulbul challenge, I asked a simple question: What if India could listen to i...
Feb 6 · 4 min read · When traditional file transfer methods are locked down, sometimes the answer is literally staring you in the face. The Problem: Extracting Data from Fortress Environments Picture this: You're working in a highly secured environment. The network is co...
Feb 3 · 5 min read · DeepSeek OCR 2 is a 3B-parameter vision-language model released on January 27, 2026. Unlike classic OCR systems, which only extract text, this model understands document structure the way a human does: it reads in the logical...
Feb 3 · 21 min read · It's Been a While Sorry for being away for so long. I've been head-down strengthening my software engineering foundation—consuming rather than producing. After a while, I started feeling this pull to balance things out by building again. I just finis...
Feb 3 · 4 min read · Objective The objective is to empower users to upload invoice documents, in either PDF or image formats, and receive structured, machine-readable invoice data that is ideal for storage, reporting, and automation. This capability is crucial for busine...
Jan 29 · 3 min read · In the software development landscape of 2026, we’ve reached the limits of "Cloud-Everything." While centralized processing was the backbone of the last decade, the high latency and privacy risks of constantly shipping data to a remote server have be...
Jan 19 · 7 min read · For years, OCR has been treated as a solved problem. Extract text from an image, dump it into a file, and move on. But anyone who has actually built production systems knows the truth - real-world documents are messy. They are skewed, low resolution,...
Jan 19 · 1 min read · Paddle OCR is a powerful open-source Optical Character Recognition (OCR) solution designed to extract text from images and scanned documents with speed and high accuracy. Whether you’re processing invoices, KYC documents, IDs, forms, receipts, or mul...
Jan 7 · 2 min read · Late last year, I started exploring how to extract metadata from product drawings. Part numbers, material specifications, revision history, manufacturing process notes. The kind of information that lives in title blocks and needs to end up in a PLM d...