Apr 29 · 8 min read · There's a class of problem in document AI that sounds deceptively simple: look at a page, figure out what's on it. Not read the text. Not classify the document. Just answer: where is the table? where
Join discussion
Apr 18 · 4 min read · You try to extract text from a PDF file using JavaScript. Sometimes it works fine. Sometimes the output is empty or broken. This confuses many developers. The thing is that not all PDF files behave th
Join discussion
Mar 29 · 12 min read · Hook The first time I watched a clean-looking frame fail the chain, the problem was not blur, bad framing, or a weak prompt. The failure was more structural than that: the frame carried text, and that text was the thing that poisoned the next step. O...
Join discussion
Mar 25 · 3 min read · Every AI workflow that touches the physical world eventually needs OCR. Scanned contracts, handwritten notes, screenshots, receipts, whiteboards — if it's an image with text, you need a way to extract it for an LLM to work with. Part of our Business ...
Join discussionMar 24 · 8 min read · Last year, our team at pdf2text.ai hit a milestone: 10,000 scanned documents processed in a single month. Bank statements, invoices, receipts, tax forms — the full spectrum of messy, real-world paper
Join discussionFeb 8 · 4 min read · Voice AI is everywhere today — but most of it still feels… foreign. Foreign accents.Foreign assumptions.Foreign ways of explaining things. While building for the Build with Bulbul challenge, I asked a simple question: What if India could listen to i...
Join discussion