Tag feed

#ocr

151 posts · 12 followers

seafon · seafon.hashnode.dev · 17h ago · 6 min read

From OCR Heuristics to Quality Gates: Testing Receipt Totals Before Release

OCR does not fail in one neat, predictable way. A receipt may contain a subtotal, a rounded cash amount, a payment amount, a zero-change line, an order balance, and a marketing or account-status total

Talha Amin · smartformatter.hashnode.dev · 5d ago · 4 min read

PNG to Text OCR: Extract Data with Tesseract & Python Meta

Extracting Text from PNGs: The Developer's Guide to OCR Optical Character Recognition (OCR) translates raster image pixels into machine-readable strings. When working with PNGs—especially screenshots

Ralph Pecayo · masteralph.hashnode.dev · Jul 10 · 3 min read

Building a Private AI Memory OS for Android

Have you ever remembered seeing a specific piece of information on your screen but completely forgot which app it was in? The standard industry solution to this involves sending your screen data to th

Bhavin Sheth · freecodecamp.org · Jul 7 · 23 min read

How to Build a Browser-Based PDF OCR to Text Converter Using JavaScript

Not every PDF contains searchable or editable text. Many PDFs are simply scanned images of documents such as invoices, contracts, books, receipts, government forms, and handwritten notes. While these

Mediavox · mediavox.hashnode.dev · Jul 6 · 3 min read

Extracting structured data from invoices and contracts with one API call

I've been working on a document analysis API and wanted to share a pattern that saved me from writing custom parsers for every document type my clients throw at me. The problem If you work with LATAM

AZAPI · aadhaar-ocr-api.hashnode.dev · Jul 1 · 8 min read

Building a Scalable Invoice Parsing API for Multi-Page Invoices Using AI and OCR

Processing invoices sounds simple until you start working with real-world documents. Some invoices contain just one page with a few products, while others span multiple pages with hundreds of line ite

Sanskriti Harmukh · vultr.hashnode.dev · Jun 23 · 3 min read

Deploying Paperless-ngx Open-Source Document Management System on Ubuntu 24.04

Paperless-ngx is an open-source document management system that converts scans and PDFs into a fully searchable archive using Tesseract OCR, with tags, custom fields, and automated processing rules. T

Sarwar Hossein · webequipe.hashnode.dev · Jun 23 · 6 min read

Text-Based PDFs vs Scanned PDFs: Why Search Works Differently for Each

PDF search sounds simple at first. You upload a PDF, extract the content, and make it searchable. But once you start working with real documents, you quickly realize that not every PDF behaves the sam

Fox · deepfox.hashnode.dev · Jun 19 · 6 min read

The hard part of national ID OCR isn't the OCR

You wire up OCR for your KYC flow, point it at a national ID card, and get back a clean { name, idNumber, dateOfBirth }. Ship it. Then you onboard your second country — and it falls apart. Fields you

MLAI Digital · mlaidigital.hashnode.dev · Jun 18 · 10 min read

Using LLMs as OCR? Read This First | MLAI Digital

Introduction: When “AI Can Read Anything” Goes Wrong The use case of AI document extraction is among the most popular and discussed use cases of modern AI systems. As large language models profess to
