#vision articles | Hashnode

Damien Alleyne · blog.alleyne.dev · Jul 8 · 13 min read

My Code, My Test, and My Prompt All Agreed. All Three Were Wrong.

A friend handed me a grocery receipt to test my receipt-scanning app, Receipt Tracker, which turns a photo into categorised line items. It was a dense Massy Stores run, 37 items, and the photo wasn't

niixo · niixolabs.hashnode.dev · May 8 · 2 min read

A braille reader that works entirely on your phone — how we built TenjiScan

Why we built it: Braille appears constantly in urban Japan: train station handrails, elevator buttons, medication packaging. It's designed to be read, but the vast majority of people can't read it. We wanted to close that gap with the simplest possib...

Rahul Sehrawat · ai-zero-to-hero.hashnode.dev · Apr 12 · 10 min read

Multimodal AI — Models That See, Hear, and Act

Until recently, most AI systems were monolingual in a very specific sense: each system worked on one kind of input. Speech recognizers took audio and produced text. Image classifiers took pictures and produced labels. Translation systems took one lan...

Rahul Sehrawat · ai-zero-to-hero.hashnode.dev · Apr 13 · 10 min read

Multimodal in Practice: Images In, Structured Data Out

Every multimodal AI demo you see on a conference stage has the same shape. A person holds up a picture of their fridge. The model says "you have eggs, milk, two bell peppers, and some leftover Thai food." The crowd applauds. The demo ends. Nobody shi...

Yash Tyagi · cadence.hashnode.dev · Feb 16 · 8 min read

Your Camera Doesn't Know This.

Your ATE Is 59 Meters And You Can't Figure Out Why. What a broken evaluation teaches you about the one thing cameras will never know. The trajectory shape looks perfect. The depth maps look great. The network converged smoothly. But the evaluation pri...

Yogesh Bhawsar · yogeshbhawsar.com · Feb 11 · 4 min read

Real-Time Inventory Management with Vision LLMs

Manual inventory management is a pain. Staff manually counting items, typing product names, and dealing with inevitable typos - it's slow, error-prone, and often means your inventory records are outdated before they're even saved. But what if you cou...

Yuheng LIANG · latenighttavern.hashnode.dev · Jan 31 · 5 min read

Taste Aligner (2): Embedding + Vision + Ontology (V1) Implementation and Acceptance Notes

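Goal: document the complete "minimal verifiable chain" of Ontology (tag ontology) → Vision (tag extraction) → Embedding (TES vector generation), as a record for blog publication and later iterations. Scope: this post covers only the engineering implementation of V1 (predictable, replayable, verifiable); it does not discuss the full business logic (Recommendation / Planner / UI), nor connecting Vision to a real large model (left for V2). 0. Current project structure and conventions 0.1 Key repository directories gateway/: Java Gateway (unified entry point, explicit routing, Fea...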

EyeLens Hospital · blogsbyeyelens.hashnode.dev · Jan 23 · 3 min read

Glasses Number in Children: What Parents Should Know and When to Worry

Today, more parents than ever are searching for answers about glasses number in children and child eyesight problems. Many come worried and say, "My child is so young, why does my child need glasses?" This concern is completely natural. Seeing your chi...

Satyakam Goswami · satyakam.dev · Jan 12 · 3 min read

How Does India Cook Biryani?

Today I don't want to talk about ML/AI; a lot has already been said and the Internet is full of the buzz. December 17-20, an IIIT Hyderabad team presented a paper with the same name as the title of this blog, and it went viral for the eye-catching and easy ...

Abdirashid Omar Matan · simpleml.hashnode.dev · Sep 20, 2025 · 9 min read

From Pixels to Predictions: Linear Regression in Computer Vision

Introduction: Linear regression is like drawing a straight line that best fits the data points. In computer vision, we can use this simple idea to predict things from images. Even though it is a fundamental model, it helps us understand how machines...
