@Leenamalhotra

Leena Malhotra

@Leenamalhotra

where tech meets humanization

Joined July 2025

About

Nothing here yet.

Available for

Nothing here yet.

Leena Malhotra's blogs

techwithleenatechwithleena.hashnode.dev92 posts

Articles Threads1 Comments

Recently published

LMLeena Malhotratechwithleena.hashnode.dev

Claude Opus 4.6 vs GPT-5 on Multi-Step Reasoning: Where Each One Starts to Fail

Mar 25 · 8 min read · Both models handle simple reasoning well. The gap opens when tasks have multiple dependent steps — and the failure type is different for each one. Key Takeaways Neither model dominates multi-step re

Join discussion

LMLeena Malhotratechwithleena.hashnode.dev

Debugging AI-Generated Code Across Different Models

Mar 18 · 11 min read · The bug was invisible to three different AI models before a human finally spotted it. I had asked Claude Opus 4.6 to write a function that parsed user-uploaded CSV files and extracted email addresses.

Join discussion

LMLeena Malhotratechwithleena.hashnode.dev

Why Consensus Matters More Than Confidence in AI Systems

Jan 19 · 6 min read · We are building our digital infrastructure on a fault line. The current generation of Large Language Models (LLMs) suffers from a specific, dangerous pathology: they are programmed to be confident, not correct. When you ask an AI a question, it does ...

Join discussion

LMLeena Malhotratechwithleena.hashnode.dev

The Failure Boundary Where LLM Reasoning Quietly Collapses

Jan 16 · 5 min read · Large language models feel impressive right up until they do not. The responses still look fluent. The structure still appears logical. But somewhere beneath the surface, reasoning quality drops. Assumptions blur. Constraints leak. The model keeps ta...

Join discussion

LMLeena Malhotratechwithleena.hashnode.dev

A Production Rule for Handling Model Uncertainty

Jan 15 · 5 min read · You are shipping gambling algorithms, not software. I look at the codebases of "AI-native" startups, and I see the same terrifying pattern. A developer makes an API call to an LLM. They get a response. They JSON.parse() it. And they push it to the fr...

Join discussion

Leena Malhotra

About

Available for

Leena Malhotra's blogs

Recently published

Claude Opus 4.6 vs GPT-5 on Multi-Step Reasoning: Where Each One Starts to Fail

Debugging AI-Generated Code Across Different Models

Why Consensus Matters More Than Confidence in AI Systems

The Failure Boundary Where LLM Reasoning Quietly Collapses

A Production Rule for Handling Model Uncertainty

Search Hashnode

Leena Malhotra

About

Available for

Leena Malhotra's blogs

Recently published

Claude Opus 4.6 vs GPT-5 on Multi-Step Reasoning: Where Each One Starts to Fail

Debugging AI-Generated Code Across Different Models

Why Consensus Matters More Than Confidence in AI Systems

The Failure Boundary Where LLM Reasoning Quietly Collapses

A Production Rule for Handling Model Uncertainty