@harshavardhan

Harshavardhanan

Joined August 2018

Architect In Role. Developer by Heart

Harshavardhanan's blogs

The Harsh Tech · blog.pragmaticbyharsh.com · 31 posts

Recently published

Harshavardhanan · blog.pragmaticbyharsh.com · Feb 15 · 8 min read

Anatomy of a Prompt — System, User, and Assistant Explained

You've used ChatGPT. You've typed questions, gotten answers, maybe even had it write code for you. But here's something most people never think about: every conversation you have with an LLM isn't just you talking to a model. There's a hidden layer s...

Harshavardhanan · blog.pragmaticbyharsh.com · Feb 10 · 11 min read

Choosing Embedding Models and Dimensions: Why 1536 Isn't Always Better Than 384

You're building a RAG system and need to pick an embedding model. The options are overwhelming: OpenAI, Voyage, Google, Cohere, or self-hosted open-source. Prices range from free to $0.13 per million tokens. Dimensions range from 256 to 3072. How do ...

Harshavardhanan · blog.pragmaticbyharsh.com · Feb 8 · 14 min read

What Are Embeddings and How Vector Similarity Actually Works

If you've ever wondered how AI "understands" that "king" is closer to "queen" than to "pizza," you're about to find out. And no, it's not magic, it's math. Specifically, it's embeddings and vector similarity. This is the foundation that powers semant...

Harshavardhanan · blog.pragmaticbyharsh.com · Feb 3 · 9 min read

How Tokenization Works: BPE and the Algorithm Behind Your LLM

Every time you send a message to GPT-4 or Claude, an algorithm from 1994 decides how much you'll pay. That algorithm is Byte Pair Encoding — BPE for short. It's not glamorous, but it's running under the hood of nearly every modern LLM. Once you under...

Harshavardhanan · blog.pragmaticbyharsh.com · Feb 1 · 9 min read

What Are Tokens and Why Your LLM Bill Depends on Them

"Hello" is 1 token. "你好" is 2 tokens. Same meaning. Double the cost. That little fact tripped me up when I first started working with LLMs. I assumed tokens were just... words. They're not. And that misunderstanding quietly inflates API bills everywh...
