Tag feed

#benchmarking

45 posts · 1 follower

Nipun Nair · nipunnair.hashnode.dev · 17h ago · 4 min read

Streaming Showdown: Kafka vs Redpanda vs NATS, benchmarked honestly

If you've ever had to pick a message broker or streaming platform, you've run into the same wall I did: every vendor comparison is written by the vendor, "benchmarks" quietly compare a tuned cluster a

Nipun Nair · nipunnair.hashnode.dev · 17h ago · 7 min read

Kafka vs Redpanda vs NATS on My M4 Mac: The Real Numbers

Part 2 of 3 in Streaming Showdown. Previous: the benchmark bug. Next: when I'd actually reach for each. Part 1 is about how my first latency numbers were an artifact of a burst-then-drain test design,

Nipun Nair · nipunnair.hashnode.dev · 17h ago · 6 min read

The Benchmark Bug That Made NATS Look 9x Faster Than Kafka

Part 1 of 3 in Streaming Showdown. Next: the real numbers and my test rig. I've been running a small side-by-side benchmark of Kafka, Redpanda, and NATS JetStream — same event schema, same message siz

Lucian (LKB) · lkforge.hashnode.dev · Jul 20 · 5 min read

Proving It's Actually Unbeatable: How I Benchmark a Game AI Before Publishing a Number

This was originally published on the LK Forge blog, where the charts are interactive and you can play the AI it talks about. "Unbeatable" is a testable claim, not a marketing word. Before that word go

Marco Mornati · blog.mornati.net · May 31 · 14 min read

Your AI Agent Deserves a Tool Harness, Not a Wild West

We started the same way everyone does: give the LLM access to everything and hope it figures it out. Connect the GitHub MCP, the Jira MCP, the internal product API MCP, throw in a database schema or t

Omnithium · omnithium.hashnode.dev · May 31 · 15 min read

The Enterprise AI Agent Performance Benchmark: How to Measure and Compare Agent Effectiveness

Why Current AI Agent Benchmarks Fail the Enterprise. Why do most AI agent benchmarks fail to predict what actually happens in your production environment? Because they measure the wrong things, in the

Neeloppher Syed · neeloppher.hashnode.dev · May 15 · 8 min read

ASR Evaluation Framework: Benchmarking Speech Recognition Models Across Accuracy, Speed, and Robustness

Picking an ASR model for production is not straightforward. Whisper might be the most accurate for general English but too slow for real-time use. Wav2Vec2 might be fast enough for edge devices but st

Entreelistiaq.hashnode.dev · May 8 · 6 min read

Benchmarking pgvector IVFFlat vs HNSW indexes for production RAG applications - Blog Post

I've spent the last three months stress-testing vector indexes in production environments, and the results challenge conventional wisdom about when to use each index type. ...

Nolan Voss · nolan-voss.hashnode.dev · Apr 30 · 7 min read

The 1,000-Message Test: A Benchmark for AI Memory That Most Apps Fail

Most apps that claim "memory" don't have it. I spent 200 days testing AI companion apps. 15 platforms, every subscription paid out of pocket. What I found, consistently, is that "memory" in marketing
