Tag feed

#mlops

840 posts304 followers

Explore Hashnode

Alternatives

Trending tags this week

ITIngero Teamingero.hashnode.dev11h ago · 6 min read

Read-Only Kernel Telemetry as MCP Tools: A Design Note

Design constraints (left) and the seven existing tool calls (right) for a read-only MCP server over an eBPF kernel trace database. TL;DR An MCP server over kernel telemetry has a specific shape: read-

0

Eedgepilot-4edgepilothq.hashnode.dev1d ago · 4 min read

Your Edge Fleet Can't Afford to Wait for the Cloud

Imagine you've deployed 200 computer vision models across a fleet of autonomous inspection drones. They're monitoring remote pipeline infrastructure — hours from the nearest city, zero cellular covera

0

NSNeeloppher Syedneeloppher.hashnode.dev3d ago · 15 min read

Dataset Factory: A Production-Grade Benchmark Dataset Factory for AI Agent Evaluation

Evaluating AI agents requires benchmark datasets that are high-quality, diverse, balanced, and free of duplicates. Building those datasets by hand is slow, inconsistent, and hard to reproduce. The Mer

0

TRThe rightmodeler teamrightmodeler.hashnode.dev3d ago · 3 min read

Why we built a model downgrade report before building a router

Author: The rightmodeler team The first version of our model cost tool does not route a single production request. At first, that can sound backwards. If the problem is expensive model calls, why not

0

AJAlen Joypragmaticstack.in4d ago · 15 min read

Your Eval Lied: The Architecture of a Real LLM Evaluation Program

🗓️ Last updated: July 2026 The team spent six weeks on a model upgrade. Every internal eval is green. The new model beats the old one on the golden set by four points. It beats it on the regression

0

SGSergio González Téllezevankhandev.hashnode.dev5d ago · 2 min read

Where does an AI model end, and where does an AI system truly begin?

SEED-006 PROBLEM Where does an AI model end, and where does an AI system truly begin? INSIGHT An AI Systems Engineer understands that the system is the entire state: data, software, models, infrastr

0

ISilya sergeevkrauncher.hashnode.dev5d ago · 5 min read

The Messy Real World, or Why Probability Decides What You Pay

Researchers pick GPUs the way a spec sheet invites them to: choose a card, read a number, pay the bill. But the real world doesn't behave like a number. It behaves like a cloud — and that cloud is thr

0

Aalexeyg377alexeygolev.hashnode.dev5d ago · 3 min read

The Limits of One Model Per Pod ML Platform

Every engineering platform is built around trade-offs. For several years the model serving platform followed a simple idea: one model, one Kubernetes pod. It was easy to understand, inexpensive to ope

0

SHSanskriti Harmukhvultr.hashnode.dev6d ago · 8 min read

Deploying ClearML as a GCP Vertex AI Alternative on Ubuntu

Google Vertex AI is Google Cloud's managed ML platform with experiment tracking, training jobs, pipelines, a model registry, and endpoints, but it locks you into GCP-specific APIs and per-use billing.

0

JVJason Vertreesjvertrees.hashnode.dev6d ago · 3 min read

The problem with "does this look right?"

LLM-as-a-judge means using one model to check another model's output — usually because manual review doesn't scale. It sounds like an obvious fix. In practice, a naive version of this check fails in t

0

#mlops

Search Hashnode

#mlops

Explore Hashnode

Trending tags this week

Read-Only Kernel Telemetry as MCP Tools: A Design Note

Your Edge Fleet Can't Afford to Wait for the Cloud

Dataset Factory: A Production-Grade Benchmark Dataset Factory for AI Agent Evaluation

Why we built a model downgrade report before building a router

Your Eval Lied: The Architecture of a Real LLM Evaluation Program

Where does an AI model end, and where does an AI system truly begin?

The Messy Real World, or Why Probability Decides What You Pay

The Limits of One Model Per Pod ML Platform

Deploying ClearML as a GCP Vertex AI Alternative on Ubuntu

The problem with "does this look right?"