Tag feed

#hitl

4 posts0 followers

Explore Hashnode

Alternatives

Trending tags this week

DBDnyandeo Bharambellmjudge.hashnode.devJun 7 · 6 min read

Building a production LLM Judge: lessons from the enterprise audit engine

When I was building the enterprise audit engine, the LLM Judge was the last thing I planned to add. It felt like over-engineering. The main agent already had MCP tool access to live device state, a po

0

MAManni Aroraagent-fight-club.hashnode.devDec 26, 2025 · 7 min read

Taught an AI to Attack Another AI. It Won 44% of the time — With No Backdoor.

What 100 automated battles taught us about why prompt guardrails aren't enough I built an AI attacker. I gave it one job: break an HR chatbot's rules and get it to approve unauthorized leave. Then I let them fight — 100 times, completely unsupervise...

0

SAShakil Ahmedshaq-ai-agent.hashnode.devNov 15, 2025 · 3 min read

The 5-Day Agentic Revolution: Building AI That Thinks, Acts, & Observes — Day 4: The Agent Quality Flywheel: Seeing Inside the Trajectory

Welcome to Day 4 of the 5-Day Agentic Revolution! We’ve built the Agent’s Brain (Day 1), given it Hands (Day 2), and granted it Memory (Day 3). Today, we tackle the most critical challenge for enterprise adoption: Trust. AI Agents, by nature, are non...

0

MHMichelle Hacundasquidcloud.hashnode.devAug 8, 2025 · 5 min read

Scaling AI Autonomy with Human-in-the-Loop Control

New AI tools are showing up constantly. One might help a support team resolve tickets faster. Another might automate basic contract review or help engineers generate code. More teams are exploring what is actually useful and what still needs human ju...

0

#hitl

Search Hashnode

#hitl

Explore Hashnode

Trending tags this week

Building a production LLM Judge: lessons from the enterprise audit engine

Taught an AI to Attack Another AI. It Won 44% of the time — With No Backdoor.

The 5-Day Agentic Revolution: Building AI That Thinks, Acts, & Observes — Day 4: The Agent Quality Flywheel: Seeing Inside the Trajectory

Scaling AI Autonomy with Human-in-the-Loop Control