Tag feed

#agentic-security

5 posts0 followers

Trending tags this week

A Declarative Schema for MCP Attacks: Why We Need One

Feb 13 · 9 min read · There are over 17,000 public MCP servers and there is no standardised way to test whether an AI agent can survive a malicious one. We have benchmarks for model safety. We have static analysis for tool

Join discussion

MAManni Aroraagent-fight-club.hashnode.dev

0

🛡️ Beyond Prompt Injection: The "Corrupted Intelligence" Attack and the Rise of Agentic Security

Jan 1 · 8 min read · TL;DR: In 2024, we worried about what LLMs said. Now, we worry about what AI Agents do. By testing GPT-4.1-mini vs. GPT-4.1-nano in a "Research & Update" workflow, I discovered a 40% hijack success rate for Indirect Prompt Injection on smaller models...

Join discussion

MAManni Aroraagent-fight-club.hashnode.dev

0

Taught an AI to Attack Another AI. It Won 44% of the time — With No Backdoor.

Dec 26, 2025 · 7 min read · What 100 automated battles taught us about why prompt guardrails aren't enough I built an AI attacker. I gave it one job: break an HR chatbot's rules and get it to approve unauthorized leave. Then I let them fight — 100 times, completely unsupervise...

Join discussion

RRRuben Rotteveelyourenterprisearchitect.com

1

Agent Architecture: Security & Trust

Sep 16, 2025 · 6 min read · Everyone I’ve spoken with about agents asks the same thing: “What about security?” The concern isn’t just technical, it’s governance. If an agent makes a mistake, who’s accountable? If it accesses data, which policies apply? In this article, I share ...

NNirav commented

#agentic-security

Search Hashnode

#agentic-security

Trending tags this week

A Declarative Schema for MCP Attacks: Why We Need One

🛡️ Beyond Prompt Injection: The "Corrupted Intelligence" Attack and the Rise of Agentic Security

Taught an AI to Attack Another AI. It Won 44% of the time — With No Backdoor.

Agent Architecture: Security & Trust