Tiny Guards: Defending Agents with Small 1-0.6B Models
Feb 11 · 11 min read

Prompt injections and their new rival

Prompt injection turns “trusted inputs” (emails, web pages, retrieved docs) into an execution surface. If your agent can browse, read email, or call tools, a buried instruction can hijack actions. The dirty secre...