Tiny Guards: Defending Agents with small 1-0.6B Models
Prompt injections and their new rival
Prompt injection turns “trusted inputs” (emails, web pages, retrieved docs) into an execution surface. If your agent can browse, read email, or call tools, a buried instruction can hijack actions.
The dirty secre...