Mar 22 · 4 min read · A single, well-crafted prompt can bring down even the most advanced language-model-based agent, as evidenced by the recent case where a popular chatbot was tricked into revealing sensitive user information with just five carefully designed interactions...
Mar 4 · 9 min read · Many successful jailbreaks based on language alone follow the same pattern, though it is rarely acknowledged as such. First, they sever the name. "Ignore all previous instructions", a crude banishment...
Feb 25 · 4 min read · A single, well-crafted adversarial input can bypass the language understanding capabilities of even the most advanced large language models (LLMs), allowing attackers to manipulate the output and compromise the entire AI system...
Feb 23 · 7 min read · In a shocking turn of events, a single chatbot was recently compromised by a multi-turn attack, resulting in a complete overhaul of its behavior, all without triggering any traditional security alarms...