Gradient-based Adversarial Attacks against Text Transformers
Abstract
We introduce the first general-purpose gradient-based attack targeting transformer models. Rather than focusing on finding a single adversarial example, we aim to discover a distribution of adversarial examples represented by a continuous ma...