AIRiskAware
AI Governance Glossary
Governance Concept

What Is Adversarial Example?

Adversarial Example is an input deliberately crafted with small, often imperceptible changes that cause an AI model to make a confident but wrong prediction.

Definition

Adversarial Examplean input deliberately crafted with small, often imperceptible changes that cause an AI model to make a confident but wrong prediction.

Adversarial examples exploit the gap between how a model "sees" data and how a human does — a tiny, targeted perturbation can flip a classification while looking unchanged to a person. They are a core concern for the robustness and security of high-stakes AI, and a reason adversarial testing is part of serious model evaluation.

Source: Machine-learning security research

Plain-language explanation

Adversarial examples exploit the gap between how a model "sees" data and how a human does — a tiny, targeted perturbation can flip a classification while looking unchanged to a person. They are a core concern for the robustness and security of high-stakes AI, and a reason adversarial testing is part of serious model evaluation.

Primary source: Machine-learning security research

Related terms

Robustness Data Poisoning AI Red Teaming Model Extraction

See where you stand on AI governance

Take the free 7-question maturity assessment and get a personalised action plan.

Free assessment — 3 minutes →