
What Is Model Extraction?

Model Extraction is an attack that reconstructs a model's parameters or replicates its behaviour by systematically querying it and observing the outputs.


Also called model stealing, this attack lets an adversary build a close copy of a proprietary model, or probe it for weaknesses, using only its API responses. It threatens intellectual property and can be a stepping stone to other attacks: for example, adversarial inputs crafted against the copy often transfer to the original. Because the attack depends on sending many queries, query monitoring and rate limiting feature in AI security controls.
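To make the idea concrete, the following is a minimal sketch, assuming a toy linear scoring model that returns raw numeric scores. The names `make_victim_api`, `extract_linear_model`, and `QueryBudgetExceeded` are hypothetical and exist only for this illustration. It shows how an attacker who sees nothing but outputs could recover the parameters with one query per feature plus one, and how a hard query cap can interrupt the attempt. Real attacks on nonlinear models typically need far more queries and yield only an approximation.

```python
from typing import Callable, List, Tuple


class QueryBudgetExceeded(Exception):
    """Raised when a caller uses more queries than the API allows."""


def make_victim_api(
    weights: List[float], bias: float, max_queries: int
) -> Callable[[List[float]], float]:
    """Return a black-box scoring function with a hard query cap.

    The secret parameters live in the closure; callers only see outputs.
    (Hypothetical stand-in for a hosted prediction API.)
    """
    used = 0

    def predict(x: List[float]) -> float:
        nonlocal used
        if used >= max_queries:
            raise QueryBudgetExceeded(f"limit of {max_queries} queries reached")
        used += 1
        return sum(w * xi for w, xi in zip(weights, x)) + bias

    return predict


def extract_linear_model(
    predict: Callable[[List[float]], float], n_features: int
) -> Tuple[List[float], float]:
    """Recover (weights, bias) of a linear model using n_features + 1 queries.

    Assumes the target computes w . x + b and returns the raw score.
    """
    zero = [0.0] * n_features
    bias = predict(zero)  # the all-zero input reveals the bias
    weights = []
    for i in range(n_features):
        probe = zero.copy()
        probe[i] = 1.0  # a unit vector isolates one weight
        weights.append(predict(probe) - bias)
    return weights, bias


if __name__ == "__main__":
    # Generous cap: the attacker recovers the parameters.
    open_api = make_victim_api([0.5, -2.0, 3.25], bias=1.5, max_queries=10)
    print(extract_linear_model(open_api, n_features=3))

    # Tight cap: the same attack is cut off before it completes.
    capped_api = make_victim_api([0.5, -2.0, 3.25], bias=1.5, max_queries=2)
    try:
        extract_linear_model(capped_api, n_features=3)
    except QueryBudgetExceeded as err:
        print("extraction blocked:", err)
```

A fixed cap is the bluntest possible control. In practice, monitoring for unusual query patterns (such as many near-identical probes from one account) is usually combined with rate limits, since a patient attacker can spread queries over time or across accounts.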

Source: Machine-learning security research


