When AI Goes Wrong: Building an AI Incident Response Capability

The inevitability of AI incidents

Every organisation using AI at scale will eventually experience an AI incident. This is not pessimism, it is the actuarial reality of deploying complex systems that interact with the real world, process imperfect data, and are maintained by humans. The question is not whether an AI incident will occur, but whether the organisation will be prepared to respond effectively when it does.

Most are not. Standard IT incident response processes are designed for technology failures: systems go down, data is lost, security is breached. AI incidents have different characteristics that standard processes do not address. They may affect people before they are detected, a biased hiring tool may screen out qualified candidates for months before the pattern is identified. They may be difficult to attribute, it can be genuinely difficult to determine whether an adverse outcome was caused by the AI system, the data it processed, the way it was deployed, or a combination of factors. And their scope can be hard to quantify, the number of people affected by an AI decision error may only be estimable, not exactly knowable.

Categories of AI incident

Performance failure: The AI system produces incorrect, inaccurate, or unreliable outputs at a rate or scale that causes material harm. This includes AI systems that degrade over time due to model drift, systems that fail under distribution shift, and systems that were never sufficiently accurate for their deployed use case. Performance failure is often gradual and may go undetected without active monitoring.

Fairness failure: The AI system produces systematically different outcomes for identifiable demographic groups in ways that cause discriminatory harm. Fairness failures may not be apparent from aggregate performance metrics, a model that performs well on average may perform significantly worse for a minority subgroup. Detection requires demographic stratification of performance data, not just overall accuracy monitoring.

Security incident: The AI system is manipulated through adversarial inputs, the training data or model is compromised, or the system is used in ways not intended by its designers. Adversarial attacks on AI systems, inputs designed to cause misclassification or unexpected outputs, are a documented and growing threat vector.

Regulatory or compliance breach: The AI system is found to be non-compliant with applicable law, operating without required conformity assessment, processing data in violation of privacy law, or producing outputs that breach anti-discrimination legislation. Compliance breaches may be discovered by regulators rather than by internal monitoring.

What AI incident response requires

Detection capability: Incidents that are not detected cannot be responded to. AI monitoring infrastructure must be capable of detecting the indicators of each incident category: performance degradation triggers, fairness metric anomalies, unusual input patterns that may indicate adversarial activity, and compliance risk signals. Monitoring must be continuous, not periodic.

Cross-functional response from the first moment: AI incidents are not purely technical events. From the moment an AI incident is identified, the response must involve legal and compliance (to assess regulatory notification obligations and manage privilege), communications (to manage reputational risk and stakeholder notification), and the technical team (to contain the incident and conduct root cause analysis). Waiting for technical root cause before engaging legal or communications is a governance failure that compounds the original incident.

Regulatory notification: The EU AI Act requires providers and deployers of high-risk AI systems to notify national market surveillance authorities of serious incidents without undue delay. Serious incidents are defined to include deaths, serious harms to health or fundamental rights, and significant property damage attributable to the AI system. Most organisations subject to this requirement have no process for identifying reportable AI incidents or making the required notifications.

Affected individual notification: Where an AI incident has caused harm to identifiable individuals, those individuals may have rights to notification. The appropriate notification obligation depends on the nature of the incident, the jurisdiction, and the applicable regulatory regime, but the analysis must happen promptly, not as an afterthought.

Post-incident review

Post-incident review of AI failures should ask two distinct questions. The first is the technical question: what went wrong with the model, the data, or the deployment? The second, and more important governance question, is: what governance failures allowed this to happen and allowed it to persist until it became an incident?

AI technical failures are almost always preceded by governance failures: inadequate monitoring that failed to detect degradation, insufficient validation that missed a performance issue before deployment, absent accountability structures that meant nobody's job it was to catch the problem, or a culture that treated AI system concerns as engineering problems rather than risk management problems.

Post-incident review that identifies only technical causes, without examining governance failures, will produce technical remediation that does not prevent recurrence. Governance remediation, strengthening monitoring, validation, accountability, and culture, is what prevents the next incident.

When AI Goes Wrong: Building an AI Incident Response Capability

Key Takeaways

The inevitability of AI incidents

Categories of AI incident

What AI incident response requires

Post-incident review

Stay ahead of AI governance

Related articles

AI Controls Framework: The Practical Guide for Enterprise Risk and Compliance Teams

AI Internal Audit: What Audit Committees Should Demand and How to Test AI Controls

Third-Party AI Controls: The Vendor Management Framework for AI Risk

More from AIRiskAware