What Is Datasheets for Datasets?

Datasheets for Datasets is a documentation standard that records how a dataset was created, composed, intended to be used, and maintained.

Definition

Datasheets for Datasets: a documentation standard that records how a dataset was created, composed, intended to be used, and maintained.

Proposed to bring transparency to the data behind AI, a datasheet accompanies a dataset and answers questions about its motivation, composition, collection, preprocessing, appropriate uses, and maintenance, much as an electronic component ships with a spec sheet. It complements model cards by documenting the data rather than the model.

Source: Gebru et al., "Datasheets for Datasets"
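
As a rough illustration of how a datasheet can travel with a dataset, the sketch below models one as a small record with a free-text answer per topic and reports which topics are still unanswered. The class name, field names, and sample answers are hypothetical and chosen only for this example; they loosely mirror the topics named above and are not the question list from Gebru et al.

```python
from dataclasses import dataclass, fields


@dataclass
class Datasheet:
    """Hypothetical, simplified datasheet: one free-text answer per topic."""
    dataset_name: str
    motivation: str = ""
    composition: str = ""
    collection: str = ""
    preprocessing: str = ""
    intended_uses: str = ""
    maintenance: str = ""


def missing_sections(sheet: Datasheet) -> list[str]:
    """Return the names of topics that have no answer yet."""
    return [f.name for f in fields(sheet) if not getattr(sheet, f.name).strip()]


# Made-up example values, for illustration only.
sheet = Datasheet(
    dataset_name="example-reviews",
    motivation="Built to study sentiment classification.",
    collection="Gathered from a public forum.",
)

print(missing_sections(sheet))
```

A check like this could run before a dataset is published, so that an incomplete datasheet is flagged rather than silently shipped.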

Related terms

Model Card, Training Data Governance, Content Provenance, Technical Documentation
