What Is Datasheets for Datasets?
Datasheets for Datasets is a documentation standard that records how a dataset was created, composed, intended to be used, and maintained.
Datasheets for Datasets — a documentation standard that records how a dataset was created, composed, intended to be used, and maintained.
Proposed to bring transparency to the data behind AI, a datasheet accompanies a dataset and answers questions about its motivation, collection, preprocessing, and appropriate uses — much as an electronic component ships with a spec sheet. It complements model cards by documenting the data rather than the model.
Source: Gebru et al., "Datasheets for Datasets"
Plain-language explanation
Proposed to bring transparency to the data behind AI, a datasheet accompanies a dataset and answers questions about its motivation, collection, preprocessing, and appropriate uses — much as an electronic component ships with a spec sheet. It complements model cards by documenting the data rather than the model.
Related terms
See where you stand on AI governance
Take the free 7-question maturity assessment and get a personalised action plan.
Free assessment — 3 minutes →