AIRiskAware
AI Governance Glossary
Governance Concept

What Is Embedding?

Embedding is a numerical vector representation of data — such as a word, sentence, image, or user — learned so that items with similar meaning sit close together in a high-dimensional space.

Definition

Embeddinga numerical vector representation of data — such as a word, sentence, image, or user — learned so that items with similar meaning sit close together in a high-dimensional space.

Embeddings are how modern AI turns messy real-world data into something a model can compute over, and they power semantic search and retrieval-augmented generation. They carry governance implications: embeddings can encode bias, and research shows sensitive information can sometimes be reconstructed from them, so they are not automatically a privacy-safe representation.

Source: Machine-learning literature

Plain-language explanation

Embeddings are how modern AI turns messy real-world data into something a model can compute over, and they power semantic search and retrieval-augmented generation. They carry governance implications: embeddings can encode bias, and research shows sensitive information can sometimes be reconstructed from them, so they are not automatically a privacy-safe representation.

Primary source: Machine-learning literature

Related terms

Retrieval-Augmented Generation (RAG) Grounding Inference (AI) Multimodal Model Vector Database

See where you stand on AI governance

Take the free 7-question maturity assessment and get a personalised action plan.

Free assessment — 3 minutes →