Cross-cutting

Embedding vector

An embedding vector is a numerical representation of a piece of content (text, image, audio) as a fixed-length vector, typically 384 to 3072 dimensions, produced by a neural network trained so that semantically similar inputs produce vectors close to each other in the embedding space. Embeddings power semantic search, recommendation, and clustering.

May 23, 2026

The defining property is that cosine similarity or dot product on embedding vectors corresponds to semantic similarity. 'cat' and 'kitten' produce vectors close together; 'cat' and 'spaceship' produce distant vectors. Embeddings let semantic search match queries to documents by meaning rather than keyword overlap, useful for content discovery, RAG retrieval, and clustering. Production embedding models vary by dimensionality (smaller = cheaper retrieval, larger = better quality), language support, and training corpus (general vs domain-specific). The choice of model matters significantly for retrieval quality; benchmarks like MTEB cover most production scenarios.