01. What It Is
An embedding is a dense numeric vector: an ordered list of floating-point numbers, typically hundreds or thousands of values long. An embedding model takes unstructured data (a sentence, a document, an image, a piece of audio) and maps it to a specific point in a high-dimensional vector space.
The key property: items that are semantically similar end up close together in that vector space, and items that are semantically different end up far apart. "Dog" and "puppy" produce nearby vectors. "Dog" and "spreadsheet" produce distant ones.
This is fundamentally different from keyword search, where "automobile" and "car" share zero characters in common and a naive search engine treats them as unrelated. Embedding models are trained to understand that they mean the same thing, so their vectors are close.