Back to list
Lv.3

Vector Database

Vector Database

A database that stores data as numerical vectors, enabling fast retrieval of similar or semantically related content

In Simple Terms

A vector database is a system that finds information in massive datasets not by exact keyword matches, but by how close things are in meaning or nuance. For example, it comes into play when an AI searches through a large collection of documents to find information relevant to a question, or when an online store recommends products with a similar style to the item you're currently looking at.

Behind the Name

The name combines two words: "Vector" (a quantity with both direction and magnitude — in the world of AI, it refers to a sequence of numbers that captures the characteristics of a piece of data) and "Database" (a structured collection of data).

Take a Closer Look!

A vector database is a system that stores data — such as text, images, and audio — converted into sequences of numbers called "vectors."
The job of converting content into vectors is handled by separate AI models and other technologies; the vector database's role is to store those converted representations and make them searchable.
Unlike traditional databases, which retrieve data based on exact text or numeric conditions, vector databases find similar items by measuring how numerically close their feature representations are to each other.

Representing data as vectors makes it possible to calculate how "semantically close" two pieces of data are.
For example, the words "dog" and "puppy" look different as text strings, but when converted to vectors, they end up with very similar numerical values.
This means you can quickly find data that is similar in meaning or nuance to your query, even without an exact keyword match.

Vector databases are used across a wide range of applications, including semantic search (finding results by meaning rather than keywords), similarity search, and personalized recommendation features tailored to user preferences.
They also power systems where an AI is given access to specific domain knowledge or internal documentation so it can answer questions.
In that workflow, the meaning of a user's question is converted into a vector by AI, the database finds the information with the closest semantic match, and the AI uses that as a reference to generate an accurate answer.

CategoryAIData