Vector Databases — Search by Meaning, Not Keywords
A visual deep dive into vector databases. From embeddings to ANN search to HNSW — understand how AI-powered search finds what you actually mean, not just what you typed.
🧠
Google doesn’t match words.
It matches meaning.
Search “affordable running shoes” and find results for “budget jogging sneakers”
— different words, same meaning. This is powered by vector search:
converting everything into numbers and finding what’s mathematically close.
↓ Scroll to understand the database behind RAG, semantic search, and AI features
Why Keywords Fail
Traditional databases search by exact matching. But humans don’t think in keywords — we think in concepts.
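To see the failure mode concretely, here's a minimal sketch using Python's built-in sqlite3 (the documents and query are made up for illustration):

```python
import sqlite3

# A toy document store that searches by exact substring matching.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE docs (id INTEGER PRIMARY KEY, content TEXT)")
conn.executemany(
    "INSERT INTO docs (content) VALUES (?)",
    [
        ("budget jogging sneakers for daily runs",),
        ("affordable running shoes under $50",),
        ("premium leather dress shoes",),
    ],
)

# Keyword search only finds rows containing the literal phrase.
rows = conn.execute(
    "SELECT content FROM docs WHERE content LIKE ?",
    ("%running shoes%",),
).fetchall()
print(rows)  # Finds the exact phrase, misses 'budget jogging sneakers' —
             # same meaning, different words.
```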
Embeddings: Turning Everything into Numbers
An embedding is a vector — a list of numbers — that captures the meaning of something. Text, images, audio — anything can be embedded.
How embeddings are created
Input: Any piece of text — a word, sentence, paragraph, or entire document. The embedding model doesn't care about length; it converts meaning into numbers.
Model: A neural network (like OpenAI's text-embedding-3-small or Sentence-BERT) trained on billions of text pairs. It learns to map semantically similar text to nearby points in vector space.
Output: A fixed-size array of 768–1536 floating-point numbers encoding the text's meaning. Similar concepts produce similar vectors: 'dog' [0.82, 0.15, ...] and 'puppy' [0.80, 0.18, ...] are nearly identical, while 'Python' [-0.21, 0.76, ...] is completely different.
text → [0.12, -0.45, 0.78, ..., 0.33]
Why do embeddings use 768–1536 dimensions instead of just 2 or 3?
💡 How many independent qualities describe a piece of text? Way more than 2...
Language has thousands of independent axes of meaning: sentiment, topic, formality, specificity, time reference, etc. Just 2 dimensions can't separate all these concepts. With 768+ dimensions, the embedding can capture subtle distinctions — like the difference between 'bank' (river) and 'bank' (finance). More dimensions = more expressive power.
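To make this concrete, here's a minimal sketch using the open-source sentence-transformers library (this particular model outputs 384 dimensions — smaller than the 768–1536 above, but the idea is identical):

```python
from sentence_transformers import SentenceTransformer

# A small open-source embedding model (384-dimensional output).
model = SentenceTransformer("all-MiniLM-L6-v2")

sentences = ["dog", "puppy", "Python programming"]
embeddings = model.encode(sentences, normalize_embeddings=True)

print(embeddings.shape)  # (3, 384) — one fixed-size vector per input
# With unit-normalized vectors, cosine similarity is just a dot product.
print(embeddings[0] @ embeddings[1])  # 'dog' vs 'puppy' — high
print(embeddings[0] @ embeddings[2])  # 'dog' vs 'Python programming' — low
```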
Measuring Similarity: Cosine Distance
Once everything is a vector, finding similar items is just measuring the angle between vectors.
Cosine Similarity — the standard metric
Cosine similarity measures the angle between two vectors by dividing their dot product by the product of their magnitudes. It captures direction (meaning) rather than magnitude (length), so a long document and a short tweet about the same topic score high.
cos(θ) = (A · B) / (||A|| × ||B||)
cos(θ) ≈ 1: The vectors point in exactly the same direction — the texts share the same meaning. 'Happy dog playing' and 'Joyful puppy having fun' would score near 1.0.
cos(θ) ≈ 0: The vectors are perpendicular — the texts have no semantic relationship at all. 'Dog' and 'Economics' are neither similar nor opposite, just completely unrelated topics.
cos(θ) ≈ -1: The vectors point in opposite directions — the texts convey opposite meanings. This is rare in practice because most embedding models don't map semantic opposites to negative cosine values.
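The formula is a one-liner to implement. A minimal sketch with toy 2-D vectors:

```python
import numpy as np

def cosine_similarity(a: np.ndarray, b: np.ndarray) -> float:
    # cos(θ) = (A · B) / (||A|| × ||B||)
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

a = np.array([1.0, 0.0])
b = np.array([0.0, 1.0])
c = np.array([-1.0, 0.0])

print(cosine_similarity(a, a))  #  1.0 — same direction, same meaning
print(cosine_similarity(a, b))  #  0.0 — perpendicular, unrelated
print(cosine_similarity(a, c))  # -1.0 — opposite directions
```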
Vectors A=[1,0] and B=[0,1] have cosine similarity of 0. What does this mean?
💡 What's the geometric meaning of cos(90°) = 0?
Cosine similarity of 0 means the vectors are perpendicular (90° angle). They share no semantic similarity. Think of it like 'dog' and 'economics' — not opposites, just completely unrelated topics. A cosine of -1 would mean opposites, and 1 would mean identical.
The Scale Problem: You Can’t Compare Everything
You have 100 million vectors. A user queries with a new vector. Finding the most similar one means computing 100 million cosine similarities — way too slow.
Brute force is O(n) — that doesn't scale
Brute force: compare the query to ALL n vectors.
At ~10 GFLOPS: ~15 seconds per query over 100 million vectors.
Solution: Approximate Nearest Neighbor (ANN) search.
Trade-off: 95–99% accuracy for ~1000x speed.
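Here's what that brute-force scan looks like — a sketch with random unit vectors standing in for real embeddings (100k rows instead of 100M, but the O(n · d) shape is the same):

```python
import numpy as np

rng = np.random.default_rng(0)

# 100k unit-normalized vectors standing in for a real corpus.
n, dim = 100_000, 768
db = rng.standard_normal((n, dim)).astype(np.float32)
db /= np.linalg.norm(db, axis=1, keepdims=True)

query = rng.standard_normal(dim).astype(np.float32)
query /= np.linalg.norm(query)

# One dot product per stored vector: O(n · dim) work for every query.
scores = db @ query
top5 = np.argsort(scores)[-5:][::-1]
print(top5, scores[top5])
```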
HNSW: The Algorithm Behind Vector Search
Hierarchical Navigable Small World — the most popular ANN algorithm. It builds a multi-layer graph where the sparse upper layers act as a “highway system” for fast navigation.
HNSW has multiple layers. What's the purpose of the upper (sparse) layers?
💡 Think of a real-world analogy: interstate highways vs. local streets...
The upper layers act like a highway system. They have few nodes with long-range connections, allowing the search algorithm to quickly jump to the right 'neighborhood' of the query. Then the search descends to lower layers (with more nodes and shorter connections) for precise, local search. This is why HNSW achieves O(log n) search time — the hierarchical structure halves the search space at each layer.
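In practice you rarely implement HNSW yourself. A sketch using the hnswlib library (the parameters shown are common starting points, not tuned values):

```python
import hnswlib
import numpy as np

dim, n = 128, 10_000
rng = np.random.default_rng(0)
data = rng.standard_normal((n, dim)).astype(np.float32)

# Build the multi-layer HNSW graph.
# M = max connections per node; ef_construction = build-time search width.
index = hnswlib.Index(space="cosine", dim=dim)
index.init_index(max_elements=n, ef_construction=200, M=16)
index.add_items(data, np.arange(n))

# ef = search-time beam width: higher → more accurate, slower.
index.set_ef(50)

query = rng.standard_normal((1, dim)).astype(np.float32)
labels, distances = index.knn_query(query, k=5)
print(labels, distances)  # approximate top-5 neighbors, ~O(log n) per query
```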
Vector Databases in the Real World
In a RAG system, why do you give retrieved documents to the LLM instead of just querying the LLM directly?
💡 What happens when you ask an LLM about your company's specific refund policy?
LLMs have a knowledge cutoff date and don't know about your private data. Without retrieval, they'll either say 'I don't know' or (worse) hallucinate a plausible but wrong answer. RAG gives the LLM actual source documents to reference, dramatically reducing hallucination and enabling it to answer about your specific data (company docs, recent events, etc.).
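A skeleton of that retrieve-then-generate flow, reusing the embedding model from earlier (the documents are toy examples, and the final LLM call is left out since it depends on your provider):

```python
import numpy as np
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("all-MiniLM-L6-v2")

docs = [
    "Refunds are available within 30 days of purchase.",
    "Shipping takes 3-5 business days.",
    "Our office is closed on public holidays.",
]
doc_vectors = model.encode(docs, normalize_embeddings=True)

# 1. Retrieve: embed the question, find the closest document.
question = "What is your refund policy?"
q = model.encode(question, normalize_embeddings=True)
context = docs[int(np.argmax(doc_vectors @ q))]

# 2. Augment: ground the prompt in the retrieved text.
prompt = f"Answer using ONLY this context:\n{context}\n\nQuestion: {question}"

# 3. Generate: send the prompt to any LLM (call omitted here).
print(prompt)
```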
You need to add semantic search to an existing PostgreSQL app with 1M documents. Which approach is most practical?
💡 What's the simplest solution that doesn't require adding new infrastructure?
pgvector is a PostgreSQL extension that adds vector column types and ANN search indexes directly to your existing database. No new infrastructure, no data sync, no new ops burden. For 1M documents, pgvector handles this easily with HNSW indexes. A dedicated vector DB (Pinecone, Qdrant) makes sense at 10M+ vectors or when you need specialized features.
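A sketch of what that looks like with psycopg2, assuming a running Postgres with pgvector 0.5+ installed (connection details and table names are placeholders):

```python
import psycopg2

conn = psycopg2.connect("dbname=app user=app password=secret host=localhost")
cur = conn.cursor()

cur.execute("CREATE EXTENSION IF NOT EXISTS vector")
cur.execute("""
    CREATE TABLE IF NOT EXISTS documents (
        id bigserial PRIMARY KEY,
        content text,
        embedding vector(1536)
    )
""")
# HNSW index with cosine distance — the same algorithm described above.
cur.execute("""
    CREATE INDEX IF NOT EXISTS documents_embedding_idx
    ON documents USING hnsw (embedding vector_cosine_ops)
""")
conn.commit()

# <=> is pgvector's cosine-distance operator: smallest distance = most similar.
query_embedding = [0.1] * 1536  # placeholder — use a real query embedding
cur.execute(
    "SELECT content FROM documents ORDER BY embedding <=> %s::vector LIMIT 5",
    (str(query_embedding),),
)
print(cur.fetchall())
```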
🎓 What You Now Know
✓ Embeddings turn meaning into numbers — Neural networks convert text/images into high-dimensional vectors where similar concepts cluster together.
✓ Cosine similarity measures semantic closeness — It captures direction (meaning) rather than magnitude (length).
✓ ANN search trades accuracy for speed — Checking every vector is O(n) and too slow. ANN algorithms give ~99% accuracy at 1000x the speed.
✓ HNSW is the dominant algorithm — Multi-layer graph with fast long-range navigation at the top and precise local search at the bottom.
✓ RAG is the killer app — Retrieve relevant docs from a vector DB, feed them to an LLM, get grounded answers with less hallucination.
Vector databases are the infrastructure layer powering the AI revolution. Every chatbot, every AI search, every recommendation engine is built on these concepts. They’re to the AI era what SQL databases were to the web era. 🚀
📄 Efficient and Robust Approximate Nearest Neighbor Search Using Hierarchical Navigable Small World Graphs (Malkov & Yashunin, 2016)
📄 Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks (Lewis et al., 2020)