HNSW — Hierarchical Navigable Small World Graphs for Vector Search
A visual deep dive into HNSW, the dominant graph-based index for vector search. Understand the multi-layer graph structure, the M and ef parameters, complexity analysis, and how it compares to IVF, LSH, and brute force.
🕸️
A graph where every hop
gets you closer.
HNSW (Hierarchical Navigable Small World) builds a multi-layer graph where each vector is a node and edges connect nearby vectors. Searching means “hopping” through the graph — each hop brings you closer to the query. The result: O(log N) query time with the highest recall of any ANN algorithm. The tradeoff? It’s memory-hungry.
↓ Scroll to understand the graph-based index that powers most vector databases
HNSW: The Graph-Based Index
You're tuning HNSW for a latency-sensitive application. You set ef=5 (very low beam width). What problem will you likely encounter?
💡 Think about what happens in a greedy search when the best path requires going 'uphill' temporarily.
ef (exploration factor) controls how many candidates the search tracks simultaneously. With ef=5, the algorithm greedily follows the 5 closest nodes it's found so far. If the true nearest neighbor requires a 'detour' through a slightly farther node, the greedy search won't explore it. This is the local optima problem in graph search. Higher ef (e.g., 100-200) maintains a wider beam of candidates, allowing the search to explore multiple promising paths. The tradeoff: ef=5 might give 85% recall in 0.1ms, while ef=200 gives 99.5% recall in 5ms. For production, ef=64-128 is the typical sweet spot.
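To make the ef tradeoff concrete, here is a minimal sketch (plain Python and NumPy, not a real HNSW library) of a single-layer best-first search that keeps at most ef candidates. The graph, vectors, and the search_layer function name are illustrative assumptions, not an actual library API.

```python
import heapq
import numpy as np

def search_layer(query, entry, vectors, neighbors, ef):
    """Best-first search over one graph layer, keeping at most ef result candidates."""
    dist = lambda i: float(np.linalg.norm(vectors[i] - query))
    visited = {entry}
    candidates = [(dist(entry), entry)]        # min-heap: closest unexplored node first
    results = [(-dist(entry), entry)]          # max-heap holding the ef best nodes found so far
    while candidates:
        d, node = heapq.heappop(candidates)
        if d > -results[0][0]:                 # closest candidate is already worse than
            break                              # the worst of our ef results: stop searching
        for nb in neighbors[node]:
            if nb in visited:
                continue
            visited.add(nb)
            d_nb = dist(nb)
            if len(results) < ef or d_nb < -results[0][0]:
                heapq.heappush(candidates, (d_nb, nb))
                heapq.heappush(results, (-d_nb, nb))
                if len(results) > ef:
                    heapq.heappop(results)     # evict the current worst result
    return sorted((-d, n) for d, n in results) # (distance, node) pairs, best first

# Toy data: with a tiny ef the search can stall in a local optimum; a larger ef
# keeps more paths alive and is more likely to reach the true nearest neighbor.
rng = np.random.default_rng(0)
vectors = rng.normal(size=(50, 8))
neighbors = {i: list(rng.choice(50, size=6, replace=False)) for i in range(50)}
query = rng.normal(size=8)
print(search_layer(query, entry=0, vectors=vectors, neighbors=neighbors, ef=5)[:3])
print(search_layer(query, entry=0, vectors=vectors, neighbors=neighbors, ef=50)[:3])
```

A wider beam only pays off because the extra candidates let the search back out of dead ends; that is exactly the cost you trade latency for.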
HNSW complexity
Build: O(N × log(N) × M). Each of N vectors is inserted into the graph, connecting to M neighbors at each layer. The hierarchical structure means insertion traverses O(log N) layers.
Memory: O(N × M × number_of_layers). Every node stores M edges at each layer, and the number of layers grows logarithmically with N. This makes HNSW memory-hungry — every edge must be explicitly stored.
Search: O(log(N) × ef × M). Search traverses log(N) layers, tracking ef candidates at each step, and checking M neighbors per candidate. The logarithmic layer traversal is what gives HNSW its speed.
Memory at scale: N × M × 4 bytes × log(N) ≈ 2 TB. At 1 billion vectors with M=16, the graph index alone requires roughly 2 TB of RAM. This is HNSW's main cost — every edge is an explicit pointer in memory.
Latency at scale: For 100M vectors, typical query latency is 1-10ms — much faster than IVF at equivalent recall, but at the cost of roughly 4× more memory.
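As a sanity check on the memory figure, here is the back-of-the-envelope arithmetic in a few lines of Python, using the article's simplified model (every edge stored as a 4-byte neighbor id on every layer):

```python
# Back-of-the-envelope check of the "about 2 TB" memory figure above,
# using a simplified model: a 4-byte neighbor id per edge on every layer.
import math

N = 1_000_000_000            # 1 billion vectors
M = 16                       # edges per node per layer
layers = math.log2(N)        # about 30 layers in this simplified model
edge_bytes = N * M * 4 * layers
print(f"layers ~ {layers:.0f}")                       # ~ 30
print(f"graph edges ~ {edge_bytes / 1e12:.1f} TB")    # ~ 1.9 TB
```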
HNSW query time is O(log N × ef × M). If you double the database size from 500M to 1B vectors, how does query latency change?
💡 Calculate log₂(500M) and log₂(1B). How different are they?
This is the magic of logarithmic complexity. log₂(500M) ≈ 29, log₂(1B) ≈ 30. That's ONE additional layer to traverse out of 30 — about 3.4% more work. Compare this to IVF where query time is O(√N): doubling N increases query time by √2 ≈ 41%. Or brute force at O(N): doubling N doubles query time. HNSW's logarithmic scaling is why it dominates at large scale — going from 100M to 10B vectors only adds ~7 extra hops. The constant factors (ef, M) matter more than N for practical latency.
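You can verify the scaling comparison with a few lines of arithmetic (relative query work only, ignoring constant factors like ef and M):

```python
# Relative query work when the database doubles from 500M to 1B vectors,
# per the complexity formulas in this article (constant factors ignored).
import math

n1, n2 = 500_000_000, 1_000_000_000
hnsw  = math.log2(n2) / math.log2(n1)    # O(log N)
ivf   = math.sqrt(n2) / math.sqrt(n1)    # O(sqrt N)
brute = n2 / n1                          # O(N)
print(f"HNSW:        +{(hnsw - 1) * 100:.1f}% work")   # ~ +3.5%
print(f"IVF:         +{(ivf - 1) * 100:.1f}% work")    # ~ +41.4%
print(f"brute force: +{(brute - 1) * 100:.0f}% work")  # +100%
```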
The Big Complexity Comparison
You have 500M 768-dim vectors. HNSW with M=32 achieves 99% recall but requires ~500GB RAM for the graph alone. You only have 64GB RAM. What do you do?
💡 Which index type is specifically designed to minimize memory usage?
IVF-PQ is designed for exactly this scenario. Product Quantization (PQ) splits each 768-dim vector into 96 sub-vectors of 8 dimensions each, then quantizes each sub-vector to 1 byte (256 codebook entries). Result: 768×4 bytes = 3072 bytes → 96 bytes per vector (32× compression). 500M × 96 bytes = 48GB — fits! The accuracy cost: PQ introduces quantization error, so recall drops from ~99% (HNSW) to ~92%. Sharding across machines also works, but requires 8 machines. IVF-PQ is the single-machine winner.
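The compression arithmetic is easy to check directly; the sketch below just reproduces the numbers from the answer (numbers only, not an IVF-PQ implementation):

```python
# Quick sanity check of the PQ compression arithmetic
# (numbers only; this is not an IVF-PQ implementation).
dim, n_vectors = 768, 500_000_000
n_subvectors, bytes_per_code = 96, 1               # 96 sub-vectors, 1 byte each (256 centroids)

raw_bytes_per_vec = dim * 4                        # float32: 3072 bytes per vector
pq_bytes_per_vec = n_subvectors * bytes_per_code   # 96 bytes per vector
print(f"compression: {raw_bytes_per_vec // pq_bytes_per_vec}x")       # 32x
print(f"raw vectors: {n_vectors * raw_bytes_per_vec / 1e9:.0f} GB")   # 1536 GB
print(f"PQ codes:    {n_vectors * pq_bytes_per_vec / 1e9:.0f} GB")    # 48 GB, fits in 64 GB
```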
You’ve seen the memory–accuracy tradeoff at scale. Now let’s zoom into why HNSW’s hierarchy matters — the layered structure is the key insight that separates HNSW from a flat graph and gives it O(log N) search.
Why does HNSW use MULTIPLE layers instead of a single flat graph?
💡 Compare HNSW to a skip list. What does each layer add?
A flat Navigable Small World (NSW) graph requires O(√N) hops because you can only take short steps between nearby nodes. Adding hierarchy creates a 'zoom' effect: Layer 2 (~N/16 nodes) lets you jump across large distances in a few hops, Layer 1 (~N/4 nodes) refines the position, and Layer 0 (all N nodes) does the final precise search. Each layer shrinks the search space by a constant factor, giving O(log N) total hops. This is the same principle as skip lists in databases, B-trees in storage systems, and express/local train lines in transit. The hierarchy is what makes HNSW the fastest ANN algorithm at high recall.
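Putting the hierarchy into code makes the 'zoom' effect explicit. This minimal sketch assumes the search_layer helper sketched earlier (greedy best-first search restricted to one layer's edges); the names and data layout are illustrative assumptions, not a library API.

```python
def hnsw_search(query, entry_point, vectors, layer_neighbors, ef):
    """Hierarchical descent: layer_neighbors[L] maps node -> neighbor list on layer L.

    Higher layers contain exponentially fewer nodes; layer 0 contains all of them.
    """
    entry = entry_point
    # Upper layers: coarse "zoom". A beam of ef=1 is enough here, because the
    # only goal is to land on a good entry point for the next layer down.
    for layer in range(len(layer_neighbors) - 1, 0, -1):
        entry = search_layer(query, entry, vectors, layer_neighbors[layer], ef=1)[0][1]
    # Layer 0: the full graph, searched with the real ef to get high recall.
    return search_layer(query, entry, vectors, layer_neighbors[0], ef=ef)
```

The upper layers play the role of the express lines: they cross most of the dataset in a handful of hops, and only layer 0 pays the full ef-wide search cost.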
🎓 What You Now Know
✓ HNSW builds a navigable graph — each vector is a node, edges connect nearby vectors, search means hopping through the graph.
✓ M controls the graph density — more edges = higher recall but more memory. Sweet spot: M = 16-64.
✓ ef controls the search beam width — higher ef explores more paths for better recall at the cost of latency.
✓ O(log N) query time, but memory-hungry — HNSW gives the highest recall but requires storing the entire graph in RAM.
✓ When RAM is tight, use IVF-PQ instead — 32× compression with ~92% recall fits billion-scale datasets on a single machine.
HNSW dominates vector search, but production systems combine it with BM25 keyword search and cross-encoder reranking for best results. ⚡
↗ Keep Learning
Approximate Nearest Neighbor Search — Trading 1% Accuracy for 1000× Speed
A visual deep dive into ANN search. Why brute-force nearest neighbor fails at scale, how approximate methods achieve 99% recall with logarithmic query time, and the fundamental accuracy-speed tradeoff behind every vector search system.
IVF Index — Partitioning Vector Space with K-Means for Fast Search
A visual deep dive into the Inverted File Index (IVF). How k-means clustering partitions billion-scale vector collections into searchable regions, the nprobe recall-speed knob, and IVF complexity analysis.
Vector Databases — Search by Meaning, Not Keywords
A visual deep dive into vector databases. From embeddings to ANN search to HNSW — understand how AI-powered search finds what you actually mean, not just what you typed.
K-Nearest Neighbors — The Algorithm with No Training Step
A scroll-driven visual deep dive into KNN. Learn how the laziest algorithm in ML works, why distance metrics matter, and how the curse of dimensionality kills it.