K-Nearest Neighbors — The Algorithm with No Training Step
A scroll-driven visual deep dive into KNN. Learn how the laziest algorithm in ML works, why distance metrics matter, and how the curse of dimensionality kills it.
Instance-Based Learning
No training.
Just remembering.
KNN stores the entire training set and makes predictions by finding similar examples. It’s the simplest algorithm that actually works — and it teaches you everything about the bias-variance tradeoff.
The Algorithm
KNN is called a 'lazy learner.' Why?
💡 What happens during KNN 'training'? Think about what parameters it learns...
Unlike models that learn parameters (w, b) during training, KNN simply stores the entire dataset. There is literally no training step. All the computation — finding K nearest neighbors — happens when you make a prediction. This makes training instant (O(1)!) but prediction slow (O(n·d) for each query). That's why it's 'lazy.'
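The whole algorithm fits in a few lines. Here is a minimal sketch (the function name `knn_predict` is illustrative, not from the article): "training" is just keeping the lists around, and all the O(n·d) work happens inside the prediction call.

```python
from collections import Counter
import math

def knn_predict(train_X, train_y, query, k=3):
    """Predict the label of `query` by majority vote among its k nearest
    training points (Euclidean distance). 'Training' is just storing the data."""
    # Compute the distance from the query to every training point: O(n·d)
    dists = [(math.dist(x, query), label) for x, label in zip(train_X, train_y)]
    # Keep the k closest and take a majority vote
    dists.sort(key=lambda pair: pair[0])
    votes = [label for _, label in dists[:k]]
    return Counter(votes).most_common(1)[0][0]

# Two tiny clusters: class "A" near the origin, class "B" near (5, 5)
X = [(0, 0), (1, 0), (0, 1), (5, 5), (6, 5), (5, 6)]
y = ["A", "A", "A", "B", "B", "B"]

print(knn_predict(X, y, query=(0.5, 0.5), k=3))  # → A
print(knn_predict(X, y, query=(5.5, 5.5), k=3))  # → B
```

Note that there is no `fit()` step to speak of: changing the training set just means swapping out the stored lists.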
How Do You Measure “Nearest”?
Common distance metrics
Euclidean distance: the straight-line distance between two points. The default choice for KNN; works well when all features are on similar scales and you care about magnitude.
d = √(Σ(xᵢ - yᵢ)²)

Manhattan distance: city-block distance, the sum of absolute differences along each axis. More robust to outliers than Euclidean because it doesn't square the differences.
d = Σ|xᵢ - yᵢ|

Cosine distance: measures the angle between vectors, ignoring magnitude. Ideal for text and documents, where the direction of the feature vector matters more than its length.
d = 1 - (x·y)/(‖x‖ ‖y‖)

If features have wildly different scales (e.g., salary 10K-200K vs. age 18-65), the larger-scale feature dominates all distance calculations. Always normalize before using KNN.
Feature A ranges from 0-1 and Feature B ranges from 0-1,000,000. Without normalization, what happens with Euclidean distance?
💡 Calculate the squared difference for each: (0.5)² vs (500,000)²...
Euclidean distance squares the differences. A difference of 0.5 in Feature A contributes 0.25 to the distance, while a difference of 500,000 in Feature B contributes 250,000,000,000. Feature B dominates by a factor of a trillion! Feature A is effectively ignored. Solution: always normalize features (z-score or min-max scaling) before using KNN.
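The domination effect and the fix are easy to see numerically. Below is a sketch of z-score standardization using only the standard library (the `z_score` helper is illustrative); note that Feature B here is just Feature A scaled by 10⁶, so after standardization the two columns become identical.

```python
import statistics

def z_score(column):
    """Standardize a feature column to mean 0 and standard deviation 1."""
    mu = statistics.mean(column)
    sigma = statistics.pstdev(column)
    return [(v - mu) / sigma for v in column]

# Feature A spans 0-1, Feature B spans 0-1,000,000
feature_a = [0.0, 0.2, 0.5, 1.0]
feature_b = [0.0, 200_000.0, 500_000.0, 1_000_000.0]

# Before scaling, squared differences in B dwarf those in A:
print(0.5 ** 2)          # 0.25
print(500_000.0 ** 2)    # 250000000000.0 — B dominates by ~10**12

# After z-scoring, both features live on the same scale:
print(z_score(feature_a))
print(z_score(feature_b))  # identical to feature_a's z-scores
```

Min-max scaling to [0, 1] works just as well here; the essential point is that every feature ends up contributing comparably to the distance.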
The Bias-Variance Tradeoff in Action
What happens when K = N (the entire training set)?
💡 If you ask ALL 1000 neighbors to vote, does the query point's position matter?
When K equals the training set size, every prediction considers ALL training points. The majority vote always goes to whichever class has the most training examples — regardless of the query point's features. This is the maximum-bias, zero-variance extreme. It's equivalent to just predicting the most common class. Useless as a model.
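You can verify the K = N degenerate case directly: no matter where the query lands, the vote over all points returns the majority class. A minimal sketch (the `knn_predict` helper is illustrative):

```python
from collections import Counter
import math

def knn_predict(train_X, train_y, query, k):
    # Sort all training points by distance to the query, vote among the k closest
    dists = sorted((math.dist(x, query), lbl) for x, lbl in zip(train_X, train_y))
    votes = [lbl for _, lbl in dists[:k]]
    return Counter(votes).most_common(1)[0][0]

# 4 points of class "A", 2 of class "B"
X = [(0, 0), (0, 1), (1, 0), (1, 1), (10, 10), (11, 11)]
y = ["A", "A", "A", "A", "B", "B"]

# With k = len(X), every query gets the same answer: the majority class "A" —
# even a query sitting right on top of the "B" cluster.
for query in [(0, 0), (10.5, 10.5), (-50, 80)]:
    print(knn_predict(X, y, query, k=len(X)))  # → A every time
```

The opposite extreme, K = 1, is maximum variance: every prediction is decided by a single, possibly noisy, training point.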
Why KNN Breaks in High Dimensions
Why distance fails in high dimensions
Volume of the unit hypersphere → 0 as d → ∞
dₘₐₓ/dₘᵢₙ → 1 as d → ∞
To maintain constant sample density: N ∝ eᵈ samples needed

In 1,000 dimensions, how does the ratio of nearest-to-farthest neighbor distance behave?
💡 Think about what dₘₐₓ/dₘᵢₙ → 1 means intuitively...
As dimensionality grows, the difference between the nearest and farthest point shrinks relative to the overall distances. In 1,000 dimensions, even with 1 million data points, the nearest and farthest neighbors are nearly the same distance from your query. The concept of 'nearest' becomes meaningless. This is why KNN should always be combined with dimensionality reduction (e.g., PCA) in practice.
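Distance concentration is easy to observe empirically. This sketch (the `dist_ratio` helper is illustrative) draws 1,000 uniform random points and measures the nearest-to-farthest distance ratio from a random query as dimensionality grows:

```python
import math
import random

def dist_ratio(n_points, dim, seed=0):
    """Ratio of nearest to farthest distance from a random query point,
    over n_points uniform random points in the unit hypercube."""
    rng = random.Random(seed)
    query = [rng.random() for _ in range(dim)]
    dists = [
        math.dist(query, [rng.random() for _ in range(dim)])
        for _ in range(n_points)
    ]
    return min(dists) / max(dists)

# The ratio climbs toward 1 as dimensionality grows:
# in 2D the nearest neighbor is far closer than the farthest point,
# but in 1,000D they are nearly the same distance away.
for dim in (2, 10, 100, 1000):
    print(dim, round(dist_ratio(1000, dim), 3))
```

When the ratio is close to 1, the K "nearest" neighbors are barely more relevant than random points, which is exactly why the vote stops carrying signal.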
When to Use KNN
🎓 What You Now Know
✓ KNN has no training step — it memorizes the data. Prediction: find the K nearest neighbors and take a majority vote.
✓ Distance metric and feature scaling matter hugely — Always normalize before KNN.
✓ K controls bias-variance — Small K = overfitting. Large K = underfitting. Use cross-validation.
✓ Curse of dimensionality kills KNN — Beyond ~20 features, all points become equidistant.
✓ Best for small, low-dimensional datasets — Otherwise use tree-based methods.
KNN is the simplest ML algorithm — and that’s its power. It makes the bias-variance tradeoff tangible, teaches you about distance metrics and scaling, and demonstrates why dimensionality matters. Every ML engineer should understand it, even if you rarely use it in production. 🚀
↗ Keep Learning
Logistic Regression — The Classifier That's Not Really Regression
A scroll-driven visual deep dive into logistic regression. Learn how a regression model becomes a classifier, why the sigmoid is the key, and how log-loss trains it.
Decision Trees — How Machines Learn to Ask Questions
A scroll-driven visual deep dive into decision trees. Learn how trees split data, what Gini impurity and information gain mean, and why trees overfit like crazy.
Bias-Variance Tradeoff — The Most Important Concept in ML
A scroll-driven visual deep dive into the bias-variance tradeoff. Learn why every model makes errors, how underfitting and overfitting emerge, and how to balance them.