Bias-Variance Tradeoff — The Most Important Concept in ML
A scroll-driven visual deep dive into the bias-variance tradeoff. Learn why every model makes errors, how underfitting and overfitting emerge, and how to balance them.
Core ML Theory
Every model is wrong.
This tells you HOW it’s wrong.
Bias means your model is too simple — it misses the pattern. Variance means your model is too sensitive — it chases noise. You can’t minimize both simultaneously. Mastering this tradeoff is the single most important skill in machine learning.
The Three Sources of Error
The bias-variance decomposition
Bias = E[f̂(x)] − f(x). How far off is your average aim? This is the systematic gap between what your model predicts on average and the true answer. More training data won't fix it — you need a more flexible model.
Var = E[(f̂(x) − E[f̂(x)])²]. How scattered are your predictions? Train on a different sample and you get a different model. High variance means your model is too sensitive to which specific data points it saw.
σ² = irreducible noise. The wind you can't control. Even a perfect model can't beat this — it's randomness baked into the data itself. This is the error floor that no model can go below.
You train the same linear regression on 100 different random samples of the same data. The predictions are tightly clustered but consistently 5 units above the true values. This model has:
💡 Consistent = low variance. Systematically wrong = high bias.
The predictions are CONSISTENT across samples (low variance — tight cluster) but SYSTEMATICALLY OFF by 5 units (high bias — off-center). This is the signature of an underfitting model: it's too simple to capture the true pattern, so it makes the same mistake every time regardless of which training data it sees. Adding more features or using a more flexible model would reduce the bias.
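You can watch this signature appear in a simulation. The sketch below (numpy only; the setup of a line fit to quadratic data is my illustrative choice, not from the article) trains the same too-simple model on 200 independent samples and measures bias and variance empirically at one probe point:

```python
import numpy as np

rng = np.random.default_rng(0)

def true_f(x):
    return x ** 2  # a curve that a straight line cannot capture

x_probe = 1.5          # point where we measure bias and variance
preds = []
for _ in range(200):   # 200 independent training samples
    x = rng.uniform(-2, 2, 100)
    y = true_f(x) + rng.normal(0, 0.3, 100)
    slope, intercept = np.polyfit(x, y, deg=1)  # fit a line (too simple)
    preds.append(slope * x_probe + intercept)

preds = np.array(preds)
bias = preds.mean() - true_f(x_probe)   # E[f̂(x)] − f(x)
variance = preds.var()                  # E[(f̂(x) − E[f̂(x)])²]
print(f"bias: {bias:.2f}, variance: {variance:.3f}")
```

The predictions cluster tightly (variance near zero) yet sit well below the true value of 2.25: consistent, systematically wrong, exactly the underfitting signature described above.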
The Spectrum: Too Simple ↔ Too Complex
Your model has 0.01 training error and 0.35 test error. What's happening?
💡 When train error is much lower than test error, the model learned the training data TOO well...
The hallmark of overfitting: near-zero training error (the model 'memorized' the training set) but much higher test error (it doesn't generalize). The GAP between train and test error is the key diagnostic. Remedies: more training data, regularization (L1/L2), simpler model, dropout, early stopping, cross-validation.
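A quick way to see the gap open up is to sweep model complexity and compare train and test error. This is a minimal numpy sketch (the sine target, noise level, and degrees are illustrative assumptions, not from the article):

```python
import numpy as np

rng = np.random.default_rng(1)
x_train = rng.uniform(-1, 1, 20)
y_train = np.sin(3 * x_train) + rng.normal(0, 0.2, 20)
x_test = rng.uniform(-1, 1, 200)
y_test = np.sin(3 * x_test) + rng.normal(0, 0.2, 200)

for deg in (1, 3, 12):  # simple → balanced → flexible enough to memorize
    coeffs = np.polyfit(x_train, y_train, deg)
    train_mse = np.mean((np.polyval(coeffs, x_train) - y_train) ** 2)
    test_mse = np.mean((np.polyval(coeffs, x_test) - y_test) ** 2)
    print(f"degree {deg:2d}: train {train_mse:.4f}  test {test_mse:.4f}"
          f"  gap {test_mse - train_mse:.4f}")
```

With 13 parameters chasing 20 noisy points, the degree-12 fit drives training error toward zero while the gap to test error widens — the same pattern as the 0.01 vs 0.35 quiz scenario.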
The Classic U-Shaped Curve
Managing the Tradeoff
Your model underfits. You collect 10x more training data. What happens?
💡 Can a straight line ever fit a curve, no matter how many points you sample?
This is a critical insight! Underfitting = high bias = systematic error due to model simplicity. A linear model trying to fit a curve will ALWAYS be wrong, no matter how much data you give it. The solution is a more complex model (polynomial, tree, etc.), not more data. Conversely, more data DOES help overfitting because it makes the model harder to memorize. This asymmetry is why diagnosing bias vs variance BEFORE collecting more data saves enormous time and money.
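The asymmetry is easy to demonstrate: scale up the data for a biased model and watch the error refuse to move. A numpy sketch (the quadratic target and sample sizes are my illustrative choices):

```python
import numpy as np

rng = np.random.default_rng(2)

def mse_of_linear_fit(n):
    """Fit a straight line to quadratic data with n samples; return test MSE."""
    x = rng.uniform(-2, 2, n)
    y = x ** 2 + rng.normal(0, 0.3, n)
    coeffs = np.polyfit(x, y, deg=1)
    x_grid = np.linspace(-2, 2, 500)
    return np.mean((np.polyval(coeffs, x_grid) - x_grid ** 2) ** 2)

linear_mses = {n: mse_of_linear_fit(n) for n in (100, 1000, 10000)}
for n, m in linear_mses.items():
    print(f"linear,   n={n:6d}: test MSE {m:.3f}")

# a degree-2 model removes the bias even with modest data
x = rng.uniform(-2, 2, 100)
y = x ** 2 + rng.normal(0, 0.3, 100)
coeffs = np.polyfit(x, y, deg=2)
x_grid = np.linspace(-2, 2, 500)
quad_mse = np.mean((np.polyval(coeffs, x_grid) - x_grid ** 2) ** 2)
print(f"degree-2, n=100 : test MSE {quad_mse:.3f}")
```

The linear model's error barely changes from 100 to 10,000 samples, while switching to the right model class with only 100 samples collapses the error — more capacity, not more data, was the fix.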
The Diagnostic Checklist
Regularization (like L2/Ridge) reduces overfitting by:
💡 What does penalizing large weights do to model flexibility?
L2 regularization adds λ||w||² to the loss, penalizing large weights. This prevents the model from fitting every noise bump (reduces variance) at the cost of slightly higher bias (it can't fit as freely). The λ parameter controls this tradeoff: λ = 0 → no regularization (low bias, high variance), λ → ∞ → all weights → 0 (high bias, low variance). Cross-validation finds the optimal λ.
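The closed-form ridge solution makes the λ knob tangible. This sketch uses w = (XᵀX + λI)⁻¹Xᵀy directly (the degree-9 polynomial features and λ grid are illustrative assumptions):

```python
import numpy as np

rng = np.random.default_rng(3)

# polynomial features of noisy data — flexible enough to overfit
x = rng.uniform(-1, 1, 30)
y = np.sin(3 * x) + rng.normal(0, 0.3, 30)
X = np.vander(x, 10, increasing=True)  # degree-9 polynomial features

def ridge_weights(X, y, lam):
    """Closed-form ridge solution: w = (XᵀX + λI)⁻¹ Xᵀy."""
    d = X.shape[1]
    return np.linalg.solve(X.T @ X + lam * np.eye(d), X.T @ y)

for lam in (0.0, 0.1, 10.0, 1000.0):
    w = ridge_weights(X, y, lam)
    print(f"λ = {lam:7.1f}: ‖w‖ = {np.linalg.norm(w):10.3f}")
```

The weight norm shrinks monotonically as λ grows: at λ = 0 the model fits freely (low bias, high variance); as λ → ∞ the weights are crushed toward zero (high bias, low variance), exactly the tradeoff described above.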
🎓 What You Now Know
✓ Error = Bias² + Variance + Noise — Three components of prediction error; only the noise term is irreducible.
✓ Bias = systematic error (underfitting) — Model is too simple to capture the pattern.
✓ Variance = sensitivity to data (overfitting) — Model memorizes noise.
✓ Diagnose with the train/test gap — Both errors high with a small gap = bias. A large gap = variance.
✓ More data fixes variance, NOT bias — Know which problem you have before acting.
Every ML decision you make — model selection, regularization, feature engineering, data collection — is a choice about the bias-variance tradeoff. Understanding it deeply is what separates someone who guesses from someone who knows. 🎯
↗ Keep Learning
Polynomial Regression — When Lines Aren't Enough
A scroll-driven visual deep dive into polynomial regression. See why straight lines fail, how curves capture nonlinear patterns, and when you're overfitting vs underfitting.
Ridge & Lasso — Taming Overfitting with Regularization
A scroll-driven visual deep dive into Ridge and Lasso regression. Learn why models overfit, how penalizing large weights fixes it, and why Lasso kills features.
Cross-Validation & Hyperparameter Tuning — How to Actually Evaluate Models
A scroll-driven visual deep dive into cross-validation and hyperparameter tuning. Learn K-fold CV, stratified splitting, grid search, random search, and Bayesian optimization.