jzhao.xyz

Recent Writing

2024: Centering
Dec 23, 2024
Taste is a guide for what is worthwhile
Jan 14, 2024
Agentic Computing
Nov 29, 2022
Building a BFT JSON CRDT
Nov 16, 2022

See 21 more →

Recent Notes

TrueTime
May 26, 2025
Concurrency control
May 26, 2025

See 735 more →

k-Nearest Neighbours (KNN)

Sep 23, 20221 min read

seed
CPSC340

To classify an example, we find the $k$ examples closest to the example and take the mode of the $k$ examples.

Works based off of the assumption that similar features are likely to have similar labels

Effects on fundamental tradeoff:

As $k$ grows, training error increases and approximation error decreases.
As $n$ grows, model complexity increases

We measure distance using the “norm” between feature vectors. The most common norm is the L2-Norm or Euclidean Norm

Performance

$O (1)$ training (just relies on training data)
$O (n d)$ predictions ( $O (d)$ distance calculations for all $n$ examples)
$O (n d)$ space to store each training example in memory
- This is non-parametric

KNN can suck in high dimensions (see: curse of dimensionality)

Recent Writing

2024: Centering
Dec 23, 2024
Taste is a guide for what is worthwhile
Jan 14, 2024
Agentic Computing
Nov 29, 2022
Building a BFT JSON CRDT
Nov 16, 2022

See 21 more →

Recent Notes

TrueTime
May 26, 2025
Concurrency control
May 26, 2025

See 735 more →

Graph View

Backlinks

Object Classification
Supervised learning

Created with Quartz v4.5.1 © 2025

GitHub
Twitter