jzhao.xyz

Recent Writing

2024: Centering
Dec 23, 2024
Taste is a guide for what is worthwhile
Jan 14, 2024
Agentic Computing
Nov 29, 2022
Building a BFT JSON CRDT
Nov 16, 2022

See 21 more →

Recent Notes

TrueTime
May 26, 2025
Concurrency control
May 26, 2025

See 735 more →

Hyper-parameter Optimization

Oct 21, 2022 · 1 min read

seed
CPSC340

How do we efficiently find the “best” hyper-parameters?

More complicated models have even more hyper-parameters. This makes searching all values expensive, and trying many configurations also increases the risk of over-fitting to the validation set.
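To make the cost concrete, here is a rough count. The symbols $d$ (number of hyper-parameters) and $k$ (candidate values tried per hyper-parameter) are assumptions introduced for illustration, not notation from the course:

$$
\text{number of combinations} = k^d, \qquad \text{e.g. } k = 10,\ d = 2 \Rightarrow 100, \qquad k = 10,\ d = 5 \Rightarrow 100{,}000.
$$

Each combination requires training and validating a model, so the cost grows exponentially in the number of hyper-parameters.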

Simplest approaches:

Exhaustive search: try all combinations among a fixed set of $σ$ and $λ$ values.
Random search: try random values for each hyper-parameter and keep the best.
Stochastic local search: generic global optimization methods (simulated annealing, genetic algorithms, and so on).
Coordinate search: optimize one hyper-parameter at a time, keeping the others fixed, and cycle through the hyper-parameters repeatedly.
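Three of the approaches above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the course's implementation: `validation_error` is a hypothetical stand-in for "train a model with $σ$ and $λ$, then measure validation error", and the candidate grids and sampling ranges are made up. Stochastic local search is left out because it depends on the specific method chosen.

```python
import itertools
import random


def validation_error(sigma, lam):
    """Hypothetical stand-in: a real version would train a model with
    (sigma, lam) and return its error on a validation set."""
    return (sigma - 2.0) ** 2 + (lam - 0.1) ** 2


# Assumed candidate values, chosen only for illustration.
SIGMAS = [0.5, 1.0, 2.0, 4.0]
LAMBDAS = [0.01, 0.1, 1.0, 10.0]


def exhaustive_search(sigmas, lambdas):
    """Try every (sigma, lam) combination from the fixed sets."""
    grid = itertools.product(sigmas, lambdas)
    return min(grid, key=lambda pair: validation_error(*pair))


def random_search(n_trials, rng):
    """Try n_trials random pairs, sampled log-uniformly from assumed ranges."""
    candidates = [
        (10 ** rng.uniform(-1, 1), 10 ** rng.uniform(-3, 1))
        for _ in range(n_trials)
    ]
    return min(candidates, key=lambda pair: validation_error(*pair))


def coordinate_search(sigmas, lambdas, n_passes=3):
    """Optimize one hyper-parameter at a time, holding the other fixed,
    and repeat the sweep n_passes times."""
    sigma, lam = sigmas[0], lambdas[0]
    for _ in range(n_passes):
        sigma = min(sigmas, key=lambda s: validation_error(s, lam))
        lam = min(lambdas, key=lambda l: validation_error(sigma, l))
    return sigma, lam


if __name__ == "__main__":
    print("exhaustive:", exhaustive_search(SIGMAS, LAMBDAS))
    print("random:    ", random_search(20, random.Random(0)))
    print("coordinate:", coordinate_search(SIGMAS, LAMBDAS))
```

Exhaustive search here evaluates all 16 combinations, while each pass of coordinate search evaluates only 8. The trade-off is that coordinate search can stall at a point where no single-coordinate change helps, even though a joint change would.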

Backlinks

Machine Learning

Created with Quartz v4.5.1 © 2025
