jzhao.xyz

Recent Writing

2024: Centering
Dec 23, 2024
Taste is a guide for what is worthwhile
Jan 14, 2024
Agentic Computing
Nov 29, 2022
Building a BFT JSON CRDT
Nov 16, 2022

See 21 more →

Recent Notes

TrueTime
May 26, 2025
Concurrency control
May 26, 2025

See 735 more →

Random Forest

Sep 23, 2022 (1 min read)

seed
CPSC340

Random forests are an example of an Ensemble method. They are non-parametric.

They work by taking a vote from a set of deep decision trees. Two key ingredients help ensure that the deep decision trees make independent errors:

Bootstrap sampling: generate different “versions” of your dataset
- Usually done by sampling $n$ examples with replacement from the original $n$ examples; the result is a bootstrap sample
- On average, this maintains roughly the same distribution as the original
Random Trees: grow decision trees that incorporate some randomness
- Randomly sample a small number of possible features (typically $\sqrt{d}$)
- Only consider these random features when searching for the optimal rule, so splits will tend to use different features in different trees
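
The two ingredients and the final vote can be sketched in plain Python. This is a minimal illustration, not a reference implementation. All function names are hypothetical, and several details are assumptions that the note does not specify: Gini impurity as the split score, threshold rules of the form `feature <= t`, resampling the features at every split, and $k = \text{round}(\sqrt{d})$ features per split.

```python
import math
import random
from collections import Counter

def bootstrap_sample(X, y, rng):
    """Draw n examples with replacement from an n-example dataset."""
    n = len(X)
    idx = [rng.randrange(n) for _ in range(n)]
    return [X[i] for i in idx], [y[i] for i in idx]

def gini(labels):
    """Gini impurity of a list of labels (assumed split score)."""
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())

def best_split(X, y, features):
    """Search only the given features for the rule with the lowest weighted Gini."""
    best = None  # (score, feature index, threshold)
    for j in features:
        for t in sorted({row[j] for row in X}):
            left = [lab for row, lab in zip(X, y) if row[j] <= t]
            right = [lab for row, lab in zip(X, y) if row[j] > t]
            if not left or not right:
                continue
            score = (len(left) * gini(left) + len(right) * gini(right)) / len(y)
            if best is None or score < best[0]:
                best = (score, j, t)
    return best

def grow_tree(X, y, rng, k):
    """Grow a deep tree; every split considers k randomly sampled features."""
    if len(set(y)) == 1:
        return y[0]  # pure leaf
    features = rng.sample(range(len(X[0])), k)
    split = best_split(X, y, features)
    if split is None:  # sampled features cannot separate the examples
        return Counter(y).most_common(1)[0][0]
    _, j, t = split
    left = [(r, lab) for r, lab in zip(X, y) if r[j] <= t]
    right = [(r, lab) for r, lab in zip(X, y) if r[j] > t]
    return (j, t,
            grow_tree([r for r, _ in left], [lab for _, lab in left], rng, k),
            grow_tree([r for r, _ in right], [lab for _, lab in right], rng, k))

def predict_tree(tree, row):
    """Follow the rules down to a leaf (labels must not be tuples)."""
    while isinstance(tree, tuple):
        j, t, left, right = tree
        tree = left if row[j] <= t else right
    return tree

def fit_forest(X, y, n_trees=25, seed=0):
    """Fit each tree on its own bootstrap sample with random feature subsets."""
    rng = random.Random(seed)
    k = max(1, round(math.sqrt(len(X[0]))))  # assumed default: sqrt(d) features
    return [grow_tree(*bootstrap_sample(X, y, rng), rng, k) for _ in range(n_trees)]

def predict_forest(forest, row):
    """Majority vote over the predictions of all trees."""
    votes = Counter(predict_tree(tree, row) for tree in forest)
    return votes.most_common(1)[0][0]

# Toy data: two well-separated classes with d = 4 features.
X = [[i * 0.1 + j * 0.01 for j in range(4)] for i in range(10)]
X += [[10 + i * 0.1 + j * 0.01 for j in range(4)] for i in range(10)]
y = [0] * 10 + [1] * 10

forest = fit_forest(X, y)
print(predict_forest(forest, [0.0, 0.0, 0.0, 0.0]))
print(predict_forest(forest, [20.0, 20.0, 20.0, 20.0]))
```

Because each tree sees a different bootstrap sample and a different feature subset at each split, the trees tend to disagree in different places, which is what makes the vote useful.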

Backlinks

Ensemble method
Clustering

Created with Quartz v4.5.1 © 2025
