See also: LLMs

Chain of Thought Prompting

ArXiv link: https://arxiv.org/abs/2201.11903

  • Motivation: flat scaling curves on reasoning tasks, where simply increasing model scale does not lead to substantive performance gains
  • Chain of thought prompting facilitates multi-step reasoning in large language models
    • The intuition is that a chain of thought allows language models to decompose a multi-step problem into intermediate steps that are solved individually, instead of solving an entire multi-hop problem in a single forward pass (see the prompt sketch after this list)
    • Another intuition behind chain of thought reasoning is that it allows the model to spend more computation (i.e., intermediate tokens) solving harder problems (though later sections of the paper rule this out as the primary factor for performance improvements)
  • Notably, chain of thought prompting outperforms standard prompting only at the scale of ~100B parameters
  • A really cool side effect is a more explainable decision-making process
    • Fully characterizing a model’s computations that support an answer remains an open question
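
A minimal sketch of the difference between standard and chain-of-thought few-shot prompts, using the tennis-ball and cafeteria exemplars from the paper's Figure 1. The build_prompt helper is a hypothetical convenience for illustration, and the actual model call is omitted:

```python
# Standard few-shot exemplar: the answer is given directly, with no
# intermediate reasoning for the model to imitate.
STANDARD_EXEMPLAR = (
    "Q: Roger has 5 tennis balls. He buys 2 more cans of tennis balls. "
    "Each can has 3 tennis balls. How many tennis balls does he have now?\n"
    "A: The answer is 11."
)

# Chain-of-thought exemplar: the same question, but the answer spells out
# the intermediate steps, so the model decomposes the problem instead of
# answering in a single hop.
COT_EXEMPLAR = (
    "Q: Roger has 5 tennis balls. He buys 2 more cans of tennis balls. "
    "Each can has 3 tennis balls. How many tennis balls does he have now?\n"
    "A: Roger started with 5 balls. 2 cans of 3 tennis balls each is "
    "6 tennis balls. 5 + 6 = 11. The answer is 11."
)


def build_prompt(exemplar: str, question: str) -> str:
    """Prepend a few-shot exemplar to a new question (hypothetical helper)."""
    return f"{exemplar}\n\nQ: {question}\nA:"


question = (
    "The cafeteria had 23 apples. If they used 20 to make lunch and "
    "bought 6 more, how many apples do they have?"
)

print(build_prompt(STANDARD_EXEMPLAR, question))  # tends to elicit just an answer
print(build_prompt(COT_EXEMPLAR, question))       # tends to elicit reasoning steps
```

The only difference between the two prompts is whether the exemplar's answer includes intermediate steps; at sufficient scale, the model imitates whichever format it is shown, and the generated chain doubles as an inspectable trace of how it reached the answer.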

See: OODA