Given data, we want to generate more data that looks like it

The last 10 years have seen a variety of new deep generative models:

  • Variational autoencoders (VAEs)
  • Generative adversarial networks (GANs)
  • Normalizing flows
  • Diffusion models
    • Take training images and add noise to them in a sequence of steps, until the image is essentially indistinguishable from random noise.
    • Train a neural network to reverse those steps.
    • Generate a new image by starting from random noise and applying the network step by step (a minimal sketch follows this list).
  • Text-guided diffusion
    • A diffusion model starts from randomly sampled Gaussian noise, so on its own there is no way to guide the process toward a specific image. We can augment the process with textual embeddings (see the conditioned-sampling sketch after this list).
    • Compute an image encoding and a text encoding for each image-caption pair.
    • Train the encoders to maximize the cosine similarity of matching image-caption pairs (and minimize it for mismatched pairs); see the contrastive sketch below.
    • Once this step is finished, the encoder model is frozen.
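
To make the denoising recipe concrete, here is a minimal DDPM-style sketch in PyTorch. It is illustrative only: `eps_model` stands in for any noise-prediction network, and the step count, linear beta schedule, and epsilon-prediction parameterization are assumptions for the sketch, not details from the notes above.

```python
import torch
import torch.nn.functional as F

T = 1000                                   # number of noising steps (assumed)
betas = torch.linspace(1e-4, 0.02, T)      # linear noise schedule (assumed)
alphas = 1.0 - betas
alpha_bar = torch.cumprod(alphas, dim=0)   # cumulative products of alphas

def add_noise(x0, t):
    """Forward process: jump straight to step t via the closed form
    x_t = sqrt(alpha_bar_t) * x0 + sqrt(1 - alpha_bar_t) * eps."""
    eps = torch.randn_like(x0)
    ab = alpha_bar[t].view(-1, 1, 1, 1)
    return ab.sqrt() * x0 + (1 - ab).sqrt() * eps, eps

def train_step(eps_model, x0, opt):
    """Train the network to predict the noise that was added."""
    t = torch.randint(0, T, (x0.shape[0],))
    xt, eps = add_noise(x0, t)
    loss = F.mse_loss(eps_model(xt, t), eps)
    opt.zero_grad(); loss.backward(); opt.step()
    return loss.item()

@torch.no_grad()
def sample(eps_model, shape):
    """Reverse process: start from pure noise and denoise step by step."""
    x = torch.randn(shape)
    for t in reversed(range(T)):
        z = torch.randn_like(x) if t > 0 else torch.zeros_like(x)
        eps = eps_model(x, torch.full((shape[0],), t))
        coef = betas[t] / (1 - alpha_bar[t]).sqrt()
        x = (x - coef * eps) / alphas[t].sqrt() + betas[t].sqrt() * z
    return x
```

In the original DDPM work, `eps_model` is a U-Net that also embeds the timestep; any network with that interface works with the sketch.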
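The image-caption pretraining bullets describe a CLIP-style contrastive objective. Below is a hedged sketch of one loss computation: `img_enc` and `txt_enc` are placeholder encoders producing embeddings of equal dimension, and the temperature value is an illustrative default, none of which come from the notes.

```python
import torch
import torch.nn.functional as F

def clip_loss(img_enc, txt_enc, images, captions, temperature=0.07):
    img = F.normalize(img_enc(images), dim=-1)    # unit-norm image embeddings
    txt = F.normalize(txt_enc(captions), dim=-1)  # unit-norm text embeddings
    logits = img @ txt.t() / temperature          # cosine similarities, scaled
    targets = torch.arange(len(images))           # pair i matches caption i
    # Matching pairs sit on the diagonal of `logits`; symmetric cross-entropy
    # pulls them together and pushes mismatched pairs apart.
    return (F.cross_entropy(logits, targets)
            + F.cross_entropy(logits.t(), targets)) / 2
```

Because cross-entropy rewards the diagonal entries relative to every off-diagonal one, maximizing matching-pair similarity and minimizing mismatched-pair similarity happen in a single objective.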
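Finally, a sketch of how the frozen text embedding can steer generation. The conditioning mechanism is an assumption not spelled out in the notes: here the denoiser simply accepts the caption embedding `c` as an extra input, and the noise schedule (`T`, `betas`, `alphas`, `alpha_bar`) is reused from the diffusion sketch above.

```python
@torch.no_grad()
def sample_guided(eps_model, txt_enc, caption, shape):
    """Text-conditioned reverse process (hypothetical interface)."""
    c = txt_enc(caption)                    # embedding from the frozen encoder
    x = torch.randn(shape)                  # start from pure Gaussian noise
    for t in reversed(range(T)):
        z = torch.randn_like(x) if t > 0 else torch.zeros_like(x)
        eps = eps_model(x, torch.full((shape[0],), t), c)  # condition on c
        coef = betas[t] / (1 - alpha_bar[t]).sqrt()
        x = (x - coef * eps) / alphas[t].sqrt() + betas[t].sqrt() * z
    return x
```

The loop is identical to unconditional sampling; only the extra input `c` changes, which is what lets the same reverse process be steered toward different captions.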