What are Diffusion Models?
Ari Seff
About
I'm a research scientist at OpenAI and previously completed a PhD at Princeton in machine learning. I’m using this channel to make short tutorials about various ML or related math topics. Stay tuned for new videos!
Video Description
This short tutorial covers the basics of diffusion models, a simple yet expressive approach to generative modeling. They've been behind a recent string of impressive results, including OpenAI's DALL-E 2, Google's Imagen, and Stable Diffusion.

Errata: At 12:39, parentheses are missing around the difference: \epsilon(x, t, y) - \epsilon(x, t, \empty). See https://i.imgur.com/PhUxugm.png for corrected version.

Timestamps:
0:00 - Intro
1:07 - Forward process
3:07 - Posterior of forward process
4:16 - Reverse process
5:34 - Variational lower bound
9:26 - Reduced variance objective
10:27 - Reverse step implementation
11:38 - Conditional generation
13:45 - Comparison with other deep generative models
14:34 - Connection to score matching models

Special thanks to Jonathan Ho and Elmira Amirloo for feedback on this video.

Papers:
Feller, 1949: On the Theory of Stochastic Processes, with Particular Reference to Applications (https://digitalassets.lib.berkeley.edu/math/ucb/text/math_s1_article-21.pdf)
Sohl-Dickstein et al., 2015: Deep Unsupervised Learning using Nonequilibrium Thermodynamics (https://arxiv.org/abs/1503.03585)
Ho et al., 2020: Denoising Diffusion Probabilistic Models (https://arxiv.org/abs/2006.11239)
Song & Ermon, 2019: Generative Modeling by Estimating Gradients of the Data Distribution (https://arxiv.org/abs/1907.05600)
Dhariwal & Nichol, 2021: Diffusion Models Beat GANs on Image Synthesis (https://arxiv.org/abs/2105.05233)
Nichol et al., 2021: GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models (https://arxiv.org/abs/2112.10741)
Saharia et al., 2021: Palette: Image-to-Image Diffusion Models (https://arxiv.org/abs/2111.05826)
Ramesh et al., 2022: Hierarchical Text-Conditional Image Generation with CLIP Latents (https://arxiv.org/abs/2204.06125)
Saharia et al., 2022: Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding (https://arxiv.org/abs/2205.11487)
Song et al., 2021: Denoising Diffusion Implicit Models (https://arxiv.org/abs/2010.02502)
Nichol & Dhariwal, 2021: Improved Denoising Diffusion Probabilistic Models (https://arxiv.org/abs/2102.09672)
Kingma et al., 2021: Variational Diffusion Models (https://arxiv.org/abs/2107.00630)
Song et al., 2021: Score-Based Generative Modeling through Stochastic Differential Equations (https://arxiv.org/abs/2011.13456)

Links:
YouTube: https://www.youtube.com/ariseffai
Twitter: https://twitter.com/ari_seff
Homepage: https://www.ariseff.com

If you'd like to help support the channel (completely optional), you can donate a cup of coffee via the following:
Venmo: https://venmo.com/ariseff
PayPal: https://www.paypal.me/ariseff