4-Bit Training for Billion-Parameter LLMs? Yes, Really.
AI Coffee Break with Letitia
View ChannelAbout
Lighthearted bite-sized ML videos for your AI Coffee Break! πΊ Mostly videos about the latest technical advancements in AI, such as large language models (LLMs), text-to-image models and everything cool in natural language processing, computer vision, etc.! We try to post twice a month! π€ But you know, Letitia has a full-time job, and Ms. Coffee Bean tends to enjoy time off to go out and have fun. π Disclaimer: Opinions expressed are solely my own and do not express the views or opinions of my employer. Impressum: https://aicoffeebreak.com/impressum.html
Latest Posts
Video Description
π Check out Simplilearnβs SkillUp FREE courses (sponsor): https://www.simplilearn.com/skillup-free-online-courses?utm_campaign=AICoffeeBreak_Description&utm_medium=INFLCR_SkillUP&utm_source=Youtube Video summary: πΊ We all know quantization works at inference time, but researchers successfully trained a 13-billion-parameter LLaMA 2 model using FP4 precisionβyes, just 16 values per number! In this video, we explain and break down the paper. Check it out if you want to learn something about quantization and low/mixed-precision training in general! AI Coffee Break Merch! ποΈ https://aicoffeebreak.creator-spring.com/ π FP4 training: Ruizhe Wang, Yeyun Gong, Xiao Liu, Guoshuai Zhao, Ziyue Yang, Baining Guo, Zhengjun Zha, and Peng Cheng. "Optimizing Large Language Model Training Using FP4 Quantization." (2025) https://arxiv.org/abs/2501.17116 πFP8 training: Charlie Blake, Constantin Eichenberg, Josef Dean, Lukas Balles, Luke Y. Prince, BjΓΆrn Deiseroth, Andres Felipe Cruz-Salinas, Carlo Luschi, Samuel Weinbach, and Douglas Orr. "u-$\mu $ P: The Unit-Scaled Maximal Update Parametrization." (2024) https://arxiv.org/abs/2407.17465 Outline: 00:00 Training with FP4 quantization 02:02 Simplilearn (Sponsor) 03:25 Training LLMs in FP4 β Motivation 08:14 Step 1: Quantize the matrix multiplications 10:22 Step 2: Handle the outliers in activations 11:44 Step 3: Make quantization differentiable 13:00 Putting it all together 13:33 Results 14:14 Impact Thanks to our Patrons who support us in Tier 2, 3, 4: π Vignesh Valliappan, Michael, Sunny Dhiana, Andy Ma ββββββββββββββββββββββββββ π₯ Optionally, pay us a coffee to help with our Coffee Bean production! β Patreon: https://www.patreon.com/AICoffeeBreak Ko-fi: https://ko-fi.com/aicoffeebreak Join this channel as a Bean Member to get access to perks: https://www.youtube.com/channel/UCobqgqE4i5Kf7wrxRxhToQA/join ββββββββββββββββββββββββββ π Links: AICoffeeBreakQuiz: https://www.youtube.com/c/AICoffeeBreak/community Twitter / X: https://twitter.com/AICoffeeBreak LinkedIn: https://www.linkedin.com/in/letitia-parcalabescu/ Threads: https://www.threads.net/@ai.coffee.break Bluesky: https://bsky.app/profile/aicoffeebreak.bsky.social Reddit: https://www.reddit.com/r/AICoffeeBreak/ YouTube: https://www.youtube.com/AICoffeeBreak Substack: https://aicoffeebreakwl.substack.com/ Web: https://explanationmark.de/letitia https://aicoffeebreak.com #AICoffeeBreak #MsCoffeeBean #MachineLearning #AI #researchβ Video editing: Nils Trost
Transform Your LLM Training Today
AI-recommended products based on this video

Seasonic Focus V4 GX-1000 (ATX3) - 1000W - 80+ Gold - ATX 3.0 & PCIe 5.1 Ready -Full-Modular -ATX Form Factor -Premium Japanese Capacitor -10 Year Warranty -Nvidia RTX 30/40 Super & AMD GPU Compatible

GMKtec AD-GP1 External GPU Docking Station, eGPU Enclosure with AMD Radeon 7600M XT GPU Graphics Card, HDMI2.1, DisplayPort2.0, Oculink, USB4, eGPU Dock for Mini PC Laptop Notebook Game Console

MSI Ultra-Slim Thin 15 VR-Ready High FPS Gaming Laptop, 15.6 FHD 144Hz, Intel Core i5-13420H, NVIDIA GeForce RTX 4060, 32GB RAM, 2TB SSD, Backlit KB, Wi-Fi 6, Bundle with PCO Notebook Fold Radiator

acer Nitro 50 N50-620-UA91 Gaming Desktop | 11th Gen Intel Core i5-11400F 6-Core Processor | NVIDIA GeForce GTX 1650 | 8GB DDR4 | 512GB NVMe M.2 SSD | Intel Wi-Fi 6 AX201 | Keyboard and Mouse

GIGABYTE - AORUS Elite 16 Gaming Laptop - 165Hz 2560x1600 WQXGA - NVIDIA GeForce RTX 5070 - Intel Core Ultra 9 275HX - 1TB SSD with 32GB DDR5 RAM - Windows 11 Home AD (AORUS Elite 16 BWHC3USC64SH)

Dell G16 Gaming Laptop 7630-16-inch QHD+ 240Hz 3ms Display, Intel Core i9-13900HX, 32GB DDR5 RAM, 1TB SSD, NVIDIA GeForce RTX 4060 8GB GDDR6, Windows 11 Home, Onsite Service - Metallic Nightshade

Handcrafted Boulder Block: Build Strength and Precision, 3D Rock Climbing Ball, Rock Climbing Training Balls for Take It Out Anytime, Finger Strength Training Climbing Ball Gifts for Rock Climbers

KEXIN 64GB USB Flash Drive 3 Pack - Swivel Thumb Drives with LED Indicator, High-Speed USB 2.0 (Pink/Yellow/Cyan) for Data Storage, Bulk Pen Drives Multi-Color Pack

PCCOOLER CPS YS1200 Power Supply, 1200W 80 Plus Gold Certified Fully Modular PCIe 5.1 & ATX 3.1 Gaming PSU, Wide Compatibility, 135mm FDB Fan, Full Japan Electrolytic Capacitors 12 Year Warranty Black


