The Man Who Might SOLVE AI Alignment — Dr. Steven Byrnes, AGI Safety Researcher @ Astera Institute

Doom Debates August 1, 2025

Doom Debates

About

It's time to worry about the end of the world. With your host, Liron Shapira.

Video Description

Dr. Steven Byrnes, UC Berkeley physics PhD and Harvard physics postdoc, is an AI safety researcher at the Astera Institute and one of the most rigorous thinkers working on the technical AI alignment problem. Steve has a whopping 90% P(Doom), but unlike most AI safety researchers who focus on current LLMs, he argues that LLMs will plateau before becoming truly dangerous, and the real threat will come from next-generation "brain-like AGI" based on actor-critic reinforcement learning.

For the last five years, he's been diving deep into neuroscience to reverse engineer how human brains actually work, and how to use that knowledge to solve the technical AI alignment problem. He's one of the few people who both understands why alignment is hard and is taking a serious technical shot at solving it.

We cover his "two subsystems" model of the brain, why current AI safety approaches miss the mark, his disagreements with social evolution approaches, and why understanding human neuroscience matters for building aligned AGI.

00:00:00 - Cold Open: Solving the technical alignment problem
00:00:26 - Introducing Dr. Steven Byrnes and his impressive background
00:01:59 - Steve's unique mental strengths
00:04:08 - The cold fusion research story demonstrating Steve's approach
00:06:18 - How Steve got interested in neuroscience through Jeff Hawkins
00:08:18 - Jeff Hawkins' cortical uniformity theory and brain vs deep learning
00:11:45 - When Steve first encountered Eliezer's sequences and became AGI-pilled
00:15:11 - Steve's research direction: reverse engineering human social instincts
00:21:47 - Four visions of alignment success and Steve's preferred approach
00:29:00 - The two brain subsystems model: steering brain vs learning brain
00:35:30 - Brain volume breakdown and the learning vs steering distinction
00:38:43 - Cerebellum as the "LLM" of the brain doing predictive learning
00:46:44 - Language acquisition: Chomsky vs learning algorithms debate
00:54:13 - What LLMs fundamentally can't do: complex context limitations
01:07:17 - Hypothalamus and brainstem doing more than just homeostasis
01:13:45 - Why morality might just be another hypothalamus cell group
01:18:00 - Human social instincts as model-based reinforcement learning
01:22:47 - Actor-critic reinforcement learning mapped to brain regions
01:29:33 - Timeline predictions: when brain-like AGI might arrive
01:38:28 - Why humans still beat AI on strategic planning and domain expertise
01:47:27 - Inner vs outer alignment: cocaine example and reward prediction
01:55:13 - Why legible Python code beats learned reward models
02:00:45 - Outcome pumps, instrumental convergence, and the Stalin analogy
02:11:48 - What’s Your P(Doom)™
02:16:45 - Massive headroom above human intelligence
02:20:45 - Can AI take over without physical actuators? (Yes)
02:26:18 - Steve's bold claim: 30 person-years from proto-AGI to superintelligence
02:32:17 - Why overhang makes the transition incredibly dangerous
02:35:00 - Social evolution as alignment solution: why it won't work
02:46:47 - Steve's research program: legible reward functions vs RLHF
02:59:52 - AI policy discussion: why Steven is skeptical of pause AI
03:05:51 - Lightning round: offense vs defense, P(simulation), AI unemployment
03:12:42 - Thanking Steve and wrapping up the conversation
03:13:30 - Liron's outro: Supporting the show and upcoming episodes with Vitalik and Eliezer

Episode links and transcript: https://lironshapira.substack.com/p/the-man-who-might-solve-ai-alignment
