Do you think that ChatGPT can reason? [Prof. Subbarao Kambhampati]
About
No channel description available.
Video Description
Prof. Subbarao Kambhampati argues that while LLMs are impressive and useful tools, especially for creative tasks, they have fundamental limitations in logical reasoning and cannot provide guarantees about the correctness of their outputs. He advocates for hybrid approaches that combine LLMs with external verification systems. MLST is sponsored by Brave: The Brave Search API covers over 20 billion webpages, built from scratch without Big Tech biases or the recent extortionate price hikes on search API access. Perfect for AI model training and retrieval augmentated generation. Try it now - get 2,000 free queries monthly at http://brave.com/api. This is 2/13 of our #ICML2024 series TOC [00:00:00] Intro [00:02:06] Bio [00:03:02] LLMs are n-gram models on steroids [00:07:26] Is natural language a formal language? [00:08:34] Natural language is formal? [00:11:01] Do LLMs reason? [00:19:13] Definition of reasoning [00:31:40] Creativity in reasoning [00:50:27] Chollet's ARC challenge [01:01:31] Can we reason without verification? [01:10:00] LLMs cant solve some tasks [01:19:07] LLM Modulo framework [01:29:26] Future trends of architecture [01:34:48] Future research directions Pod: https://podcasters.spotify.com/pod/show/machinelearningstreettalk/episodes/Prof--Subbarao-Kambhampati---LLMs-dont-reason--they-memorize-ICML2024-213-e2mjcse Subbarao Kambhampati: https://x.com/rao2z Interviewer: Dr. Tim Scarfe Refs: Can LLMs Really Reason and Plan? https://cacm.acm.org/blogcacm/can-llms-really-reason-and-plan/ On the Planning Abilities of Large Language Models : A Critical Investigation https://arxiv.org/pdf/2305.15771 Chain of Thoughtlessness? An Analysis of CoT in Planning https://arxiv.org/pdf/2405.04776 On the Self-Verification Limitations of Large Language Models on Reasoning and Planning Tasks https://arxiv.org/pdf/2402.08115 LLMs Can't Plan, But Can Help Planning in LLM-Modulo Frameworks https://arxiv.org/pdf/2402.01817 Embers of Autoregression: Understanding Large Language Models Through the Problem They are Trained to Solve https://arxiv.org/pdf/2309.13638 https://arxiv.org/abs/2402.04210 "Task Success" is not Enough Faith and Fate: Limits of Transformers on Compositionality "finetuning multiplication with four digit numbers" (added after pub) https://arxiv.org/pdf/2305.18654 Partition function (number theory) (Srinivasa Ramanujan and G.H. Hardy's work) https://en.wikipedia.org/wiki/Partition_function_(number_theory) Poincaré conjecture https://en.wikipedia.org/wiki/Poincar%C3%A9_conjecture Gödel's incompleteness theorems https://en.wikipedia.org/wiki/G%C3%B6del%27s_incompleteness_theorems ROT13 (Rotate13, "rotate by 13 places") https://en.wikipedia.org/wiki/ROT13 A Mathematical Theory of Communication (C. E. SHANNON) https://people.math.harvard.edu/~ctm/home/text/others/shannon/entropy/entropy.pdf Sparks of AGI https://arxiv.org/abs/2303.12712 Kambhampati thesis on speech recognition (1983) https://rakaposhi.eas.asu.edu/rao-btech-thesis.pdf PlanBench: An Extensible Benchmark for Evaluating Large Language Models on Planning and Reasoning about Change https://arxiv.org/abs/2206.10498 Explainable human-AI interaction https://link.springer.com/book/10.1007/978-3-031-03767-2 Tree of Thoughts https://arxiv.org/abs/2305.10601 On the Measure of Intelligence (ARC Challenge) https://arxiv.org/abs/1911.01547 Getting 50% (SoTA) on ARC-AGI with GPT-4o (Ryan Greenblatt ARC solution) https://redwoodresearch.substack.com/p/getting-50-sota-on-arc-agi-with-gpt PROGRAMS WITH COMMON SENSE (John McCarthy) - "AI should be an advice taker program" https://www.cs.cornell.edu/selman/cs672/readings/mccarthy-upd.pdf Original chain of thought paper https://arxiv.org/abs/2201.11903 ICAPS 2024 Keynote: Dale Schuurmans on "Computing and Planning with Large Generative Models" (COT) https://www.youtube.com/watch?v=YnMqbpdHcaY The Hardware Lottery (Hooker) https://arxiv.org/abs/2009.06489 A Path Towards Autonomous Machine Intelligence (JEPA/LeCun) https://openreview.net/pdf?id=BZ5a1r-kVsf AlphaGeometry https://www.nature.com/articles/s41586-023-06747-5 FunSearch https://www.nature.com/articles/s41586-023-06924-6 Emergent Abilities of Large Language Models https://arxiv.org/abs/2206.07682 Language models are not naysayers (Negation in LLMs) https://arxiv.org/abs/2306.08189 The Reversal Curse: LLMs trained on "A is B" fail to learn "B is A" https://arxiv.org/abs/2309.12288 Embracing negative results https://openreview.net/forum?id=3RXAiU7sss
Upgrade Your Everyday
AI-recommended products based on this video

Kasa Smart Outdoor Smart Plug by TP-Link (KP400) - Smart WiFi Outlet with 2 Sockets, IP64 Waterproof, Works with Alexa and Google Home, 2.4GHz WiFi Required, No Hub Required, Sunset & Sunrise Offset

Wireless Earbuds, Bluetooth 5.4 Headphones in Ear with 4 ENC Noise Cancelling Mic, in Ear Earphones 40H, IP7 Waterproof, Bluetooth Ear Buds for Sports, Gym, Workout, USB C, White

Wireless Earbuds, Bluetooth 5.4 Headphones in Ear with 4 ENC Noise Cancelling Mic, in Ear Earphones 40H, IP7 Waterproof, USB C, Bluetooth Ear Buds for Sports, Gym, Workout, Rose Pink

Wireless Earbuds, Bluetooth 5.4 Headphones in Ear with 4 ENC Noise Cancelling Mic, in Ear Earphones 40H, IP7 Waterproof, USB C, Bluetooth Ear Buds for Sports, Gym, Workout, Black

Fvyao Sleep Earbuds, Mini Wireless Earbuds in Ear, Sleep Headphones Design for Side Sleeper, 24H Bass Stereo, Bluetooth 5.4, with ENC Noise Cancelling, IPX6 Waterproof Ear Buds for Android iOS

Brita Stainless Steel Premium Filtering Water Bottle, BPA-Free, Reusable, Insulated, Replaces 300 Plastic Water Bottles, Filter Lasts 2 Months or 40 Gallons, Includes 1 Filter, Carbon - 20 oz.

Simple Modern Filtered Water Bottle | Insulated Stainless-Steel Carbon Filter Travel Water Bottles | Reusable for Clean Drinking Water On The Go | 24oz, Sea Glass Sage

Motivaris Fitness Tracker Health Watch, 1.47" Step Counter with 24/7 Heart Rate Blood Oxygen Sleep Monitor, 3ATM Waterproof Smart Watch for Women Men, Pedometer Black

FITVII Health & Fitness Tracker (Answer/Make Calls), Smart Watch with 24/7 Heart Rate and Blood Pressure, Sleep Tracking Monitor, 120+ Sport Mode Activity Tracker

AYATAHA AYATAHA Smart Watch for Kids, Smartwatch Fitness Tracker for Boys Girls, Children's Activity Watch 37 Sports Modes SMS Notification, HD Full Touchscreen IP67 Waterproof, Blue

Iaret Iaret Smart Watch for Women, 1.83" HD Fitness Tracker with 4 Bands, Answer/Make Calls, Heart Rate/Sleep/SpO2/Step Tracking, 100+ Sport Modes, Android/iPhone Compatible Gift (Rose Gold)

Hand Warmers 2 Pack, 14000mAh Rechargeable Hand Warmers, Electric Hand Warmer Reusable, Portable Power Bank USB Hand Warmers 4 Levels 8 Heating, Gifts for Raynauds Ski Golf Camping

Hand Warmers Rechargeable, 10000mAh Electric Heated Gloves Power Bank Portable Graphene Handwarmers Pouch with 3 Levels & Double-Sided Heating for Hunting Camping Golf Xmas Gifts for Women Men Kids

2Pack Rechargeable Hand Warmer, 8000mAh Electric Hand Warmer Power Bank, Portable USB-C Hand Warmer for Pocket, Reusable Hand Warmer Up to 8 hrs Each, Warm Gift for Men Women, for Hunting, Camping

GTOCE Portable Charger,40000mAh Power Bank with 22.5W Fast Charging LED Digital Display Battery Pack with 6 Outputs 2 Inputs, Type C Powerbank Portable Charger for iPhone 16 pro Samsung AirPods,Black

Monster Sleep Ear200, Wireless in-Ear Headphones, Bluetooth 6.0 Sleep Headphones, with ANC Active Noise Cancellation Designed for Side Sleepers, 30 Hours of bass Stereo Sound.

Fvyao Sleep Earbuds, Mini Wireless Earbuds in Ear, Sleep Headphones Design for Side Sleeper, 24H Bass Stereo, Bluetooth 5.4, with ENC Noise Cancelling, IPX6 Waterproof Ear Buds for Android iOS

Monster Sleep Ear100 Ear Buds, Sleep Earbuds with Stereo Sound, Design for Side Sleeper, 32H Playtime, Bluetooth 6.0, ENC Noise Cancelling, IPX6 Waterproof Mini Headphones, White

Monster Sleep Ear100 Ear Buds, Sleep Earbuds with Stereo Sound, Design for Side Sleeper, 32H Playtime, Bluetooth 6.0, ENC Noise Cancelling, IPX6 Waterproof Mini Headphones, Black

Hydroponics Growing System Indoor Garden - Herb Garden with Grow Light, 15 Pods Stainless Steel Indoor Garden Kit, Auto Timer, Gardening Gift for All Ages

Umbra Triflora Hanging Planter for Window, Indoor Herb Garden, Set of 5, White/Black

Large Hydroponics Growing System 14 Pods, Indoor Herb Garden with LED Grow Light, 5L Water Tank, Hydroponic Grow Kit with 3 Auto-Timers, Rotatable Light Panel and Child Lock for Home School Gardening

Hanging Planter Hanging Plant Holder, 6 Inch 4 Indoor Plant Pots, Wall/Window Plant Hanger Indoor Herb Garden

slopehill Multi Hair Stylers & Hair Straightener - 2 in 1 Wet to Dry Air Straightener and Hair Dryer Combo with High Speed Air + Rapid Heat-Up + Customizable Temperature(Pink)

Hi.FANCY Portable Laptop Stand with Dual Cooling Fans for 14-17inch Laptops, Grey, 23.5 x 25.9 x 0.95cm

Laptop Stand for Desk, Adjustable Laptop Riser ABS+Silicone Foldable Portable Laptop Holder, Ventilated Cooling Notebook Stand for 10-15.6” Laptops,Tablet-Black

JETech 5 in 1 Case for Samsung Galaxy S25 Ultra 5G with 2-Pack Each Tempered Glass Screen Protector and Camera Lens Protector, Non-Yellowing Shockproof Bumper Phone Cover (Clear)

TAURI for iPhone 17 Pro Max Case 6.9" with 1-Pack Screen Protector, Camera Lens Full Protection, Military-Grade Protection, Shockproof Transparent Back Bumper Phone Cover - Clear Global Recycled Standard

TAURI for iPhone 17 Pro Case 6.3" with 1-Pack Screen Protector, Camera Lens Full Protection, Military-Grade Protection, Shockproof Transparent Back Bumper Phone Cover - Clear Global Recycled Standard

JOINPAYA 1Set Rechargeable Hand Warmer Hand Heater for Winter Heating Levels Compact

Shakven Rechargeable Hand Warmer | Cute Comfortable Portable Hand Warmers,Ergonomic Adjustable Energy-Efficient Small Heater for Travel, Outdoor, Winter

OCOOPA IP45 Waterproof Hand Warmer Rechargeable, Up to 15hrs Heat,10000mAh Durable Quick Charge Electric Hand Heater, PD Compatible, 3 Levels for Outdoors, Heavy Duty, H01-PD PRO

![Abstraction & Idealization: AI's Plato Problem [Mazviita Chirimuuta]](https://imgz.pc97.com/?width=500&fit=cover&image=https://i.ytimg.com/vi/yq318DIwPqw/hqdefault.jpg)
![Why Every Brain Metaphor in History Has Been Wrong [SPECIAL EDITION]](https://imgz.pc97.com/?width=500&fit=cover&image=https://i.ytimg.com/vi/pO0WZsN8Oiw/hqdefault.jpg)
![AutoGrad Changed Everything (Not Transformers) [Dr. Jeff Beck]](https://imgz.pc97.com/?width=500&fit=cover&image=https://i.ytimg.com/vi/9suqiofCiwM/hqdefault.jpg)
![Why Scientists Can't Rebuild a Polaroid Camera [César Hidalgo]](https://imgz.pc97.com/?width=500&fit=cover&image=https://i.ytimg.com/vi/vzpFOJRteeI/hqdefault.jpg)

![Why High Benchmark Scores Don’t Mean Better AI [SPONSORED]](https://imgz.pc97.com/?width=500&fit=cover&image=https://i.ytimg.com/vi/rqiC9a2z8Io/hqdefault.jpg)
![The Mathematical Foundations of Intelligence [Professor Yi Ma]](https://imgz.pc97.com/?width=500&fit=cover&image=https://i.ytimg.com/vi/QWidx8cYVRs/hqdefault.jpg)

![Tensor Logic "Unifies" AI Paradigms [Pedro Domingos]](https://imgz.pc97.com/?width=500&fit=cover&image=https://i.ytimg.com/vi/4APMGvicmxY/hqdefault.jpg)

![He Co-Invented the Transformer. Now: Continuous Thought Machines [Llion Jones / Luke Darlow]](https://imgz.pc97.com/?width=500&fit=cover&image=https://i.ytimg.com/vi/DtePicx_kFY/hqdefault.jpg)


![We Built Calculators Because We're STUPID! [Prof. David Krakauer]](https://imgz.pc97.com/?width=500&fit=cover&image=https://i.ytimg.com/vi/dY46YsGWMIc/hqdefault.jpg)
![Why Humans Are Still Powering AI [Sponsored] - Phelim Bradley](https://imgz.pc97.com/?width=500&fit=cover&image=https://i.ytimg.com/vi/R11ESdfVX64/hqdefault.jpg)
![The Universal Hierarchy of Life - Prof. Chris Kempes [SFI]](https://imgz.pc97.com/?width=500&fit=cover&image=https://i.ytimg.com/vi/iwClZ-7OweY/hqdefault.jpg)

![Google Researcher Shows Life "Emerges From Code" [Blaise Agüera y Arcas]](https://imgz.pc97.com/?width=500&fit=cover&image=https://i.ytimg.com/vi/rMSEqJ_4EBk/hqdefault.jpg)
![AI training data will never be fully synthetic [SPONSORED]](https://imgz.pc97.com/?width=500&fit=cover&image=https://i.ytimg.com/vi/cnxZZTl1tkk/hqdefault.jpg)
![AI Agents can write 10,000 lines of hacking code in seconds [Dr. Ilia Shumailov]](https://imgz.pc97.com/?width=500&fit=cover&image=https://i.ytimg.com/vi/aoX_pGQMbEM/hqdefault.jpg)