NEW Open Source AI VIdeo, FREE Runway, & Gemini Image Model UPDATES!
Theoretically Media
@theoreticallymediaAbout
Welcome to Theoretically Media, I'm Tim! For Sponsor/Partner Inquires: [email protected] Anything Else: [email protected]
Latest Posts
Video Description
We've got big news about Tencent's open-source multimodal video model, Hunyuan Custom, and its upcoming Open Source Day! Then, I'll give you a quick refresher on what exactly are multimodal models and how they differ from typical diffusion process AI generation – they're a real game-changer. We'll then get a little technical (but I'll keep it breezy!) as I walk you through how Hunyuan Custom actually generates video, from reference images and text prompts to the magic behind VAE, LLaVA, and UniDiffuser video. This is where it gets really interesting, as I'll show you how you can use existing video and even audio to drive the AI video generation. Of course, we need to see the results! I'll show you some examples of Hunyuan Custom's output quality, from human characters and animals to complex scenes with generated backgrounds and actions. And get ready for a shootout! I'm impressed that Hunyuan isn't afraid to go head-to-head with other models like Kling, Pika, and more. We'll look at character referencing, object referencing, and multi-referencing. The code for Hunyuan Custom should be dropping around May 9th on Hugging Face and GitHub, but I'll share a link where you can try a version of it RIGHT NOW! (Quick note: I had some ISP issues, but you should be good to go!) Shifting gears, we'll look at Google! While Gemini 2.5 Pro is getting a lot of buzz (interactive visualizers, Godzilla vs. Gorillas simulations!), Gemini 2.0's image model quietly got an upgrade with better visual quality and text rendering. And it's free to use in AI Studio! Finally, rounding out the freebies, Runway now allows free-tier users to access their image generator, frames, and character reference features. There are limits, but it's a great way to test things out! CHAPTERS: 00:00 – Intro & What's Coming Up! 00:36 – Tencent's Hunyuan Custom: The Free AI Video Generator! 01:08 – Understanding Multimodal Models 01:32 - The Problem With Diffusion 01:55 - Multimodal Models 02:28 – How Hunyuan Custom Video Generation Works 03:00 - LLava 03:40 – Driving Hunyuan Custom with Video & Audio 04:29 – Hunyuan Custom: The Output Quality! 05:03 - Non Human Inputs 05:27 - Multi Character References 05:47 - Video Inpainting with Reference! 06:18 - Example 2 06:44 - A Day in the Life of A Guy 07:27 – Hunyuan Custom vs. The Competition (State-of-the-Art Shootout!) 07:45 - Example 1 08:18 - Example 2 08:56 - Example 3 09:50 - Example 4 10:49- Example 5 12:08 - Example 6 12:55- Shout out to Hunyaun! 13:11 - Driving Video Inpainting is amazing 13:47 – Try Hunyuan Custom For Yourself! (Code & Links) 14:58 – Google News! 15:55 - Google Gemini 2.0 Image Model Update 16:14 - Using AI Studio 16:37 - Generations 16:56 - What we CAN'T do in Midjourney 17:22 - One Major improvement 18:09 – Runway is Now (Kinda) Free! 19:00 – Wrapping Up! LINKS & RESOURCES: Try Hunyuan Custom: https://hunyuancustom.github.io/ Hugging Face (Code likely available May 9th): [https://hunyuan.tencent.com/modelSquare/home/play?modelId=192 Google AI Studio (for Gemini 2.0): https://aistudio.google.com/prompts/new_chat The Greatest AI Video Ever: https://www.reddit.com/r/aivideo/comments/1ej5a7f/avalanche/?utm_source=share&utm_medium=web3x&utm_name=web3xcss&utm_term=1&utm_content=share_button My Video on Runway Frames: https://youtu.be/7jwSNb4qq_E My Video on Runway References: https://youtu.be/umhFIUudEwo My Video on Runway References vs. Midjourney Omni Reference: [ttps://youtu.be/Poy__YfsQNo Don't forget to LIKE this video if you found it helpful, SUBSCRIBE for more AI news and tutorials, and hit that NOTIFICATION BELL so you don't miss out on the latest drops!
You May Also Like
AI Enthusiast's Urgent Upgrades
AI-recommended products based on this video

EZDIY-FAB RTX 3000 Series 12 Pin to Dual 8 Pin PCIe Sleeved Extension Cable 300 MM- Connector for NVIDIA Ampere GEFORCE RTX 3060ti 3070 3080 FE Funder Edition- White

Selore USB C Docking Station for Laptop Dual HDMI Monitor, 8 in 1 Triple 4K Docking Station 4 Monitors Adapter,VGA,100W PD,2USB A 2.0,USB C 2.0 USB C Hub for MacBook HP Dell Lenovo

USB C Docking Station Dual Monitor for Dell Hp,15-in-1 Laptop Docking Station 3 Monitors USB C Hub with Dual 4K HDMI,8K DP,Button,PD Charging,Ethernet,6 USB A&C,SD/TF, Audio USB-C Multiport Adapter

MAGICRAVEN 4K Portable Monitor 13.3" - 3840 * 2160 UHD IPS Portable Screen Laptop Monitor, Slim Lightweight Dual USB C HDMI Computer Monitor Gaming Display with Speakers, Travel Monitor for PC Phone




















