How We Power the Largest AI Deployments on the Planet: Running Vir... Brandon Jacobs & Lukas Gentele
CNCF [Cloud Native Computing Foundation]
@cncfAbout
To provide educational and informative content on cloud native computing, which uses an open source software stack to deploy applications as microservices, packaging each part into its own container, and dynamically orchestrating those containers to optimize resource utilization. Educational content on CNCF projects, like Kubernetes and Prometheus, will also be provided.
Video Description
How We Power the Largest AI Deployments on the Planet: Running Virtual Clusters at Scale - Brandon Jacobs, CoreWeave & Lukas Gentele, Loft Labs Running and managing a large number of Kubernetes clusters on bare metal poses significant challenges, from security to GPU provisioning to scalability. Specialized cloud provider CoreWeave experienced these first-hand, operating 3,000+ Kubernetes clusters on top of 5,000 bare metal nodes with massive amounts of GPUs to power modern AI applications at scale. In the session, we’ll dive into these challenges and how CoreWeave partnered with Loft Labs, the maintainers of vcluster, to create this serverless Kubernetes experience for numerous companies running AI workloads at scale. This session demonstrates the pitfalls, design choices and architectural challenges the teams have dealt with over the course of 3 years while evolving its serverless Kubernetes offering, including: -Secure Isolation Of Tenants On A Shared Infrastructure -Challenges in achieving 10 second autoscaling -On-Demand Cluster & Compute Provisioning For Tenants -Day 2 Operations & Managing A Fleet Of Clusters At Scale
AI Deployment Essentials
AI-recommended products based on this video

OWC 64GB DDR5 4800 PC5-38400 CL40 2Rx4 288-pin 1.1V ECC Registered RDIMM Memory RAM Module Upgrade Compatible with Dell PowerEdge R660XS R760XS

OWC 64GB DDR5 4800 PC5-38400 CL40 2Rx4 288-pin 1.1V ECC Registered RDIMM Memory RAM Module Upgrade Compatible with Dell PowerEdge XE9680

OWC 64GB DDR5 4800 PC5-38400 CL40 2Rx4 288-pin 1.1V ECC Registered RDIMM Memory RAM Module Upgrade Compatible with Dell PowerEdge R6625 R760 R7615 R7625

OWC 64GB DDR5 4800 PC5-38400 CL40 2Rx4 288-pin 1.1V ECC Registered RDIMM Memory RAM Module Upgrade Compatible with Dell PowerEdge HS5610 HS5620

Lenovo IdeaPad 3 14" Full HD Business Laptop, Intel i5-1135G7, 36GB RAM, 2.28TB Storage (2TB SSD+288GB Docking Station Set), Intel Iris Xe Graphics, WiFi 6, Webcam, Windows 11 Pro, Platinum Grey

Alienware Aurora Gaming Desktop ACT1250 - Intel Core Ultra 9 285 Processor, Liquid Cooled, NVIDIA GeForce RTX 5080, 32GB DDR5 RAM, 1TB SSD, 1000W Platinum Rated PSU, Windows 11 Home - Clear Panel

IdeaPad 3 14" HD Laptop - Intel Pentium Silver N5030, 4GB RAM, 128GB SSD, Windows 10 S Mode - Platinum Grey (81WH004LUS)

oelaio Clearance of Sales Cargo Pants for Men Big and Tall Hiking Pants Lightweight Ripstop Water Resistant Outdoor Fitness Green

Bike Trailer Attachment, Cargo and Pet Bike Trailers, Quick-Release Adapter Cycling Accessories for Kids Pet Transport Cargo Commute Family Adventure Daily Trailers




















