Build Real ML Model That Predicts Taxi Tips with XGBoost and NVIDIA GPU 🧠⚡
Python Simplified
@pythonsimplifiedAbout
Hi everyone! My name is Mariya and I'm a software developer from Sofia, Bulgaria. I film programming tutorials about Computer Science Concepts, GUI Applications, Machine Learning and Artificial Intelligence, Automation and Web Scraping, Data Science and even Math! 🤓 I'm here to help you with your programming journey (in particular - your Python programming journey 😉) and show you how many beautiful and powerful things we can do with code! 💪💪💪
Video Description
Ever wondered what makes people tip more in taxis? 🚕💵 In this hands-on machine learning project, we’ll build a complete workflow on real-world NYC data — cleaned, engineered, and trained entirely on GPU using XGBoost CUDA and cuDF Pandas! 🐼 (🚨No GPU?🚨 I’ll show you how to use one for free on Google Colab! 😉) You’ll see how professionals approach problems, handle massive data, and fix memory errors - designing real data-science pipelines step by step! 😎 By the end, you’ll have a meaningful project that’s fun to build, technically impressive, and looks perfect on your portfolio!! 🤩 Join me on this adventure — and learn how to think like a pro-level data scientist. 💡 What You’ll Learn - Handling Real-World Datasets: Cleanup, Missing Values, Anomalies, Aggregation. 📊 - Solving memory limitations and runtime crushes with cuDF Pandas + RMM. 💾 - Accelerating machine learning with XGBoost on NVIDIA GPUs. 🤖 - Evaluate your model’s performance — and keep making it smarter! 💪🤓 - And most importantly — develop the mindset of a data scientist, solving problems instead of guessing. 🔎 🧠 What Makes This Project Different This isn’t another “beginner demo” — it’s a real workflow based on real data and real problems. You’ll experience the same challenges professionals face: huge sloppy datasets, missing labels, CPU and GPU memory limits — all explained step by step, in simple terms. I’ll show you why we make each decision, not just how to code it — so you learn to think, debug, and reason like a pro. 🔗 Important Links ------------------------------------------------ 🔹Download Tutorial Code and Smaller Dataset from GitHub: https://github.com/MariyaSha/nyc_taxi_xgboost_lab 🔹 Download Full Dataset from NYC Open Data: https://data.cityofnewyork.us/Transportation/2023-Yellow-Taxi-Trip-Data/4b4i-vvec/about_data 🔹RAPIDS Installation Guide: https://docs.rapids.ai/install/ 🔹Official NVIDIA Google Colab Notebook - 🧐 VERY ADVANCED 🧐: https://colab.research.google.com/drive/1vlzvB981pej2RlKmXBUF1CNzyxl8YpJg 📽️ Important Tutorials ------------------------------------------------ ⭐ WSL + Conda Setup: https://youtu.be/luM5kwH6tjQ ⭐ Machine Learning with Scikit-Learn: https://youtu.be/-IvNzmrcyUM ⭐ cuDF Pandas For Beginners: https://youtu.be/9KsJRyZJ0vo ⭐ What is CUDA? https://youtu.be/r9IqwpMR9TE ⏰ Time Stamps ------------------------------------------------ 01:08 - Download Dataset 01:43 - Solving Big Data Problems with GPU Processing 02:46 - Google Colab Setup with Free T4 GPU 03:02 - Local Setup with NVIDIA GPU 03:43 - RAPIDS Installation Guide 05:07 - Solving Jupyter Kernel Crash with cuDF Pandas 05:29 - Handling Missing Values 05:53 - Detect Missing Values 06:29 - Replace with Zero 07:31 - Replace with Mean 08:57 - Investigate Columns with Ambiguous Names 11:21 - Drop Columns (If No Other Option) 12:01 - Split Data For Training & Testing 12:07 - Shuffle Data 13:39 - Features & Targets Split 14:02 - Train & Test Split 16:20 - Load XGBoost Model on GPU 17:55 - Train XGBoost Model 18:08 - Test XGBoost Model and Get Predictions 18:45 - Solve ValueError : DataFrame.dtypes must be int float bool or category 20:15 - Evaluate Trained Model 22:39 - Data Optimization & Anomalies 22:41 - Detect Data Anomalies with Aggregation 23:47 - Solve XGBoostError : No GPU Memory Left with RMM 25:04 - Handle Negative Charges and Unrealistic Distances 28:19 - Detect and Handle Unrealistic Transactions 30:28 - Second Train Run on Optimized Data 31:45 - Best Practices 31:45 - Plot Training Results & Feature Importance 32:17 - Hyperparameter Tuning 32:49 - Date Extraction : From String to Int or Category 33:05 - K-Fold Validation 33:45 - Thanks for Watching! 🚀 Environment Setup ------------------------------------------------ You can run this project in two ways, coding along with me: 1️⃣ Google Colab: - Change your runtime to T4 GPU. - Use smaller version of the NYC Taxi dataset (5 million rows). Download above 👆 2️⃣ Local setup: - Make sure you have a CUDA compatible GPU. - Use WSL and Minforge/Conda (⚠️MUST! ⚠️). - Use current command from RAPIDS Installation Guide for your setup (⚠️MUST! ⚠️). - Use the full version of the NYC Taxi dataset (38 million rows). Download above 👆 💻 Tutorial Code ------------------------------------------------ 📌 Remove all the rows that have negative numbers: data = data[~data.select_dtypes("number").lt(0).any(axis=1)] 📌 Solve "XGBoostError: No GPU memory is left" and kernel crashes: import rmm rmm.reinitialize(pool_allocator=True, initial_pool_size="8GB") #MachineLearning #DataScience #Python #BigData #GPU #NVIDIA #RAPIDS #DataAnalysis #DataCleaning #PythonTutorial #AI #pythonprogramming
Upgrade Your Everyday
AI-recommended products based on this video

Kasa Smart Outdoor Smart Plug by TP-Link (KP400) - Smart WiFi Outlet with 2 Sockets, IP64 Waterproof, Works with Alexa and Google Home, 2.4GHz WiFi Required, No Hub Required, Sunset & Sunrise Offset

Oraolo Open Ear Headphones, Bluetooth 5.4 Wireless Earbuds with Stereo Sound, ENC Clear Call, 21H Playtime, Type-C Charging, Touch Control, IPX6 Waterproof Open Ear Clip-On Earbuds for outdoor workout

Open Ear Wireless Earbuds, Sports Bluetooth Headphones with Real Time Translation, 80Hrs Playtime with LED Power Display, Noise Canceling Headset, IP55 Waterproof Earphones for Workout/Running (Black)

Wireless Earbuds, Sports Bluetooth Headphones, 80Hrs Playtime Ear Buds with LED Power Display, Noise Canceling Headset, IPX7 Waterproof Earphones for Workout/Running Z(Black)

Monster Wireless Earbuds Open Ear AC311, Bluetooth 5.4 Open Ear Headphones, Up to 30 Hours Playtimes Bluetooth Ear Earphones with Stereo Sound, Touch Control and IPX5 Waterproof Open Ear Buds

Brita Stainless Steel Premium Filtering Water Bottle, BPA-Free, Reusable, Insulated, Replaces 300 Plastic Water Bottles, Filter Lasts 2 Months or 40 Gallons, Includes 1 Filter, Carbon - 20 oz.

Simple Modern Filtered Water Bottle | Insulated Stainless-Steel Carbon Filter Travel Water Bottles | Reusable for Clean Drinking Water On The Go | 24oz, Sea Glass Sage

FITVII Health & Fitness Tracker (Answer/Make Calls), Smart Watch with 24/7 Heart Rate and Blood Pressure, Sleep Tracking Monitor, 120+ Sport Mode Activity Tracker

AYATAHA AYATAHA Smart Watch for Kids, Smartwatch Fitness Tracker for Boys Girls, Children's Activity Watch 37 Sports Modes SMS Notification, HD Full Touchscreen IP67 Waterproof, Blue

Iaret Iaret Smart Watch for Women, 1.83" HD Fitness Tracker with 4 Bands, Answer/Make Calls, Heart Rate/Sleep/SpO2/Step Tracking, 100+ Sport Modes, Android/iPhone Compatible Gift (Rose Gold)

Smart Watch for Men Women 1.8" Fitness Tracker, Bluetooth Call, DIY Dial, Heart Rate Sleep Blood Oxygen Monitor, 100+ Sports Modes, IP68 Waterproof Smartwatch for Android iPhone, Alexa Built-in

Hand Warmers 2 Pack, 14000mAh Rechargeable Hand Warmers, Electric Hand Warmer Reusable, Portable Power Bank USB Hand Warmers 4 Levels 8 Heating, Gifts for Raynauds Ski Golf Camping

Hand Warmers Rechargeable, 10000mAh Electric Heated Gloves Power Bank Portable Graphene Handwarmers Pouch with 3 Levels & Double-Sided Heating for Hunting Camping Golf Xmas Gifts for Women Men Kids

2Pack Rechargeable Hand Warmer, 8000mAh Electric Hand Warmer Power Bank, Portable USB-C Hand Warmer for Pocket, Reusable Hand Warmer Up to 8 hrs Each, Warm Gift for Men Women, for Hunting, Camping

GTOCE Portable Charger,40000mAh Power Bank with 22.5W Fast Charging LED Digital Display Battery Pack with 6 Outputs 2 Inputs, Type C Powerbank Portable Charger for iPhone 16 pro Samsung AirPods,Black

Monster Sleep Ear200, Wireless in-Ear Headphones, Bluetooth 6.0 Sleep Headphones, with ANC Active Noise Cancellation Designed for Side Sleepers, 30 Hours of bass Stereo Sound.

Monster Sleep Ear100 Ear Buds, Sleep Earbuds with Stereo Sound, Design for Side Sleeper, 32H Playtime, Bluetooth 6.0, ENC Noise Cancelling, IPX6 Waterproof Mini Headphones, White

Monster Sleep Ear100 Ear Buds, Sleep Earbuds with Stereo Sound, Design for Side Sleeper, 32H Playtime, Bluetooth 6.0, ENC Noise Cancelling, IPX6 Waterproof Mini Headphones, Black

Monster Sleep Ear100 Ear Buds, Sleep Earbuds with Stereo Sound, Design for Side Sleeper, 32H Playtime, Bluetooth 6.0, ENC Noise Cancelling, IPX6 Waterproof Mini Headphones, Black

Hydroponics Growing System Indoor Garden - Herb Garden with Grow Light, 15 Pods Stainless Steel Indoor Garden Kit, Auto Timer, Gardening Gift for All Ages

Umbra Triflora Hanging Planter for Window, Indoor Herb Garden, Set of 5, White/Black

Large Hydroponics Growing System 14 Pods, Indoor Herb Garden with LED Grow Light, 5L Water Tank, Hydroponic Grow Kit with 3 Auto-Timers, Rotatable Light Panel and Child Lock for Home School Gardening

Hanging Planter Hanging Plant Holder, 6 Inch 4 Indoor Plant Pots, Wall/Window Plant Hanger Indoor Herb Garden

slopehill Multi Hair Stylers & Hair Straightener - 2 in 1 Wet to Dry Air Straightener and Hair Dryer Combo with High Speed Air + Rapid Heat-Up + Customizable Temperature(Pink)

Hi.FANCY Portable Laptop Stand with Dual Cooling Fans for 14-17inch Laptops, Grey, 23.5 x 25.9 x 0.95cm

Laptop Stand for Desk, Adjustable Laptop Riser ABS+Silicone Foldable Portable Laptop Holder, Ventilated Cooling Notebook Stand for 10-15.6” Laptops,Tablet-Black

JETech 5 in 1 Case for Samsung Galaxy S25 Ultra 5G with 2-Pack Each Tempered Glass Screen Protector and Camera Lens Protector, Non-Yellowing Shockproof Bumper Phone Cover (Clear)

TAURI for iPhone 17 Pro Max Case 6.9" with 1-Pack Screen Protector, Camera Lens Full Protection, Military-Grade Protection, Shockproof Transparent Back Bumper Phone Cover - Clear Global Recycled Standard

TAURI for iPhone 17 Pro Case 6.3" with 1-Pack Screen Protector, Camera Lens Full Protection, Military-Grade Protection, Shockproof Transparent Back Bumper Phone Cover - Clear Global Recycled Standard

JOINPAYA 1Set Rechargeable Hand Warmer Hand Heater for Winter Heating Levels Compact

Shakven Rechargeable Hand Warmer | Cute Comfortable Portable Hand Warmers,Ergonomic Adjustable Energy-Efficient Small Heater for Travel, Outdoor, Winter

OCOOPA IP45 Waterproof Hand Warmer Rechargeable, Up to 15hrs Heat,10000mAh Durable Quick Charge Electric Hand Heater, PD Compatible, 3 Levels for Outdoors, Heavy Duty, H01-PD PRO



















