Why should you care about DuckDB? ft. Mihai Bojin

MotherDuck February 7, 2024
Video Thumbnail
MotherDuck Logo

MotherDuck

View Channel

About

Collaborative serverless analytics platform

Video Description

Talk from the DuckDB meetup that happened in Dublin on 23 January 2024! Future events: https://motherduck.com/events/ More about Dublin's DuckDB meetup : https://www.meetup.com/duckdb-dublin-meetup/ ☁️🦆 Start using DuckDB in the Cloud for FREE with MotherDuck : https://hubs.la/Q02QnFR40 📓 Resources * Slides : https://docs.google.com/presentation/d/1C1H9aoSICrILaRrZsSE16WImeESWZ17aQCheWS2RrOI/preview?slide=id.g2ab5652b19d_0_333 * Mihai's Linkedin : https://www.linkedin.com/in/mihai-bojin/ * Mihai's Blog : https://mihaibojin.medium.com/duckdb-the-big-data-rising-star-71916f953f18 * Mihai's YouTube @MihaiBojin ➡️ Follow Us LinkedIn: https://linkedin.com/company/motherduck Twitter : https://twitter.com/motherduck Blog: https://motherduck.com/blog/ #datascience #dataengineering #duckdb -------------------------------------- If you're a data engineer or analyst wondering "Why use DuckDB?" amidst the explosion of big data tools, this video is for you. We cut through the noise of the modern data landscape—a crowded space with endless data warehouses, data lakes, and ETL processes—to explain why DuckDB is a worthwhile investment of your time. We'll explore how, despite the complexity of today's big data ecosystem, SQL remains the universal language for data processing, setting the stage for a tool that simplifies and empowers. Curious about DuckDB's momentum? We dive into the data, comparing DuckDB vs Snowflake's early growth trajectories. By analyzing metrics from database engine rankings and GitHub, we demonstrate the rapid adoption of this powerful database technology. This trend suggests DuckDB is on track to become an essential part of every data professional's toolkit, making now the perfect time to learn how it can enhance your data analysis workflows. Discover what makes DuckDB so powerful: it’s an in-process database that runs anywhere. We explain how you can install DuckDB with a simple command and run it on your laptop, in CI/CD pipelines, or even directly in your browser. This simplicity eliminates the complexity of servers, credentials, and firewalls. Learn about its seamless integration with Pandas data frames, its robust extension mechanism for reading formats like Parquet and JSON directly from S3, and its out-of-memory performance optimizations that make local data processing faster and cheaper than ever. Finally, we explore three powerful features of DuckDB's SQL syntax that will streamline your queries. See how `GROUP BY ALL` eliminates repetitive code and makes your SQL easier to maintain. We'll also cover the convenience of the `SELECT ... EXCLUDE` syntax and demonstrate the power of the `ASOF JOIN`, a specialized join perfect for time-series data analysis that simplifies complex timestamp comparisons. These DuckDB optimization techniques are designed to make you a more productive data analyst.

You May Also Like

No Recommendations Found

No products were found for the selected channel.