A duck in the hand is worth two in the cloud
2024/11/08What if I told you that you could complete a JSON parse and extract task on your laptop before a distributed compute cluster even finishes booting up? DuckDB is a lightweight, in-process analytical database that runs on your laptop inside of Python and can wrangle large datasets efficiently, both from local and remote data sources. In this talk, we will show you how to query a dataset with DuckDB to extract, load and transform data right on your laptop. We'll then show you how to move your workloads to the Cloud, so you can run them at scale. By developing locally and pushing to the Cloud it's not only easy to develop, debug and iterate, but also makes it easy to quickly switch back and forth between workloads that do and don't require Cloud compute resources, cutting both cost and time.
Related Videos

2025-11-20
Data-based: Going Beyond the Dataframe
Learn how to turbocharge your Python data work using DuckDB and MotherDuck with Pandas. We walk through performance comparisons, exploratory data analysis on bigger datasets, and an end-to-end ML feature engineering pipeline.
Webinar
Python
AI, ML and LLMs



