A duck in the hand is worth two in the cloud
2024/11/08What if I told you that you could complete a JSON parse and extract task on your laptop before a distributed compute cluster even finishes booting up? DuckDB is a lightweight, in-process analytical database that runs on your laptop inside of Python and can wrangle large datasets efficiently, both from local and remote data sources. In this talk, we will show you how to query a dataset with DuckDB to extract, load and transform data right on your laptop. We'll then show you how to move your workloads to the Cloud, so you can run them at scale. By developing locally and pushing to the Cloud it's not only easy to develop, debug and iterate, but also makes it easy to quickly switch back and forth between workloads that do and don't require Cloud compute resources, cutting both cost and time.
Related Videos

20:44
2025-06-13
What can Postgres learn from DuckDB? (PGConf.dev 2025)
DuckDB an open source SQL analytics engine that is quickly growing in popularity. This begs the question: What can Postgres learn from DuckDB?
YouTube
Ecosystem
Talk

2025-06-12
pg_duckdb: Ducking awesome analytics in Postgres
Supercharge your Postgres analytics! This talk shows how the pg_duckdb extension accelerates your slowest queries instantly, often with zero code changes. Learn practical tips and how to use remote columnar storage for even more speed.
Talk
Sources

2025-06-02
Taming file zoos: Data science with DuckDB database files
Tame "file zoos" with DuckDB! This talk reveals how DuckDB seamlessly integrates with Python to read/write diverse files (CSV, Parquet), organize data efficiently, enable larger-than-memory analysis, and streamline your workflows.
Python
Talk