
2023/05/24 - Marcos Ortiz
This Month in the DuckDB Ecosystem: May 2023
DuckDB news: v0.8.0 brings Pivot/Unpivot and time series joins. Project hits 10,000 GitHub stars. Spatial extension and native Swift API launch.
It’s Marcos again, aka “DuckDB News Reporter” with another issue of “This Month in the DuckDB Ecosystem for June 2023.
This month keeps showing the rising popularity of DuckDB as a great developer tool. From analyzing music data to being the choice to work with 50k+ datasets in the Hugging Face Hub, from using it for creating dummy data to analyzing your own Fitbit data with it.
As always we share here, this is a two-way conversation: if you have any feedback on this newsletter, feel free to send us an email to duckdbnews@motherduck.com
-Marcos

Max Gabrielsson is a Junior Software engineer at DuckDB labs but he has already made some impressive waves! He’s the creator of the official spatial DuckDB extension. While it’s still WIP, it’s much more welcome for any geo data processing. You can read more about this one here.

The Hugging Face team just announced the integration with DuckDB, which means that now you can use the simplicity of SQL on 50k+ datasets on its Hub.
This new feature from DuckDB will allow building more readable and easier-to-maintain complex queries.
In this video, Laurens shows how to work with deeply nested JSON data in DuckDB
Matt Palmer shares a very interesting perspective in this post on why DuckDB is so popular these days.
The Dagster team just released a tutorial to show how to combine DuckDB I/O Manager and Dagster’s Software-Defined Assets. If you use Dagster in production today, you will benefit a lot from this seamless integration here
Researchers from the CMU Data Interaction Group just shared this notebook on Observable where they combined the power of FalconVIS and DuckDB to cross-filter 10 Million rows.

In this post, Simon Aubury analyzed its own Fitbit activity with the help of DuckDB and Seaborn

The Vantage team shared an insightful comparison between clickhouse-local and DuckDB. The post is worth a read because it highlights a very important point on why people are selecting DuckDB for more and more projects: developer productivity with DuckDB is just awesome

Mark Needham (a regular person in this newsletter) wrote about how to use the potential of UDFs on DuckDB to generate dummy data. If you are a visual person, you can watch the video Mark did explaining the same thing here.
Max Halford show a simple way to work with graphs with Python and DuckDB
Arthur Dryomov wrote about how to analyze music data with DuckDB
In this post, Stephen Abbott Pugh explains in great detail how DuckDB could be the perfect tool to work with the Beneficial Ownership Data Standard (BODS)
“DuckCon,” the DuckDB user group, will be held for the first time outside of Europe in San Francisco Museum of Modern Art (SFMOMA), in the Phyllis Wattis Theater. In this edition, there will be talks from DuckDB creators Hannes Mühleisen and Mark Raasveldt about the current state of DuckDB and future plans. It will also talks from data industry notables Lloyd Tabb (of Looker and Malloy fame) and Josh Wills (creator of dbt-duckdb). The full agenda is available here.
Grab your ticket here, as there is limited space!
Following DuckCon, MotherDuck will host a party celebrating ducks at 111 Minna (located very close to SFMOMA). DuckCon attendees are cordially invited to attend to eat, drink, listen to music and play games (skeeball!). MotherDuck’s Chief Duck Herder will also demo the latest work bringing DuckDB to the cloud.
Register now before they run out of space!
DuckDB co-creator Hannes will be giving a keynote at this 10-track data conference hosted by Databricks. Additionally, Ryan Boyd (co-founder at MotherDuck) will be delivering a technical session: If A Duck Quacks In The Forest And Everyone Hears, Should You Care?

2023/05/24 - Marcos Ortiz
DuckDB news: v0.8.0 brings Pivot/Unpivot and time series joins. Project hits 10,000 GitHub stars. Spatial extension and native Swift API launch.
2023/06/08 - Mehdi Ouazza
A comparaison through a pragmatic analytic project of DuckDB, Polars and Pandas