Hey, friend đź‘‹
Hi everyone, Jacob here with my first newsletter. I am pumped to share some great links and highlights, delivered right to your inbox.
In this December issue, I’ve gathered ten great links, covering topics from business models to benchmarks and a few other pieces. My personal favorite of course has to do with spreadsheets, and importantly the ability to read from and write to them, all with SQL inside of DuckDB. Enjoy!
If you have any feedback or links you think we missed, we would love your feedback over at duckdbnews@motherduck.com.
Featured Community Member |
Tobias MĂĽller
Tobias Muller is a notable builder of DuckDB things, in addition to writing about them on his blog, tobilg.com. Of note recently is the great sql-workbench.com, an in-browser IDE with some really nice capabilities that extend the experience of DuckDB, like charting and integrated AI query assistance. He wrote an awesome article explaining it on his blog. Thank you Toby for always pushing the envelope with DuckDB and writing about it in public for us to share!
Top DuckDB Links this Month |
Data goes Blue? Extracting analytics from Bluesky
Mehdi dives into Bluesky, the open source social network with fully open APIs (remember when the other site had that?). Thanks to some great work done by community members, it's simple to connect to this massive dataset and build your analytics playground. You can also say hello to both Mehdi and yours truly over there! (and, of course, MotherDuck's Bluesky account)
LLMs in SQL? A real-world application to clean up your CRM data
Inspired by some of the AI features integrated into MotherDuck, this article discusses leveraging Large Language Models (LLMs) within SQL queries to enhance the quality of firmographic data in Customer Relationship Management (CRM) systems. Nate addresses common challenges in maintaining accurate industry classifications and leverages LLMs, specifically within Snowflake's database environment, to automate the categorization of company names into predefined industry sectors. He aims to streamline data cleaning processes, reduce manual effort, and improve the reliability of CRM data for analytical purposes. It “mostly works” but per usual you still need a human in the loop.
DuckDB GSheets
Archie at Evidence.dev has been up to stuff - specifically building a DuckDB community extension for Google Sheets. While he originally set out to build this to save himself some time, it ended up manifesting as a really nice extension. It supports Auth, Read, and Write to Google Sheets, with a really nice sql abstraction that will look familiar to anyone who has copied Postgres tables:
COPY source_table TO 'gsheet_id' (FORMAT gsheet);
Anywhere you can write DuckDB SQL, you can also import and export Google Sheets - powerful! Give the github repo a star and check it out!
Generating a Data App with your MotherDuck Data
MotherDuck’s own Till Döhmen writes about experimenting with Claude Artifacts to build a MotherDuck data app generator. I think we are seeing things move quite quickly in the data + AI space, and so this end to end example from Till is a great check-in point on current “state of the art” for generative apps. While this is merely a peak into the future, I think he really lays out a clean way to think about these types of flows for data science & other analytical workflows.
Driving CSV Performance: Benchmarking DuckDB with the NYC Taxi Dataset
Pedro cannot avoid continuing to work on CSVs, this time in the context of the NYC taxi dataset. He is very thoughtful about what it means to have a well designed benchmark, and provides code examples for each step of the process. Of particular note is the choice to track “Avg. deviation of CPU usage from 100%” as metric, which informs a bit about how the DuckDB Labs folks think about the effectiveness of DuckDB - it should use all available compute all the time!
Why the Quack will you use DuckDB?
Dudhraj Sandeep gives us five reasons why you should check out DuckDB. You’ll have to click through to the article to see all five reasons, but he offers examples from handling complex queries with ease (and speed!) to operating in resource-constrained environments, and a few other scenarios where DuckDB shines.
David's Substack on the DuckDB Foundation Model
David Jayatillake writes a bit about DuckDb’s amazing rise, where he clearly lays out how the DuckDB Foundation with DuckDB Labs and supporting orgs (like MotherDuck & Volton) are building an alternative model for bringing this type of infrastructure to life. This framing is helpful especially for those that are unclear on how the organizations differ. It also shares a timely reference to “Santa as a Duck” that is worth clicking through just to see!Â
DuckDB WebMacro
The team behind quackscience brings us a new DuckDB function to allow github gists to be shared and loaded as Macros.
Lightning-Fast Analytics: DuckDB + WASM for Large Datasets in the Browser
David Rodriguez shows off how to use DuckDB-WASM to make your analytics lightning-fast and your workflows ridiculously efficient—no backend required. This article dives into the mechanics of how DuckDB-WASM works, explores its limitations, and provides practical examples to get you started.
Valentina adds MotherDuck support
I always love to see more tools adding MotherDuck support, and Valentina is no exception. They have recently added support in their Valentina Studio product to allow users to seamlessly connect to MotherDuck and build analytical queries inside their IDE. This feature is available in version 14.6 and later.
Upcoming Events |
DuckCon #6 in Amsterdam
31 January, Amsterdam, Netherlands - 2:30 PM Central European Time
DuckCon #6, DuckDB's next user group meeting in Amsterdam, the Netherlands. The event will be in person + streamed online on the DuckDB YouTube channel. The agenda has been published on the DuckDB website.
Airbyte + MotherDuck $10,000 Hackathon
Now Until January 20th, 2025
With the launch of the new MotherDuck connector for Airbyte, we're thrilled to continue our partnership with MotherDuck by announcing our upcoming hackathon that brings together the power of Airbyte and MotherDuck to solve the needs of delivering modern data integration, AI, and analytics solutions.
CONTENT
- Hey, friend đź‘‹
- Featured Community Member
- Top DuckDB Links this Month
- Upcoming Events
Subscribe to DuckDB Newsletter