MOTHERDUCK BLOG

DuckDB, MotherDuck, Data Engineering and Data Analytics

Search and filter

Search and filter

Topic

Does "AI-Ready Data" simply mean "Good Data Modeling"?

2026/02/06 - Jacob Matson

Does "AI-Ready Data" simply mean "Good Data Modeling"?

I ran 500 text-to-SQL questions against three frontier LLMs with zero context beyond the schema. 95% accuracy. No semantic layer required.

Building an Obsidian RAG with DuckDB and MotherDuck

2026/02/05 - Simon Späti

Building an Obsidian RAG with DuckDB and MotherDuck

Build a local-first RAG for your Obsidian notes using DuckDB's vector search, then deploy it as a serverless web app with MotherDuck

Subscribe to MotherDuck Blog

Blog subscription card section illustration
More Control, Less Hassle: Self-Serve Recovery with Point-in-Time Restore

2026/02/04 - Sheila Sitaram, Joseph Hwang, Nicholas Greenspan

More Control, Less Hassle: Self-Serve Recovery with Point-in-Time Restore

MotherDuck now supports point-in-time restores, making it easier than ever to roll back changes, undrop databases, and debug pipelines.

SQL Golf: Lessons from Quackmas 2025

2026/01/23 - Jacob Matson

SQL Golf: Lessons from Quackmas 2025

From #N column references to boolean math, explore the extreme techniques used to solve the Quackmas 2025 SQL Golf challenge.

This Month in the DuckDB Ecosystem: January 2026

2026/01/17 - Simon Späti

This Month in the DuckDB Ecosystem: January 2026

DuckDB news: Iceberg extension adds full DML (INSERT/UPDATE/DELETE). Process 1TB in 30 seconds. Query data via AI agents with MCP server. TypeScript macros for APIs.

We Built Our Employees a Wrapped—Using SQL and MotherDuck

2025/12/29 - Hannah Rifkin

We Built Our Employees a Wrapped—Using SQL and MotherDuck

We built a Spotify-style "Wrapped" for MotherDuck employees using our own platform. Discover the SQL queries, data tricks, and fun personas behind our internal leaderboard.

What If We Don't Need the Semantic Layer?

2025/12/23 - Jacob Matson

What If We Don't Need the Semantic Layer?

What if AI could discover your business logic by mining query history instead of requiring manual metric definitions? Explore how LLMs are replacing static semantic layers with systems that learn from actual usage.

A Quack-Packed Fall

2025/12/22 - Carly Spoljaric, Jerel Navarrete

A Quack-Packed Fall

MotherDuck spent fall on the conference circuit across Europe and the US. Here's what we presented and the pattern that emerged in nearly every conversation about analytics.

Stop Paying the Complexity Tax

2025/12/19 - Ryan Boyd

Stop Paying the Complexity Tax

My personal reflections from watching Jordan Tigani’s keynote "The Unbearable Bigness of Small Data" at Small Data SF. Most data warehouses are overbuilt for scale you'll never need. Learn how to stop paying the complexity tax.

Building the MotherDuck Remote MCP Server: A Journey Through Context Engineering and OAuth Proxies

2025/12/18 - Till Döhmen

Building the MotherDuck Remote MCP Server: A Journey Through Context Engineering and OAuth Proxies

How MotherDuck built a production-ready remote MCP server—from OAuth proxy challenges with Auth0 to tool design patterns that help AI agents query data warehouses effectively. Includes lessons from a hackathon where agents ran 4,000+

Building an answering machine

2025/12/17 - Jordan Tigani

Building an answering machine

Discover how MotherDuck's MCP server enables true self-service analytics through AI agents. Query data in plain English with Claude, ChatGPT, or Gemini—no SQL required.

MotherDuck Integrates with PlanetScale Postgres

2025/12/16 - Garrett O'Brien, Jacob Matson

MotherDuck Integrates with PlanetScale Postgres

Run analytics 200x faster on PlanetScale Postgres with MotherDuck integration. Keep millisecond transactions while pushing analytical queries to serverless compute.

Calling All SQL Sleuths: The Christmas Heist Awaits

2025/12/12 - Jacob Matson

Calling All SQL Sleuths: The Christmas Heist Awaits

Christmas presents have vanished! Use your SQL skills to solve the mystery on DBQuacks and win prizes. 15 challenges. One leaderboard.

Streaming Pipelines with MotherDuck and Artie

2025/12/11 - Garrett O'Brien

Streaming Pipelines with MotherDuck and Artie

CDC streaming takes flight with Artie’s new MotherDuck destination.

This Month in the DuckDB Ecosystem: December 2025

2025/12/10 - Simon Späti

This Month in the DuckDB Ecosystem: December 2025

DuckDB news: v1.4 adds AES-256 encryption. DuckLake brings ACID-compliant lakehouse with time-travel queries. Gaggle extension queries Kaggle datasets directly via SQL.

Simplicity of a Database, but the Speed of a Cache: OLAP Caches for DuckDB

2025/12/03 - Simon Späti

Simplicity of a Database, but the Speed of a Cache: OLAP Caches for DuckDB

Speed up slow dashboards without adding new infrastructure. Learn how DuckDB's caching extensions can drop query times from minutes to seconds.

Branch, Test, Deploy: A Git-Inspired Approach for Data

2025/11/24 - Simon Späti

Branch, Test, Deploy: A Git-Inspired Approach for Data

This article explores how to bring Git style workflows like branching, testing, and deploying to your data stack. Learn how concepts like zero copy cloning and metadata pointers can finally give you isolated test environments.

Small Data SF 2025: the Recap!

2025/11/14 - Garrett O'Brien

Small Data SF 2025: the Recap!

Dive into a recap of the world's hottest efficiency-themed data conference, Small Data SF!

DuckDB Ecosystem: November 2025

2025/11/12 - Simon Späti

DuckDB Ecosystem: November 2025

DuckDB news: QuackStore caching cuts query time from 49s to 3s. Infera runs ONNX ML models in SQL. 127 community extensions analyzed. DuckLake architecture explained.

4 Senior Data Engineers Answer 10 Top Reddit Questions

2025/10/30 - Simon Späti

4 Senior Data Engineers Answer 10 Top Reddit Questions

A great panel answering the most voted/commented data questions on Reddit

Faster Ducks

2025/10/28 - Jordan Tigani

Faster Ducks

Benchmarks, efficiency, and how MotherDuck just got nearly 20% faster.

DuckDB 1.4.1 and DuckLake 0.3 Land in MotherDuck: New SQL Syntax, Iceberg Interoperability, and Performance Gains

2025/10/09 - Alex Monahan, Garrett O'Brien

DuckDB 1.4.1 and DuckLake 0.3 Land in MotherDuck: New SQL Syntax, Iceberg Interoperability, and Performance Gains

MotherDuck now supports DuckDB 1.4.1 and DuckLake 0.3, with new SQL syntax, faster sorting, Iceberg interoperability, and more. Read on for the highlights from these major releases.

Why Python Developers Need DuckDB (And Not Just Another DataFrame Library)

2025/10/08 - Mehdi Ouazza

Why Python Developers Need DuckDB (And Not Just Another DataFrame Library)

Understand why a database is much more than just a dataframe library

DuckDB Ecosystem: October 2025

2025/10/07 - Simon Späti

DuckDB Ecosystem: October 2025

DuckDB news: v1.4.0 LTS brings AES-256 encryption, MERGE statements, and Iceberg writes. 100x faster than Spark on local Parquet. Official Docker images released.

PAINLESS GEOSPATIAL ANALYTICS USING MOTHERDUCK’S NATIVE INTEGRATION WITH GALILEO.WORLD

2025/09/09 - Patrick Garcia

PAINLESS GEOSPATIAL ANALYTICS USING MOTHERDUCK’S NATIVE INTEGRATION WITH GALILEO.WORLD

Discover how Galileo.world is revolutionizing geospatial analysis. Say goodbye to slow performance and complex setups. Analyze and visualize big data right in your browser.

DuckDB Ecosystem: September 2025

2025/09/09 - Simon Späti

DuckDB Ecosystem: September 2025

DuckDB news: Spatial joins 58x faster via R-tree indexing. pg_duckdb 1.0 adds OLAP analytics to PostgreSQL. One team cut Snowflake costs 79% using DuckDB caching.

Announcing Pg_duckdb Version 1.0

2025/09/03 - Jelte Fennema-Nio, Jacob Matson

Announcing Pg_duckdb Version 1.0

PostgreSQL gets a DuckDB-flavored power-up for faster analytical queries without ever leaving Postgres.

DuckDB × cognee: Run SQL Analytics Right Beside Your Graph-Native RAG

2025/08/29 - Vasilije Markovic

DuckDB × cognee: Run SQL Analytics Right Beside Your Graph-Native RAG

SQL analytics and graph-native retrieval together, eliminating the trade-off between fast analytics and one-off RAG retrievals.

Nine Keyboard Shortcuts for SQL Flow State

2025/08/22 - Jacob Matson

Nine Keyboard Shortcuts for SQL Flow State

Tired of clicking? Master 9 essential SQL keyboard shortcuts to achieve a true flow state and make your data analysis faster and more joyful. Learn to run queries, comment, format, and even use AI without leaving your keyboard.

Why Semantic Layers Matter — and How to Build One with DuckDB

2025/08/19 - Simon Späti

Why Semantic Layers Matter — and How to Build One with DuckDB

Learn what a semantic layer is, why it matters, and how to build a simple one with DuckDB and Ibis using just YAML and Python

When Spark Meets DuckLake: Tooling You Know, Simplicity You Need

2025/08/11 - Mehdi Ouazza

When Spark Meets DuckLake: Tooling You Know, Simplicity You Need

Learn how to combine Apache Spark’s scale with DuckLake’s simplicity to build a lakehouse with ACID, time travel, and schema evolution

DuckDB Ecosystem: August 2025

2025/08/07 - Simon Späti

DuckDB Ecosystem: August 2025

DuckDB news: 50.7% YoY developer growth. DuckLake v0.2 adds credential secrets. BigQuery extension hits 21.7k weekly downloads. Vector search enables RAG applications.

Real-Time MySQL to MotherDuck Streaming with Streamkap: A Shift Left Architecture Guide

2025/08/07 - Oli Dinov

Real-Time MySQL to MotherDuck Streaming with Streamkap: A Shift Left Architecture Guide

Build real-time MySQL to MotherDuck pipelines with Streamkap. Learn Shift Left architecture, streaming CDC, and how to replace batch ETL for instant analytics and customer dashboards.

Just Enough SQL to be Dangerous with AI

2025/08/04 - Jacob Matson, Alex Monahan

Just Enough SQL to be Dangerous with AI

Learn essential SQL to verify AI-generated queries. Master SELECT, JOIN, and CTEs to safely analyze data with LLMs. Includes DuckDB examples and safety tips

MotherDuck's Latest AI Features: Smarter SQL Error Fixes and Natural Language Editing

2025/07/25 - Hamilton Ulmer, Jacob Matson

MotherDuck's Latest AI Features: Smarter SQL Error Fixes and Natural Language Editing

Stay in flow with MotherDuck's latest features. Real-time SQL feedback and natural language editing.

Summer Data Engineering Roadmap

2025/07/21 - Simon Späti

Summer Data Engineering Roadmap

A comprehensive 3-week structured roadmap for learning data engineering fundamentals, from SQL and Git basics to advanced topics like streaming, data quality, and DevOps.RetryClaude can make mistakes. Please double-check responses.

Introducing Mega and Giga Ducklings: Scaling Up, Way Up

2025/07/17 - Ryan Boyd

Introducing Mega and Giga Ducklings: Scaling Up, Way Up

New MotherDuck instance sizes allow data warehousing users more flexibility for complex queries and transformations. Need more compute to scale up? Megas and Gigas will help!

Small Data SF Returns November 4-5, 2025: First Speakers Announced

2025/07/17 - Ryan Boyd

Small Data SF Returns November 4-5, 2025: First Speakers Announced

Conference with two days of practical innovation on data and AI: workshops and talks from industry leaders, including Benn Stancil, Joe Reis, Adi Polak, George Fraser, Jordan Tigani, Holden Karau, Ravin Kumar, Sam Alexander and more!

Teaching Your LLM About DuckDB the Right Way: How to Fix Outdated Documentation

2025/07/15 - Mehdi Ouazza

Teaching Your LLM About DuckDB the Right Way: How to Fix Outdated Documentation

Learn how to keep LLMs updated with llms.txt and Cursor's docs feature.

This Month in the DuckDB Ecosystem: July 2025

2025/07/08 - Simon Späti

This Month in the DuckDB Ecosystem: July 2025

DuckDB news: Tributary streams Kafka data to SQL queries. SQLRooms enables browser analytics via DuckDB-WASM. pg_duckdb benchmarks show 1,500x TPC-DS speedup.

The Data Engineer Toolkit: Infrastructure, DevOps, and Beyond

2025/07/03 - Simon Späti

The Data Engineer Toolkit: Infrastructure, DevOps, and Beyond

Data engineers increasingly own DevOps. This guide breaks down the best DevOps and CI/CD tools, Kubernetes practices, data engineering platforms, workflow orchestration, and infrastructure strategies used in modern data stacks.

MotherDuck Managed DuckLakes Now in Preview: Scale to Petabytes

2025/07/01 - Ryan Boyd

MotherDuck Managed DuckLakes Now in Preview: Scale to Petabytes

Preview support of MotherDuck includes both fully-managed DuckLake support and ability to bring your own bucket. Combined with MotherDuck's storage, you get both high-speed access to recent data and support for massive scale historical data.

I Made Cursor + AI Write Perfect SQL. Here's the Exact Setup

2025/06/27 - Jacob Matson

I Made Cursor + AI Write Perfect SQL. Here's the Exact Setup

Stop debugging AI-generated SQL queries. Learn the exact Cursor + MotherDuck setup that makes AI write working SQL on the first try, with step-by-step instructions.

PostgreSQL and Ducks: The Perfect Analytical Pairing

2025/06/16 - Jacob Matson, Aditya Somani

PostgreSQL and Ducks: The Perfect Analytical Pairing

Learn how to integrate PostgreSQL with DuckDB and MotherDuck for faster analytics. Compare DuckDB Postgres Extension, pg_duckdb, and CDC approaches with practical examples and best practices for each method.

Getting Started with DuckLake: A New Table Format for Your Lakehouse

2025/06/09 - Mehdi Ouazza

Getting Started with DuckLake: A New Table Format for Your Lakehouse

Learn how DuckLake simplifies metadata and brings fast, database-like features to your data lakehouse — with a hands-on example using DuckDB and PostgreSQL

DuckDB Ecosystem: June 2025

2025/06/06 - Simon Späti

DuckDB Ecosystem: June 2025

DuckDB news: DuckLake combines catalog and table format with ACID metadata in SQL. Radio extension adds WebSocket and Redis Pub/Sub. Top CSV benchmark results.

DuckDB 1.3 Lands in MotherDuck: Performance Boosts, Even Faster Parquet, and Smarter SQL

2025/06/01 - Sheila Sitaram

DuckDB 1.3 Lands in MotherDuck: Performance Boosts, Even Faster Parquet, and Smarter SQL

DuckDB 1.3 has launched, with performance boosts, faster Parquet reads and writes, and new SQL syntax for ducking awesome analytics with full support in MotherDuck. Read on for highlights from this major release.

From BigQuery to DuckDB and MotherDuck : Efficient Local and Cloud Data Pipelines

2025/05/30 - Mehdi Ouazza

From BigQuery to DuckDB and MotherDuck : Efficient Local and Cloud Data Pipelines

Learn how to query load table from BigQuery to DuckDB and MotherDuck using SQL and Python!

A Duck Walks into a Lake

2025/05/28 - Jordan Tigani

A Duck Walks into a Lake

DuckDB introduces a new table format, what does it mean for the future of data lakes ?

Breaking the Excel-SQL Barrier: Leveraging DuckDB's Excel Extension

2025/05/27 - Jacob Matson

Breaking the Excel-SQL Barrier: Leveraging DuckDB's Excel Extension

Now in MotherDuck & DuckDB, its never been easier to join in your data from spreadsheet sources.

The Open Lakehouse Stack: DuckDB and the Rise of Table Formats

2025/05/23 - Simon Späti

The Open Lakehouse Stack: DuckDB and the Rise of Table Formats

Learn how DuckDB and open table formats like Iceberg power a fast, composable analytics stack on affordable cloud storage

Taming Wild CSVs: Advanced DuckDB Techniques for Data Engineers

2025/05/17 - Ryan Boyd

Taming Wild CSVs: Advanced DuckDB Techniques for Data Engineers

How to ingest and query CSV files in DuckDB using auto-detection, sniffing, manual configuration and more.

DuckDB Ecosystem: May 2025

2025/05/08 - Simon Späti

DuckDB Ecosystem: May 2025

DuckDB news: Metabase driver queries Parquet files directly. FlockMTL integrates LLMs into SQL workflows. Doom clone runs in DuckDB-WASM. Spatial wins top honors.

MotherDuck lands on Tableau Cloud: Live, Fast Analytics Unleashed

2025/05/06 - Jacob Matson

MotherDuck lands on Tableau Cloud: Live, Fast Analytics Unleashed

Use MotherDuck to power your Tableau Cloud, Server, and Desktop dashboards.

Instant SQL is here: Speedrun ad-hoc queries as you type

2025/04/23 - Hamilton Ulmer

Instant SQL is here: Speedrun ad-hoc queries as you type

Type, see, tweak, repeat! Instant SQL is now in Preview in MotherDuck and the DuckDB Local UI. Bend reality with SQL superpowers to get real-time query results as you type.

The Data Engineer's Guide to Efficient Log Parsing with DuckDB/MotherDuck

2025/04/18 - Simon Späti

The Data Engineer's Guide to Efficient Log Parsing with DuckDB/MotherDuck

How to Query JSON and Log Files with SQL Using DuckDB and MotherDuck

Streaming in the Fast Lane: Oracle CDC to MotherDuck Using Estuary

2025/04/17 - Emily Lucek

Streaming in the Fast Lane: Oracle CDC to MotherDuck Using Estuary

Ducks and estuaries go together. So it’s no surprise that MotherDuck, a cloud data warehouse, pairs well with Estuary, a data pipeline platform.

MCP + DuckDB: Connect AI Assistants to Your Data Pipelines

2025/04/15 - Mehdi Ouazza

MCP + DuckDB: Connect AI Assistants to Your Data Pipelines

Use the Model Context Protocol (MCP) to connect Claude, Cursor, or other AI tools directly to DuckDB. Query data, generate SQL, and automate pipelines—without copy-pasting

DuckDB Ecosystem: April 2025

2025/04/05 - Simon Späti

DuckDB Ecosystem: April 2025

DuckDB news: Streaming support with new remote file caching. Community extensions expand real-time analytics. Event-driven processing patterns for data pipelines.

Simplifying IoT Analytics with MotherDuck

2025/04/03 - Faraz Hameed

Simplifying IoT Analytics with MotherDuck

Exploring the sweet spot between simplicity and capability in data systems, one IoT hackathon at a time.

Prompting? That’s so 2024. Welcome to Quack-to-SQL.

2025/04/01 - MotherDuck team

Prompting? That’s so 2024. Welcome to Quack-to-SQL.

Quack to SQL — our first AI model that understands duck sounds and translates them into queries.

Vector Technologies for AI: Extending Your Existing Data Stack

2025/03/28 - Simon Späti

Vector Technologies for AI: Extending Your Existing Data Stack

Understand when to use a vector database and how it differs from vector search engines.

DuckDB Ecosystem: March 2025

2025/03/07 - Simon Späti

DuckDB Ecosystem: March 2025

DuckDB news: v1.2 adds Google Sheets extension for SQL on spreadsheets. Duckberg queries Iceberg tables via Python. Smallpond sorts 110TB in 30 min using Ray.

A Beginner’s Guide to Geospatial with DuckDB Spatial and MotherDuck

2025/02/26 - Simon Späti

A Beginner’s Guide to Geospatial with DuckDB Spatial and MotherDuck

Unlock the power of geospatial analysis with DuckDB Spatial and MotherDuck, making location-based data processing faster, simpler, and more accessible for data engineers.

Effortless ETL for Unstructured Data with MotherDuck and Unstructured.io

2025/02/20 - Adithya Krishnan

Effortless ETL for Unstructured Data with MotherDuck and Unstructured.io

In this tutorial, learn how to load unstructured data into MotherDuck with Unstructured.io to build modern data pipelines and business applications that turn unstructured data intro structured data.

Faster health data analysis with MotherDuck & Preswald

2025/02/14 - Amrutha Gujjar

Faster health data analysis with MotherDuck & Preswald

Faster health data analysis with MotherDuck & Preswald

MotherDuck for Business Analytics: GDPR, SOC 2 Type II, Tiered Support, and New Plan Offerings

2025/02/11 - Sheila Sitaram

MotherDuck for Business Analytics: GDPR, SOC 2 Type II, Tiered Support, and New Plan Offerings

Introducing new features designed to better support businesses looking for their first data warehouse, including SOC 2 Type II and GDPR compliance, tiered support, read scaling, and a new Business Plan.

DuckDB Ecosystem: February 2025

2025/02/09 - Simon Späti

DuckDB Ecosystem: February 2025

DuckDB news: DuckCon #6 runs TPC-H SF300 on Raspberry Pi. SQL/PGQ graph queries 10-100x faster than Neo4j. Arrow Flight enables concurrent read/write access.

MotherDuck Now Supports DuckDB 1.2: Faster, Friendlier, Better Performance

2025/02/05 - Sheila Sitaram

MotherDuck Now Supports DuckDB 1.2: Faster, Friendlier, Better Performance

DuckDB 1.2 has launched, with improvements in performance, the SQL experience, CSV handling, and scalability - all fully supported in MotherDuck!

Why CSV Files Won’t Die and How DuckDB Conquers Them

2025/02/04 - Mehdi Ouazza

Why CSV Files Won’t Die and How DuckDB Conquers Them

Learn how you can pragmatically use DuckDB to parse any CSVs

The Data Engineering Toolkit: Essential Tools for Your Machine

2025/01/22 - Simon Späti

The Data Engineering Toolkit: Essential Tools for Your Machine

Master the essential data engineering toolkit—Linux commands, Docker, Python, SQL, and developer tools. A practical guide to the tools every DE needs.

Local dev and cloud prod for faster dbt development

2025/01/16 - Jacob Matson

Local dev and cloud prod for faster dbt development

Spark the Joy of beautiful local development workflows with MotherDuck & dbt

DuckDB Ecosystem: January 2025

2025/01/10 - Simon Späti

DuckDB Ecosystem: January 2025

DuckDB news: PyIceberg enables local Iceberg catalogs in Python. Zero-egress data sharing via Cloudflare R2. SQLFlow streams Kafka and Bluesky data with DuckDB SQL.

What’s New: Streamlined User Management, Metadata, and UI Enhancements

2024/12/21 - Sheila Sitaram

What’s New: Streamlined User Management, Metadata, and UI Enhancements

December’s feature roundup is focused on improving the user experience on multiple fronts. Introducing the User Management REST API, the Table Summary, and a read-only MD_INFORMATION_SCHEMA for metadata.

Why web developers should care about analytical databases

2024/12/18 - Mehdi Ouazza

Why web developers should care about analytical databases

Learn how analytical database can help your web app, with MotherDuck and the native integration with Vercel

Separating Storage and Compute in DuckDB

2024/12/17 - Jordan Tigani, Boaz Leskes

Separating Storage and Compute in DuckDB

Why separate storage and compute in DuckDB, how we do it in MotherDuck to enable sharing and future work.

Visualizing text embeddings using MotherDuck and marimo

2024/12/11 - Myles Scolnick

Visualizing text embeddings using MotherDuck and marimo

Visualizing text embeddings using MotherDuck and marimo

This Month in the DuckDB Ecosystem: December 2024

2024/12/11

This Month in the DuckDB Ecosystem: December 2024

DuckDB news: Query Bluesky social data via SQL. LLMs clean CRM data inside queries. Google Sheets extension launches. DuckDB-WASM powers sql-workbench.com browser IDE.

The Serverless Backend for Analytics: Introducing MotherDuck’s Native Integration on Vercel Marketplace

2024/12/09 - Sheila Sitaram

The Serverless Backend for Analytics: Introducing MotherDuck’s Native Integration on Vercel Marketplace

MotherDuck's native integration is now available on Vercel Marketplace. Developers can finally streamline their application maintenance overhead when building embedded analytics components and data apps. Start building with templates and a demo app!

Introducing Read Scaling

2024/12/04 - Jordan Tigani

Introducing Read Scaling

Read Scaling is now in preview! Read Scaling improves DuckDB SQL query performance by scaling out to multiple DuckDB instances, known as Read Scaling replicas. It is useful to speed up BI dashboards and data apps significantly.

Non-Profits <3 Small Data’s ROI

2024/12/03 - Sahil Gupta

Non-Profits <3 Small Data’s ROI

Learn how DoSomething, the premier platform fueling young people to change the world and actively shape the future of their communities, decided to adopt MotherDuck as their analytics data warehouse for efficient, high ROI analytics without the overhead.

Improved Control and Ergonomics on MotherDuck

2024/11/25 - Sheila Sitaram

Improved Control and Ergonomics on MotherDuck

November's Feature Roundup focuses on efficient query control and ergonomics. Now that's something to flap your wings about! Read on for updates on query management, in-memory performance, and connection management.

From Data Lake to Lakehouse: Can DuckDB be the best portable data catalog?

2024/11/14 - Mehdi Ouazza

From Data Lake to Lakehouse: Can DuckDB be the best portable data catalog?

Discover how catalog became crucial for Lakehouse and how DuckDB can help as a catalog

Quacking at the Edge: DuckDB on Raspberry Pi

2024/11/07 - Jordan Tigani

Quacking at the Edge: DuckDB on Raspberry Pi

Using MotherDuck’s Dual Query execution on a Raspberry Pi to play a quack sound when users sign up for our service.

This Month in the DuckDB Ecosystem: November 2024

2024/11/04 - Simon Späti

This Month in the DuckDB Ecosystem: November 2024

DuckDB news: HTTP extension queries REST APIs in SQL. Unity Catalog integration via dbt. Pivot tables extension. Drug database demo processes 6M records per minute.

The Data Warehouse powered by DuckDB SQL

2024/11/01 - Jacob Matson

The Data Warehouse powered by DuckDB SQL

Learn how DuckDB and MotherDuck transform data into business insights. DuckDB’s fast SQL processing meets MotherDuck’s cloud integration, creating a flexible, powerful data warehouse solution to solve complex business challenges and drive impact.

DuckDB Tutorial For Beginners

2024/10/31 - Mehdi Ouazza, Ryan Boyd

DuckDB Tutorial For Beginners

Learn DuckDB from scratch: install in 2 minutes, set up VSCode, and build your first SQL analytics project. No database experience required.

You asked, We Listened: Sharing, UI and Performance Improvements

2024/10/22 - Doug Raymond

You asked, We Listened: Sharing, UI and Performance Improvements

Recently-launched features in the MotherDuck data warehouse: preview result cell contents UI, dual execution performance improvements, auto update of data shared within your organization (or globally!)

Small Data is bigger (and hotter 🔥) than ever

2024/10/19 - Sheila Sitaram

Small Data is bigger (and hotter 🔥) than ever

Catch up on the latest developments around simple, scalable workflows for Real data volumes from the first Small Data SF!

Union and MotherDuck's Orchestrated Approach to Advanced Analytics

2024/10/18 - Daniel Sola

Union and MotherDuck's Orchestrated Approach to Advanced Analytics

Leverage MotherDuck & Union to orchestrate advanced analytics flows, AI & LLMs

Introducing the prompt() Function: Use the Power of LLMs with SQL!

2024/10/17 - Till Döhmen

Introducing the prompt() Function: Use the Power of LLMs with SQL!

We make your database smarter with small language model (and LLM) support in SQL

The Enterprise Case for DuckDB: 5 Key Categories and Why to Use it

2024/10/16 - Simon Späti

The Enterprise Case for DuckDB: 5 Key Categories and Why to Use it

Let's take a closer look to understand the various Enterprise use cases of DuckDB and how they can help on your data and analytics journey. 

Performant dbt pipelines with MotherDuck

2024/10/07 - Jacob Matson

Performant dbt pipelines with MotherDuck

Learn how to take your dbt pipelines to new heights with MotherDuck. This blog walks through a recap of our recent dbt + MotherDuck workshop from Small Data SF. Happy building!

This Month in the DuckDB Ecosystem: October 2024

2024/10/04 - Simon Späti

This Month in the DuckDB Ecosystem: October 2024

DuckDB news: v1.1 hits 6M monthly Python downloads. Spark API compatibility layer added. Build RAG apps with GPT-4o embeddings. Extensions reach 17M monthly downloads.

MotherDuck at Coalesce 2024: Your Ultimate Guide to Quack-tastic Fun!

2024/10/01 - Jerel Navarrete

MotherDuck at Coalesce 2024: Your Ultimate Guide to Quack-tastic Fun!

Get ready to make a splash at Coalesce 2024! 🦆 MotherDuck is bringing the fun to Las Vegas, and we can't wait to see you there. Whether you're a seasoned Coalesce pro or a first-timer, we've got everything you need to make this year's event unforgettable.

5 Hidden gems in DuckDB 1.1

2024/09/27 - Mehdi Ouazza

5 Hidden gems in DuckDB 1.1

Discover some underrated features from DuckDB 1.1

Getting started with modern GIS using DuckDB

2024/09/18 - Mehdi Ouazza

Getting started with modern GIS using DuckDB

Learn how DuckDB can help you started with Geospatial analysis

Generating a data app with your MotherDuck data

2024/09/06 - Till Döhmen

Generating a data app with your MotherDuck data

How to generate a web app dashboard based on your data

Swimming in Google Sheets with MotherDuck

2024/09/04 - Jacob Matson

Swimming in Google Sheets with MotherDuck

Learn how to use DuckDB's read_csv functionality to easily load data from Google Sheets into MotherDuck for Analysis!

This Month in the DuckDB Ecosystem: September 2024

2024/09/03 - Mehdi Ouazza

This Month in the DuckDB Ecosystem: September 2024

DuckDB news: pg_duckdb brings analytical queries to PostgreSQL. Ibis makes DuckDB its default backend, dropping Pandas. Getting Started with DuckDB book released.

Small Data SF: The Agenda is now live…with *NEW* hands-on workshops

2024/08/29 - Sheila Sitaram

Small Data SF: The Agenda is now live…with *NEW* hands-on workshops

We had such an awesome response to Small Data SF after launch: It was so great that we decided to add an additional day of hands-on workshops! Learn more about the full lineup on 9/23 - 9/24 and grab a ticket before it's too late.

Splicing Duck and Elephant DNA

2024/08/15 - Jordan Tigani, Brett Griffin

Splicing Duck and Elephant DNA

Introducing the DuckDB + Postgres Extension: You can have your analytics and transact them too with pg_duckdb by DuckDB Labs, MotherDuck, Hydra, Neon and Microsoft.

Introducing the embedding() function: Semantic search made easy with SQL!

2024/08/14 - Till Döhmen

Introducing the embedding() function: Semantic search made easy with SQL!

Doing RAG for LLMs or making semantic search results pop? MotherDuck and DuckDB make it easy!

Redshift Files: The Hunt for Big Data

2024/08/07 - Jordan Tigani

Redshift Files: The Hunt for Big Data

Jordan Tigani revisits his popular Big Data is Dead blog post with analysis of the data from the Redshift TPC is Not Enough paper.

This Month in the DuckDB Ecosystem: August 2024

2024/08/01 - Mehdi Ouazza

This Month in the DuckDB Ecosystem: August 2024

DuckDB news: Ranked #3 most desired database (StackOverflow 2024). Community extensions launch with registry. Delta Lake support via kernel. Memory management deep dive.

Secoda x MotherDuck: The newest member of the Modern Duck Stack 🦆

2024/07/19 - Andrew McEwen

Secoda x MotherDuck: The newest member of the Modern Duck Stack 🦆

The MotherDuck x Secoda integration allows you to enable data producers and consumers, regardless of technical ability, to easily locate and access the data they need! Learn how to enable the integration in two easy steps.

The Small Data Manifesto

2024/07/18 - Sheila Sitaram, MotherDuck team

The Small Data Manifesto

Read through the key principles and ethos behind the Small Data movement. Small data and AI is more powerful than you think. Data and AI that was once "Big" can now be handled by a single machine.

Pushing the Boundaries of Geo Data with MotherDuck and Geobase!

2024/07/03 - Saqib Rasul

Pushing the Boundaries of Geo Data with MotherDuck and Geobase!

Learn how to integrate MotherDuck and Geobase to visualize and build applications that have never been possible before using spatial-temporal data.

Reflections on SIGMOD/PODS 2024: Insights and Highlights

2024/07/02 - Stephanie Wang, Till Döhmen

Reflections on SIGMOD/PODS 2024: Insights and Highlights

MotherDuck Founding Engineer Stephanie Wang and AI/ML Lead Till Döhmen recap their highlights and key takeaways from SIGMOD PODS 2024 in Santiago, Chile. Learn more about emerging trends in Text2SQL, Hybrid Resource Allocation, Data Discovery, and more.

This Month in the DuckDB Ecosystem: July 2024

2024/07/01

This Month in the DuckDB Ecosystem: July 2024

DuckDB news: Native Delta Lake joins Iceberg for lakehouse support. DuckDB-WASM embeds full analytics in browsers. Vector search and embeddings tutorials. 12GB test data in 12s.

Search in DuckDB: Integrating Full Text and Embedding Methods

2024/06/20 - Adithya Krishnan

Search in DuckDB: Integrating Full Text and Embedding Methods

Explore search methods with DuckDB using Full-Text-Search and embeddings in a hybrid search engine fully accessible using SQL

This Month in the DuckDB Ecosystem: June 2024

2024/06/08 - Ryan Boyd

This Month in the DuckDB Ecosystem: June 2024

DuckDB news: v1.0.0 Snow Duck is production-ready. Query 150,000+ Hugging Face datasets directly. Crunchy Bridge adds DuckDB analytics to PostgreSQL. RAG implementations.

Congratulations to DuckDB Labs On Reaching 1.0!

2024/06/03 - MotherDuck team

Congratulations to DuckDB Labs On Reaching 1.0!

MotherDuck congratulates DuckDB Labs on their milestone, landmark 1.0 release. Learn more about its significance and what it means for MotherDuck! And stay tuned for some exciting news heading your way soon...

How we Saved 95% on Log Processing with Bacalhau and MotherDuck

2024/05/08 - Sean M. Tracey

How we Saved 95% on Log Processing with Bacalhau and MotherDuck

We stopped sifting our log data and started generating speedy logging insights to realize 95% in cost savings by pre-processing logs with Bacalhau and MotherDuck. How is that even possible? Let's walk through a step-by-step overview together.

Developing a RAG Knowledge Base with DuckDB

2024/05/06 - Adithya Krishnan

Developing a RAG Knowledge Base with DuckDB

Using DuckDB as the underlying storage for an AI-powered knowledge base, walk through a step-by-step tutorial using LlamaIndex, a data framework for LLMs, and Ollama, a simple API for creating, running, and managing models.

This Month in the DuckDB Ecosystem: April 2024

2024/04/30 - Luciano Galvão Filho

This Month in the DuckDB Ecosystem: April 2024

DuckDB news: Improved malformed CSV parsing. duckplyr brings DuckDB speed to R workflows. Parse 1 billion rows in Python efficiently. Geospatial raster via spatial extension.

Structured memory management for AI Applications and AI Agents with DuckDB

2024/04/29 - Vasilije Markovic

Structured memory management for AI Applications and AI Agents with DuckDB

Learn how to optimize Retrieval-Augmented Generation (RAG) systems with DuckDB, dlt, and Cognee to streamline data management and workflows for accurate LLM outputs.

Build sub-second data applications with MotherDuck’s Wasm SDK

2024/04/24 - Tino Tereshko

Build sub-second data applications with MotherDuck’s Wasm SDK

Learn how to use the MotherDuck WebAssembly (Wasm) SDK to solve the longstanding challenges of building and maintaining efficient, highly performant data-driven components and analytics applications.

Building Vector Search in DuckDB

2024/04/19 - Adithya Krishnan

Building Vector Search in DuckDB

Discover the power of AI search by using vector embeddings in natural language processing in the first blog in our informative three-part series! We'll cover the basics of vector embeddings and cosine similarity using DuckDB and MotherDuck.

This Month in the DuckDB Ecosystem: March 2024

2024/03/28 - Mehdi Ouazza

This Month in the DuckDB Ecosystem: March 2024

DuckDB news: Federated queries join PostgreSQL and blockchain data. PuppyGraph adds graph modeling on MotherDuck. End-to-end dbt pipelines. Co-creator Hannes interview.

How PuppyGraph Enables a Graph Model On MotherDuck Without a Graph Database

2024/03/26 - Zhenni Wu

How PuppyGraph Enables a Graph Model On MotherDuck Without a Graph Database

Discover how integrating DuckDB and PuppyGraph enables the incorporation of graph querying into your existing data warehouse!

DuckDB & dbt | End-To-End Data Engineering Project (2/3)

2024/03/22 - Mehdi Ouazza

DuckDB & dbt | End-To-End Data Engineering Project (2/3)

Explore the essentials of vector embeddings and cosine similarity in our latest blog series on DuckDB. Learn how to enhance your data analysis by effectively utilizing DuckDB and MotherDuck for text-related tasks in e-commerce and beyond.

Differential Storage: A Key Building Block For A DuckDB-Based Data Warehouse

2024/03/11 - Joseph Hwang

Differential Storage: A Key Building Block For A DuckDB-Based Data Warehouse

Differential Storage: A Key Building Block For A DuckDB-Based Data Warehouse

This Month in the DuckDB Ecosystem: February 2024

2024/03/01 - Ryan Boyd

This Month in the DuckDB Ecosystem: February 2024

DuckDB news: v0.10.0 adds backwards-compatible storage and faster CSV parsing. PyAirbyte uses DuckDB as default cache. DuckDB-NSQL-7B LLM generates SQL locally.

Why You Should Learn SQL in 2024

2024/01/31 - David Neal

Why You Should Learn SQL in 2024

SQL is an accessible, ubiquitous, and valuable language you can learn in 2024. It’s a marketable skill that practically every organization needs.

This Month in the DuckDB Ecosystem: January 2024

2024/01/30 - Ryan Boyd

This Month in the DuckDB Ecosystem: January 2024

DuckDB news: Query Google Sheets with one line of SQL. Join across PostgreSQL, SQLite, and MySQL databases. ERPL extension connects SAP data. Harlequin terminal IDE launches.

Streaming Data To MotherDuck With Estuary

2024/01/24 - Mehdi Ouazza

Streaming Data To MotherDuck With Estuary

Using CDC pipelines to stream data to MotherDuck

Perf is not enough

2024/01/18 - Jordan Tigani

Perf is not enough

Are database benchmarks still relevant ? Let's understand why it's a poor way to choose a database.

Just Released: Hybrid Query Processing Paper at CIDR 2024

2024/01/16 - Peter Boncz

Just Released: Hybrid Query Processing Paper at CIDR 2024

MotherDuck released its paper on Hybrid Query Processing at the Conference on Innovative Data (systems) Research [CIDR].

Analyze JSON Data Using SQL and DuckDB

2024/01/10 - David Neal

Analyze JSON Data Using SQL and DuckDB

Learn to read, parse, and query JSON data from files and APIs using SQL and DuckDB!

Introducing FixIt: an unreasonably effective AI error fixer for SQL

2024/01/03 - Till Döhmen, Hamilton Ulmer

Introducing FixIt: an unreasonably effective AI error fixer for SQL

FixIt will correct mistakes in your SQL queries based on the schema and DuckDB syntax. Based on a large language model (LLM).

This Month in the DuckDB Ecosystem: December 2023

2023/12/28 - Marcos Ortiz

This Month in the DuckDB Ecosystem: December 2023

DuckDB news: WASM extensions load directly in browsers. Query 32GB anti-money laundering datasets locally. BI-as-code tools Rill and Evidence gain traction. DuckCon #4 announced.

The Future of BI: Exploring the Impact of BI-as-Code Tools with DuckDB

2023/12/07 - Mehdi Ouazza

The Future of BI: Exploring the Impact of BI-as-Code Tools with DuckDB

The Future of BI: Exploring the Impact of BI-as-Code Tools with DuckDB

MotherDuck's HQ Nest is Ready for the Flock

2023/12/05 - Leslie Thomas

MotherDuck's HQ Nest is Ready for the Flock

MotherDuck's Seattle office opened as one of four company hubs, which also includes San Francisco, NYC and Amsterdam

Announcing: DuckDB code snippet sets with MotherDuck Sharing

2023/11/28 - Ryan Boyd

Announcing: DuckDB code snippet sets with MotherDuck Sharing

The DuckDB Snippets site has been upgrade to include sharing sets of code snippets in SQL and Python plus also sharing data with MotherDuck share URLs.

This Month in the DuckDB Ecosystem: November 2023

2023/11/22 - Marcos Ortiz

This Month in the DuckDB Ecosystem: November 2023

DuckDB news: Query 148TB of Hugging Face image data remotely. Awesome DuckDB resource collection launches. Benchmarks vs Spark, Dask, and Polars frameworks.

Analyze Your X (Twitter) Data with Node.js and DuckDB

2023/11/08 - David Neal

Analyze Your X (Twitter) Data with Node.js and DuckDB

Learn to use Node.js and DuckDB to query your X data!

Analyze Data in Azure with DuckDB or MotherDuck

2023/11/01 - David Neal

Analyze Data in Azure with DuckDB or MotherDuck

Analyze data stored in Azure blob storage using DuckDB or MotherDuck

This Month in the DuckDB Ecosystem: October 2023

2023/10/30 - Marcos Ortiz

This Month in the DuckDB Ecosystem: October 2023

DuckDB news: Enhanced CSV reader with dialect detection. sqlfmt formatter hits 1.5M downloads. Harlequin terminal IDE. Spatial data management course launches.

Exploring StackOverflow with DuckDB on MotherDuck (Part 2)

2023/10/02 - Michael Hunger

Exploring StackOverflow with DuckDB on MotherDuck (Part 2)

Exploring StackOverflow with DuckDB on MotherDuck (Part 2)

This Month in the DuckDB Ecosystem: September 2023

2023/09/30 - Marcos Ortiz

This Month in the DuckDB Ecosystem: September 2023

DuckDB news: v0.9 adds Azure storage and Iceberg support. MotherDuck raises Series B, opens to public. Harlequin terminal IDE launches. Vector similarity search.

This Month in the DuckDB Ecosystem: August 2023

2023/08/21 - Marcos Ortiz

This Month in the DuckDB Ecosystem: August 2023

DuckDB news: VS Code extension enables local and MotherDuck connections. Rill Data goes production. DuckDB Wasm Kit adds React hooks. Process 100s of GB with Coiled.

Beyond Storing Data: How to Use DuckDB, MotherDuck and Kestra for ETL

2023/08/18 - Anna Geller

Beyond Storing Data: How to Use DuckDB, MotherDuck and Kestra for ETL

Scheduled and event-driven data workflows with S3, MotherDuck, DuckDB and Kestra

Exploring StackOverflow with DuckDB on MotherDuck (Part 1)

2023/08/09 - Michael Hunger

Exploring StackOverflow with DuckDB on MotherDuck (Part 1)

Exploring StackOverflow with DuckDB on MotherDuck (Part 1)

This Month in the DuckDB Ecosystem: July 2023

2023/07/17 - Marcos Ortiz

This Month in the DuckDB Ecosystem: July 2023

DuckDB news: MotherDuck launches managed cloud service with hybrid execution. Apache Iceberg extension arrives. Vectorized Python UDFs added. DuckCon SF talks published.

Announcing MotherDuck: Hybrid Execution Scales DuckDB from your Laptop into the Cloud

2023/06/22 - MotherDuck team

Announcing MotherDuck: Hybrid Execution Scales DuckDB from your Laptop into the Cloud

Announcing MotherDuck: Hybrid Execution Scales DuckDB from your Laptop into the Cloud

This Month in the DuckDB Ecosystem: June 2023

2023/06/16 - Marcos Ortiz

This Month in the DuckDB Ecosystem: June 2023

DuckDB news: Query 50,000+ Hugging Face datasets directly via SQL. FalconVis cross-filters 10 million entries. Dagster integration tutorial. Spatial extension advances.

DuckDB vs Pandas vs Polars for Python Developers

2023/06/08 - Mehdi Ouazza

DuckDB vs Pandas vs Polars for Python Developers

A comparaison through a pragmatic analytic project of DuckDB, Polars and Pandas

This Month in the DuckDB Ecosystem: May 2023

2023/05/24 - Marcos Ortiz

This Month in the DuckDB Ecosystem: May 2023

DuckDB news: v0.8.0 brings Pivot/Unpivot and time series joins. Project hits 10,000 GitHub stars. Spatial extension and native Swift API launch.

The Simple Joys of Scaling Up

2023/05/11 - Jordan Tigani

The Simple Joys of Scaling Up

Explores why scale-out became so dominant, whether those rationales still hold, and some joyful advantages of scale-up architecture.

This Month in the DuckDB Ecosystem: April 2023

2023/04/17 - Marcos Ortiz

This Month in the DuckDB Ecosystem: April 2023

DuckDB news: Mode adopts DuckDB for visual data exploration. DataCamp Workspace adds SQL-first tool. LangChain Document Loader integration. dbt extension launches.

This Month in the DuckDB Ecosystem: March 2023

2023/03/23 - Marcos Ortiz

This Month in the DuckDB Ecosystem: March 2023

DuckDB news: JSON extension queries nested data as tables. Spatial analysis runs on AWS Lambda. JupySQL enables large dataset plotting. Streamlit integration.

This Month in the DuckDB Ecosystem: February 2023

2023/02/22 - Marcos Ortiz

This Month in the DuckDB Ecosystem: February 2023

DuckDB news: v0.7.0 adds JSON ingestion, partitioned Parquet export, and UPSERT support. Benchmarks show 4-200x faster than Postgres on AWS cost queries.

Solving Advent of Code with DuckDB and dbt

2023/02/09 - Graham Wetzler

Solving Advent of Code with DuckDB and dbt

Tackling 10 days of AOC with DuckDB and dbt-duckdb, a DuckDB adapter for dbt

Big Data is Dead

2023/02/07 - Jordan Tigani

Big Data is Dead

Big data is dead. Long live easy data.

Python Faker for DuckDB Fake Data Generation

2023/01/31 - Ryan Boyd

Python Faker for DuckDB Fake Data Generation

Using the Python Faker library to generate data for exploring DuckDB

How to analyze SQLite databases in DuckDB

2023/01/24 - Ryan Boyd

How to analyze SQLite databases in DuckDB

DuckDB is often referred to as the SQLite for analytics. This blog post talks about how to query SQLite transactional databases from within the DuckDB analytics database.

This Month in the DuckDB Ecosystem: January 2023

2023/01/12 - Marcos Ortiz

This Month in the DuckDB Ecosystem: January 2023

DuckDB news: Hits 1 million monthly PyPI downloads. Modern Data Stack in a Box runs analytics on single machine. CDC pipelines from PostgreSQL via Redpanda.

How We're Making Analytics Ducking Awesome

2023/01/02 - Ryan Boyd

How We're Making Analytics Ducking Awesome

MotherDuck on Podcasts, in the News and at conferences.

This Month in the DuckDB Ecosystem

2022/12/15 - Marcos Ortiz

This Month in the DuckDB Ecosystem

DuckDB news: Query 250GB Common Crawl data locally via HTTPFS. Polars and Arrow integration for Rust. Co-creator Mark Raasveldt featured. DuckCon Brussels announced.

MotherDuck Raises $47.5 Million to Make Analytics Fun, Frictionless and Ducking Awesome

2022/11/15 - MotherDuck team

MotherDuck Raises $47.5 Million to Make Analytics Fun, Frictionless and Ducking Awesome

MotherDuck is a new serverless data warehouse and backend for data apps based on DuckDB. MotherDuck provides SQL analytics at scale.

Why Use DuckDB for Analytics?

2022/11/11 - Tino Tereshko, Ryan Boyd

Why Use DuckDB for Analytics?

Fast aggregations, excellent SQL support, runs anywhere, provides simplified data access: cloud and local, works with your tools and frameworks.

Hello, World! Quack. Quack.

2022/11/08 - MotherDuck team

Hello, World! Quack. Quack.

MotherDuck is building a serverless SQL analytics platform to use as a data warehouse and backend to data apps. We believe that big data is dead and we should be focused on making data analysis easier with DuckDB.