What the duck is a Duckling? MotherDuck’s approach to scaling DuckDB

2025/04/16 - 5 min read

BY

In the very first days of MotherDuck as a company, back before the co-founders had even met in person to kick off the company in mid-2022, we realized we needed a name to call the DuckDB instances we were running on behalf of users in the cloud. The idea behind the name MotherDuck, in the first place, was that we were marshaling a flock of DuckDB instances. What does a mother duck manage? "Ducklings", of course. The name stuck, and MotherDuck's DuckDB instances became Ducklings.

How is a Duckling different from a standard Data Warehouse instance?

Most data warehouses are built as monoliths, where every user in the organization shares the same data warehouse compute resources. Unless this warehouse is over-provisioned (calling all admins with 3XL instances out there!), it often begins to crack under high concurrency. Many analysts know the pain of trying to run a query while someone else is running a giant report, and having their workload slow to a crawl.

legacy_data_warehouse.png

Per-user Tenancy for Internal Analytics / BI

MotherDuck’s approach with Ducklings is very different. Instead of all users sharing the same instance, each user gets their own Duckling which handles their workload and automatically shuts down if not being used.

motherduck_data_warehouse.png

And, of course, all users are accessing a consistent view of the data warehouse shared either throughout the organization or with individual users in the org.

Vertical Scaling: Configurable per-user

Is your CEO complaining that they need more compute? Each Duckling can be scaled up or down to meet the needs of the user.

MotherDuck has three Duckling sizes: Pulse, Standard and Jumbo.

duckling_sizes.png

Author’s note: We have a multi-terabyte data warehouse at MotherDuck and our CEO, Jordan, is able to use the smallest Ducklings, called Pulses to understand what is going on in the business every day

duckling_size_example.png

Horizontal Read Scaling: Configurable per-user

Sometimes the data warehouse doesn't know the identity of the end users. For example, BI tools typically share a single database connection but then may have dozens of users running queries at the same time. This would ordinarily break the "one-user-per-duckling" pattern.

MotherDuck’s read scaling is designed for these types of cases – providing an extra boost in compute through horizontal scaling and maintaining the pattern of “one-user-per-duckling!”

duckling_read_scaling.png

Duckling-powered Customer-facing Analytics

Customer-facing analytics use cases have different requirements than an analytics stack built to power your internal data teams. It often starts with a simple customer ask – eg “I want to see a dashboard of revenue trends” – which engineering implements on top of the transactional database (like Postgres). Eventually, with more customer demands and growth, your transactional database is on fire. You’re spending all day experimenting with different indexes or blocked by an eng team that owns database configuration and you’re searching for an analytics solution.

traditional_app.png

MotherDuck’s per-user tenancy model is especially powerful for these types of applications. Each customer can have their own Duckling(s) with isolated data, mitigating many types of security concerns with multitenant databases. Since each user has their own Duckling(s), you can rid yourself of scale anxiety and know that MotherDuck will always be ready to handle new customers as fast as your sales team can sign deals.

data_app_scaling_motherduck.png

As we saw with the internal data analytics use case, you can configure the Duckling size per customer, enabling you to offer higher levels of service and scale to your most important customers.

Scaling from your Duckling back to the Laptop

Historically, laptops were extremely under-powered and you needed to scale to the cloud to get fast compute resources. With laptops now being as powerful as supercomputers of yesteryear, we still scale to the cloud for 24x7 availability, sharing/collaboration, and centralized data management, but the powerful chips on our laps are underutilized.

With MotherDuck, you can scale your workloads back to your laptop to take advantage of local compute power and zero-latency in combination with the power offered by your cloud-based Duckling. This happens automatically in the MotherDuck UI to enable the quick aggregation and filtering of data in the Column Explorer. The MotherDuck SQL query planner automatically decides whether to bring the compute to the data or the data to the compute. We call this Dual Execution and we wrote a CIDR paper on this technology (formerly called hybrid query execution).

As you build your own applications, you can decide whether to take advantage of client-side compute and zero latency queries using Dual Execution, or have all the compute happen on MotherDuck’s servers.

Go launch your flock of Ducklings

MotherDuck makes it easy to scale from megabytes to terabytes with a combination of per-user Duckling tenancy, vertical scaling to more powerful Ducklings, horizontal scaling to more Ducklings and dual execution. These scaling techniques enable the super-efficient DuckDB SQL engine to power internal data analytics as well as customer-facing analytics with ease.

Try MotherDuck today with our 21-day free trial. And, if you want to learn more about how others (including Okta and smallpond) are scaling data workloads using DuckDB, watch our recent panel of experts discussing scale.

"We've now got these new levers for performance scaling because we can split and store the data and query efficiently as needed. If we need to handle a load spike or a huge amount of queries, we can spin up more ducklings on demand."
Ravi Chandra, CTO @ Dexibit

CONTENT
  1. How is a Duckling different from a standard Data Warehouse instance?
  2. Per-user Tenancy for Internal Analytics / BI
  3. Vertical Scaling: Configurable per-user
  4. Horizontal Read Scaling: Configurable per-user
  5. Duckling-powered Customer-facing Analytics
  6. Scaling from your Duckling back to the Laptop
  7. Go launch your flock of Ducklings

Start using MotherDuck now!

blog subscription icon

Subscribe to motherduck blog

PREVIOUS POSTS

DuckDB Ecosystem: April 2025

2025/04/05 - Simon Späti

DuckDB Ecosystem: April 2025

DuckDB Monthly #28: DuckDB goes streaming, local caching & more!

Close the Loop: Faster Data Pipelines with MCP, DuckDB & AI

2025/04/15 - Mehdi Ouazza

Close the Loop: Faster Data Pipelines with MCP, DuckDB & AI

How the MCP can accelerate data engineering workflows by connecting AI copilots directly to data tools like DuckDB