Question 1

What is MotherDuck and how does it handle AI agent queries?

Accepted Answer

MotherDuck is a cloud data warehouse built on DuckDB. It runs SQL queries using vectorized, parallel execution on your CPU, so it's fast and cheap. In this webinar, Jacob Matson compares MotherDuck's cost and speed against legacy warehouses and walks through why the architecture works well for agents that hit the database constantly.

Question 2

What is Orchestra and how does it work with AI agents?

Accepted Answer

Orchestra is a serverless orchestration engine. You define pipelines in YAML, and it handles monitoring, metadata, alerting, and lineage. In the demo, Hugo has Claude Code generate the Python ingestion code and the Orchestra pipeline YAML, then kick off the pipeline — all from a single natural language prompt.

Question 3

How do MotherDuck snapshots keep agent pipelines safe?

Accepted Answer

MotherDuck snapshots are immutable, point-in-time copies of a database. They use zero-copy cloning, so promoting data from staging to production is fast and costs almost nothing in extra storage. They also work as an undo button — if an agent breaks something, you roll back to the snapshot instead of re-running the whole pipeline. MotherDuck keeps 7 days of snapshots by default, and for incremental workloads the extra storage is negligible.

Question 4

What is a MotherDuck Dive and how is it used in this pipeline?

Accepted Answer

A MotherDuck Dive is a set of TSX (React) files that render interactive visualizations from your warehouse data. Dives live in source control, so they're versioned and reproducible. In this demo, the AI agent generates and updates a Dive at the end of the pipeline — no manual dashboard building needed.

Question 5

How complex is the Claude Code skill used in the demo?

Accepted Answer

The demo used a single Claude Code skill — a markdown file with step-by-step instructions telling the agent how to scaffold the pipeline. Hugo's skill was verbose (AI-generated), but you could write the same thing in about 20 lines. The skill references the MotherDuck MCP server for database operations, so no custom MotherDuck-specific skills were needed.

Agentic Data Engineering: Building Pipelines End-to-End with AI

TL;DR

Why AI changes how we build pipelines

The demo: one skill, one agent, one pipeline

Staging, snapshots, and safe promotion

Dives: BI as code from your pipeline

Q&A highlights

FAQS

What is MotherDuck and how does it handle AI agent queries?

What is Orchestra and how does it work with AI agents?

How do MotherDuck snapshots keep agent pipelines safe?

What is a MotherDuck Dive and how is it used in this pipeline?

How complex is the Claude Code skill used in the demo?

Related Videos

Streaming Made Easy: Practical Postgres CDC with Streamkap + MotherDuck

The Database Inside Your Lakehouse: A DuckLake Architecture Deep Dive

What Makes a Great Data Viz? DiveMaxxing Winners Revealed