From Curiosity to Impact How DoSomething Democratized Data
2025/09/10For any organization, especially lean data teams at startups or non-profits, DoSomething's story is a powerful testament to the fact that you don't need to choose between over-engineered, expensive solutions and the limitations of spreadsheets. With the right modern data stack, you can achieve speed, affordability, and a culture of data empowerment.For a lean data team, the choice between slow, costly enterprise tools and wrestling with spreadsheets is a constant struggle. Hear how DoSomething.org, a non-profit fueling young people to change the world, escaped this data trap. By building a new modern data stack with MotherDuck, they achieved a 20x cost reduction, transformed their data culture, and empowered their entire team with self-serve analytics—all managed by a single data engineer.
The Challenge: A High-Cost, Low-Value Data System
DoSomething has a long history of leveraging technology for good, but their previous data stack, built between 2015 and 2024, had become a significant burden. The architecture consisted of Fivetran for data extraction, a Postgres database used as a data warehouse, and dbt Cloud for nightly transformations.
This system presented several critical problems:
- Cripplingly Slow Queries: As an OLTP (Online Transaction Processing) database, Postgres is not optimized for heavy analytical queries. Sahil, DoSomething's Senior Data Engineer, recalls queries taking hours to run, "if they ran at all." This made it impossible for stakeholders to get timely insights.
- Exorbitant Costs: The infrastructure was expensive. Fivetran, used to ingest massive volumes of web event data that were rarely used, became the "largest single line item" in their data budget. The 4TB Postgres data warehouse was also costly to maintain.
- Data Request Bottlenecks: The slow performance of Tableau dashboards connected to Postgres meant most insights were locked behind the data team. Non-technical users had no direct access, leading to a constant stream of data requests and an inability for them to answer their own questions.
The Solution: A Lean, Fast, and Accessible Modern Data Stack
During a complete platform rebuild in 2024, Sahil discovered DuckDB and the "small data movement." The simplicity and performance of DuckDB for local analysis led him to MotherDuck, a serverless data warehouse built on DuckDB. This became the cornerstone of their new, cost-effective data stack.
The new architecture is a model of efficiency:
- Extraction: Fivetran is now used sparingly, only for Google Analytics 4 data, putting them on the free tier.
- Loading: Data is loaded directly into MotherDuck, which provides significant compression and performance.
- Transformation: Nightly dbt Cloud jobs that took 8 hours were replaced with a dbt Core project running via GitHub Actions. The new runtime? Just two to three minutes, falling within GitHub's free tier.
The Impact: 20x Lower Costs and True Self-Service Analytics
The shift to MotherDuck delivered transformative results, enabling DoSomething to remain a sustainable, data-driven organization.
1. Massive Cost Reduction (95% Savings) The new data system costs roughly 5% of the old one. The bill for their MotherDuck data warehouse is approximately 20 times less than what they were paying AWS for their previous Postgres-based system. This dramatic cost reduction was not just a budget win; it was essential for the non-profit's long-term sustainability.
2. Simplicity and Blazing Speed The new stack is orders of magnitude more performant. With dbt runs dropping from 8 hours to 2 minutes and queries returning in seconds, the solo data engineer can develop and iterate with unprecedented speed. This is the power of using a dedicated OLAP (Online Analytical Processing) system like MotherDuck for analytics instead of an OLTP database like Postgres.
3. Democratized Data with Self-Serve Analytics The most profound change was the empowerment of non-technical team members. Sahil describes a "revelatory moment" when a colleague answered their own data question using the MotherDuck UI without knowing SQL.
As demonstrated in the video, MotherDuck's user-friendly interface makes this possible:
- Column & Table Explorer: Users can instantly see aggregations, value distributions, and schema information to quickly understand the data without writing a single line of code.
- Natural Language Queries: Using the "Command-K" feature, anyone can ask questions in plain English (e.g., "how many active actions have a scholarship?") and get back a SQL query and the answer.
- AI in the Warehouse: The
PROMPT()
function allows users to perform complex analysis, like sentiment analysis or categorization, on text data directly within the warehouse, opening the door for agentic AI workflows.
This shift from data request dependence to self-sufficiency has allowed the entire DoSomething team to have a clearer, more immediate understanding of their user activity, fostering a true data-driven culture.
For any organization, especially lean data teams at startups or non-profits, DoSomething's story is a powerful testament to the fact that you don't need to choose between over-engineered, expensive solutions and the limitations of spreadsheets. With the right modern data stack, you can achieve speed, affordability, and a culture of data empowerment.
Related Videos

2025-07-26
How to Efficiently Load Data into DuckLake with Estuary
Learn how DuckLake, MotherDuck, and Estuary enable fast, real-time data integration and analytics with modern open table formats, cloud data warehousing, and no-code streaming pipelines.
YouTube

20:44
2025-06-13
What can Postgres learn from DuckDB? (PGConf.dev 2025)
DuckDB an open source SQL analytics engine that is quickly growing in popularity. This begs the question: What can Postgres learn from DuckDB?
YouTube
Ecosystem
Talk

2025-06-12
pg_duckdb: Ducking awesome analytics in Postgres
Supercharge your Postgres analytics! This talk shows how the pg_duckdb extension accelerates your slowest queries instantly, often with zero code changes. Learn practical tips and how to use remote columnar storage for even more speed.
Talk
Sources