Skip to main content

Data Warehousing How-to

What is a Data Warehouse?

A data warehouse is used for storing and analyzing data from multiple sources into one place. It uses a common query language (SQL) and is often the jumping off point for reporting, analytics, and supporting strategic decision making. The data warehouse serves as the bridge from raw data to a governed, scalable data set to serve downstream consumers. While DuckDB is excellent at processing and serving large datasets, MotherDuck adds the missing components to make it a true data warehouse.

Architecture

Some common tools in a data stack are:

  • BI tools for data visualization and reporting
    • Omni, Tableau, PowerBI
  • Ingestion tools to load data in from business apps across your enterprise
    • Fivetran, Airbyte, Dlthub
  • Transformation tools to make the data more usable
    • dbt, sqlmesh, paradime.io
  • Orchestration tools to stitch it all together
    • Airflow, Kestra, Dagster

Some of these groups of tools have specific pages, which are linked here:

Please do not hesitate to contact us if you need help along your journey.