Simon Späti

Technical Author & Data Engineer

Simon is a Data Engineer and Technical Author with 20+ years of experience in the data field. He's the author of the Data Engineering Blog (ssp.sh), curator of the Data Engineering Vault (vault.ssp.sh), and is currently writing a book about Data Engineering Design Patterns (dedp.online). Simon keeps up with open-source data engineering technologies and enjoys sharing his knowledge with the community.

26 POSTS

Branch, Test, Deploy: A Git-Inspired Approach for Data

2025/11/24 - Simon Späti

This article explores how to bring Git-style workflows like branching, testing, and deploying to your data stack. Learn how concepts like zero-copy cloning and metadata pointers can finally give you isolated test environments.

DuckDB Ecosystem: November 2025

2025/11/12 - Simon Späti

DuckDB Monthly #35: DuckDB extensions, DuckLake, DataFrame, and more!

4 Senior Data Engineers Answer 10 Top Reddit Questions

2025/10/30 - Simon Späti

A great panel answering the most upvoted and most commented data questions on Reddit

DuckDB Ecosystem: October 2025

2025/10/07 - Simon Späti

DuckDB Monthly #34: DuckDB 1.4.0 LTS, 100× Spark benchmarks, official Docker image and more!

DuckDB Ecosystem: September 2025

2025/09/09 - Simon Späti

DuckDB Monthly #33: DuckDB 58× faster spatial joins, pg_duckdb 1.0, and 79% Snowflake cost savings

Why Semantic Layers Matter — and How to Build One with DuckDB

2025/08/19 - Simon Späti

Learn what a semantic layer is, why it matters, and how to build a simple one with DuckDB and Ibis using just YAML and Python

DuckDB Ecosystem: August 2025

2025/08/07 - Simon Späti

DuckDB Monthly #32: DuckDB hits 50.7% growth—vector search, WASM, and analytics take the spotlight

Summer Data Engineering Roadmap

2025/07/21 - Simon Späti

A comprehensive 3-week structured roadmap for learning data engineering fundamentals, from SQL and Git basics to advanced topics like streaming, data quality, and DevOps.

This Month in the DuckDB Ecosystem: July 2025

2025/07/08 - Simon Späti

DuckDB Monthly #31: Kafka Integration, Browser-Based Analytics, and Lake Format Innovations

The Data Engineer Toolkit: Infrastructure, DevOps, and Beyond

2025/07/03 - Simon Späti

A comprehensive guide to advanced data engineering tools covering everything from SQL engines and orchestration platforms to DevOps, data quality, AI workflows, and the soft skills needed to build production-grade data platforms.

DuckDB Ecosystem: June 2025

2025/06/06 - Simon Späti

DuckDB Monthly #30: DuckDB's new table format, Radio extension and more!

The Open Lakehouse Stack: DuckDB and the Rise of Table Formats

2025/05/23 - Simon Späti

Learn how DuckDB and open table formats like Iceberg power a fast, composable analytics stack on affordable cloud storage
