Simon Späti

Simon Späti
Technical Author & Data Engineer
Simon is a Data Engineer and Technical Author with 20+ years of experience in the data field. He's the author of the Data Engineering Blog (ssp.sh), curator of the Data Engineering Vault (vault.ssp.sh), and currently writes a book about Data Engineering Design Patterns (dedp.online). Simon maintains an awareness of open-source data engineering technologies and enjoys sharing his knowledge with the community.
26 POSTS

2025/11/24 - Simon Späti
Branch, Test, Deploy: A Git-Inspired Approach for Data
This article explores how to bring Git style workflows like branching, testing, and deploying to your data stack. Learn how concepts like zero copy cloning and metadata pointers can finally give you isolated test environments.

2025/11/12 - Simon Späti
DuckDB Ecosystem: November 2025
DuckDB Monthly #35: DuckDB extensions, DuckLake, DataFrame, and more!

2025/10/30 - Simon Späti
4 Senior Data Engineers Answer 10 Top Reddit Questions
A great panel answering the most voted/commented data questions on Reddit

2025/10/07 - Simon Späti
DuckDB Ecosystem: October 2025
DuckDB Monthly #34: DuckDB 1.4.0 LTS, 100× Spark benchmarks, official Docker image and more!

2025/09/09 - Simon Späti
DuckDB Ecosystem: September 2025
DuckDB Monthly #33: DuckDB 58× faster spatial joins, pg_duckdb 1.0, and 79% Snowflake cost savings

2025/08/19 - Simon Späti
Why Semantic Layers Matter — and How to Build One with DuckDB
Learn what a semantic layer is, why it matters, and how to build a simple one with DuckDB and Ibis using just YAML and Python

2025/08/07 - Simon Späti
DuckDB Ecosystem: August 2025
DuckDB Monthly #32: DuckDB hits 50.7% growth—vector search, WASM, and analytics take the spotlight
2025/07/21 - Simon Späti
Summer Data Engineering Roadmap
A comprehensive 3-week structured roadmap for learning data engineering fundamentals, from SQL and Git basics to advanced topics like streaming, data quality, and DevOps.RetryClaude can make mistakes. Please double-check responses.

2025/07/08 - Simon Späti
This Month in the DuckDB Ecosystem: July 2025
DuckDB Monthly #31: Kafka Integration, Browser-Based Analytics, and Lake Format Innovations
2025/07/03 - Simon Späti
The Data Engineer Toolkit: Infrastructure, DevOps, and Beyond
A comprehensive guide to advanced data engineering tools covering everything from SQL engines and orchestration platforms to DevOps, data quality, AI workflows, and the soft skills needed to build production-grade data platforms.
SUBSCRIBE
Subscribe to MotherDuck Blog




