Categories - engineering

2026

Tuition I Paid on This Pipeline

May 15 · 4 min

Four Tests Worth Adding to Every dbt schema.yml — and the Real Bugs They Catch

May 15 · 3 min

Unit-Testing Airflow DAGs Without Starting a Scheduler

May 14 · 3 min

Managing "Semi-Static" Reference Tables with dbt Seeds

May 14 · 2 min

dbt's staging → dim/fact → mart Layering: What Problem Does It Actually Solve?

May 14 · 3 min

Partition + Cluster Without the Hand-Waving: A Worked Example on Taxi Data

May 14 · 3 min

External Table + CTAS vs. LoadJob: A Two-Step Path to Land Parquet in BigQuery

May 13 · 2 min

Manual Trigger + params: Treating Airflow as a Programmable Batch CLI

May 13 · 2 min

Airflow DAG Details That Look Pointless — Until They Save the Run

May 13 · 3 min

GCS Lifecycle Rules: Two Mindsets — Delete vs. Demote

May 13 · 2 min

0 %