Using Amazon SageMaker Lakehouse with DuckDB and Glue Data Catalog·tobilg.com·Jun 27, 2025
Incremental Processing using Netflix Maestro and Apache Iceberg·netflixtechblog.com·Nov 23, 2023
Exploring Spark Catalog — Mastering Pyspark·pyspark.itversity.com·Jun 20, 2023
Data cataloguing in Spark | by Petrica Leuca | Medium·12ft.io·Jun 20, 2023
Streaming from Apache Iceberg - QCon NY 2023: Building Low-Latency and Cost Effective Data Pipelines (Steven Wu @ Apple)·speakerdeck.com·Jun 14, 2023
red-data-tools/YouPlot: A command line tool that draw plots on the terminal.·github.com·May 10, 2023
Data processing with Spark: data catalog – own your data·ownyourdata.ai·Apr 26, 2023
Delivering High Quality Analytics at Netflix·youtube.com·Apr 21, 2023
Same Data, Sturdier Frame: Layering in Dimensional Data Modeling at Whatnot·medium.com·Mar 20, 2023
Unit Testing for Data Engineers.·dataengineeringcentral.substack.com·Feb 14, 2023
r/dataengineering - What did ETL look like before the "modern data stack" was a thing?·reddit.com·Feb 13, 2023
r/dataengineering - Which lakehouse table format do you expect your organization will be using by the end of 2023?·reddit.com·Jan 31, 2023
Data Systems Tend Towards Production·ian-macomber.medium.com·Nov 29, 2022
Airbyte Monitoring with dbt and Metabase - Part I | Airbyte·airbyte.com·Nov 18, 2022
Building a Data Engineering Project in 20 Minutes·sspaeti.com·Nov 18, 2022
r/dataengineering - Has anyone built a data warehouse primarily using Databricks?·reddit.com·Oct 18, 2022
The Contract-Powered Data Platform | Buz·buz.dev·Oct 11, 2022
The Breakdown: Databricks, Snowflake, and Open Source Positioning in the Data World·productofdata.substack.com·Oct 10, 2022
Yet another post on Data Contracts - Part 1·davidsj.substack.com·Oct 8, 2022
The missing piece of the modern data stack·benn.substack.com·Sep 30, 2022
The modern data experience (w/ Benn Stancil)·youtube.com·Sep 30, 2022
Engineers Shouldn’t Write ETL: A Guide to Building a High Functioning Data Science Department | Stitch Fix Technology – Multithreaded·multithreaded.stitchfix.com·Sep 30, 2022
Upgrading Data Warehouse Infrastructure at Airbnb·medium.com·Sep 27, 2022
Ep 30: The Personal Data Warehouse (w/ Jordan Tigani of MotherDuck)·roundup.getdbt.com·Sep 22, 2022