← Explore

Posts tagged with data-pipelines

Data Eng Daily · ·5 min read

Kafka to Iceberg in 2026: Nine Options, Three That Matter

Every data team running Kafka eventually hits the same wall: how do I get these events into my lakehouse so analysts can actually query them?

kafkaapache-icebergstreaming
Data Eng Daily · ·6 min read

Flink CDC 3.6.0: Oracle Finally Gets a Real Pipeline Connector

If you've been duct-taping Oracle CDC into Flink pipelines using the DataStream API and custom Debezium wrappers, version 3.6.

apache-flinkcdcoracle
Data Eng Daily · ·5 min read

Stop Running Spark for 40 GB Jobs

Every quarter, someone on the team asks: "Do we really need this Spark cluster?" For most of the jobs running on it, the answer in 2026 is no.

duckdbapache-sparkbenchmarks
Data Eng Daily · ·4 min read

Airflow 2 EOL Is April 22 — Here's What Actually Breaks

Twenty days from now, Apache Airflow 2.x reaches end of life.

apache-airflowmigrationorchestration
Data Eng Daily · ·5 min read

dbt on Flink Won't Unify Your Data Stack

#dbt on Flink Won't Unify Your Data Stack Three days ago Confluent dropped the dbt-confluent adapter, and the data engineering corner of the internet lost...

dbtapache-flinkstreaming