CDC Pipeline With Lakeflow Declarative Pipeline

Demo Type

Product Tutorial

Duration

Self-paced

Social

What you’ll learn

This demo highlights how Lakeflow Declarative Pipeline simplifies CDC (change data capture).

CDC is typically done by ingesting changes from external systems (ERP, SQL databases) with tools like Fivetran, Debezium, etc.

In this demo, we’ll show you how to re-create your table consuming CDC information.

We’ll also implement an SCD2 (Slowly Changing Dimension table of type 2). While this can be really tricky when data arrives out of order, Declarative Pipeline makes it super simple with just one keyword.

Ultimately, we’ll show you how to programatically scan multiple incoming folders and trigger N streams (one for each CDC table), leveraging Declarative Pipeline with Python.

 

To install the demo, get a free Databricks workspace and execute the following two commands in a Python notebook

Dbdemos is a Python library that installs complete Databricks demos in your workspaces. Dbdemos will load and start notebooks, Declarative Pipelines, clusters, Databricks SQL dashboards, warehouse models … See how to use dbdemos

 

Dbdemos is distributed as a GitHub project.

For more details, please view the GitHub README.md file and follow the documentation.
Dbdemos is provided as is. See the
License and Notice for more information.
Databricks does not offer official support for dbdemos and the associated assets.
For any issue, please open a ticket and the demo team will have a look on a best-effort basis.

 

Note - at Data + AI Summit in June 2025, Databricks released Lakeflow. Lakeflow unifies Data Engineering with Lakeflow Connect, Lakeflow Declarative Pipelines (previously known as DLT), and Lakeflow Jobs (previously known as Workflows).

Recommended

<p>Unit Testing Lakeflow Declarative Pipeline for Production-Grade Pipelines</p>

Tutorial

Unit Testing Lakeflow Declarative Pipeline for Production-Grade Pipelines

<p>Full Lakeflow Declarative Pipeline&nbsp;- Loan</p>

Tutorial

Full Lakeflow Declarative Pipeline - Loan

<p>CDC Pipeline With Delta</p>

Tutorial

CDC Pipeline With Delta

These assets will be installed in this Databricks demos:

orchestrate

Ready to get started?