Reliable data pipelines made easy
Simplify batch and streaming ETL with automated reliability and built-in data quality.
TOP TEAMS SUCCEED WITH INTELLIGENT DATA PIPELINESData pipeline best practices, codified
Simply declare the data transformations you need — let Spark Declarative Pipelines handle the rest.Built to simplify data pipelining
Building and operating data pipelines can be hard — but it doesn’t have to be. Spark Declarative Pipelines is built for powerful simplicity, so you can perform robust ETL with just a few lines of code.Use Genie Code to automate ETL workloads, optimize queries and build pipelines through natural conversation.

Leveraging Spark’s unified API for batch and stream processing, Spark Declarative Pipelines allows you to easily toggle between processing modes.

Spark Declarative Pipelines makes it easy to optimize pipeline performance by declaring an entire incremental data pipeline with streaming tables and materialized views.

Spark Declarative Pipelines supports a broad ecosystem of sources and sinks. Load data from any source — including cloud storage, message buses, change data feeds, databases and enterprise apps.

Expectations allow you to guarantee data arriving in tables meets data quality requirements and provides insights on data quality with each pipeline update.

Develop pipelines in the IDE for Data Engineering without any context switching. See the DAG, data preview and execution insights in one UI. Develop code easily with autocomplete, in-line errors and diagnostics.







More features
Streamline your data pipelines

Make sources, transformations and destinations simple
Declarative programming means you get to harness the power of ETL on Databricks with just a few lines of code.
Explore Spark Declarative Pipelines demos
Usage-based pricing keeps spending in check
Only pay for the products you use at per-second granularity.Discover more
Explore other integrated, intelligent offerings on Databricks.Lakeflow Connect
Efficient data ingestion connectors from any source and native integration with the Data Intelligence Platform unlock easy access to analytics and AI, with unified governance.
Lakeflow Jobs
Easily define, manage and monitor multitask workflows for ETL, analytics and machine learning pipelines. With a wide range of supported task types, deep observability capabilities and high reliability, your data teams are empowered to better automate and orchestrate any pipeline and become more productive.
Genie Code
Your autonomous AI partner for data work.
Lakehouse Storage
Unify the data in your lakehouse, across all formats and types, for all your analytics and AI workloads.
Unity Catalog
Seamlessly govern all your data assets with the industry’s only unified and open governance solution for data and AI, built into the Databricks Data Intelligence Platform.
The Data Intelligence Platform
Find out how the Databricks Data Intelligence Platform enables your data and AI workloads.
Take the next step
Spark Declarative Pipelines FAQ
Ready to become a data + AI company?
Take the first steps in your transformation











