LAKEHOUSE STORAGE

Built for open, intelligent data storage

Choose your storage location and format, with full ownership and portability of your data.

TOP TEAMS SUCCEED WITH DATA INTELLIGENCE

Your compact guide to modern analytics

Your essential guide for delivering trusted, modern analytics for AI on the Databricks Platform

Read now

benefits

Lakehouse storage that’s flexible and fast

Eliminate data management headaches with open table formats, centralized governance and automatic data optimizations.

Compatible formats

A single copy of source data in Delta Lake or Apache Iceberg™ that can be accessed by any engine.

Unified governance

A single catalog for data discovery and governance, across your data and AI assets.

AI-driven performance

AI-powered models autonomously optimize and maintain data for speed and low cost.

Features

Your data, your way

Choose the storage location and open format that works for you. Keep your data portable, without vendor lock-in.

Best-in-class read and write performance for Delta Lake and Apache Iceberg™ tables, out of the box, with storage optimizations not available in any other lakehouse.

More about managed tables

Access tables that are managed by external catalogs like Glue, HMS and Snowflake Horizon and leverage advanced Unity Catalog features like fine-grained access controls.

More about foreign tables

Unity Catalog architecture with client connections

The Unity REST and Iceberg REST Catalog APIs unlock the entire lakehouse ecosystem, across formats and engines.

More about using external systems

Unity Catalog architecture with connected clients

More features

ACID Transactions

Atomicity, consistency, isolation and durability guarantees provided by open table format protocols.

Learn more

Predictive Optimization

AI-driven table optimizations based on your data and usage patterns that keep your tables tuned, automatically.

Learn more

Liquid Clustering

Out-of-the-box, self-tuning data layout that scales with your data — no partitions required.

Learn more

Change Data Feed

Track row-level changes between versions of a Delta table.

Learn more

Time Travel

Historical information about tables lets you audit operations, roll back a table or query a table at a specific point in time.

Learn more

Structured Streaming

Integration with Apache Spark™ Structured Streaming, a near real-time processing engine that offers end-to-end fault tolerance with exactly-once processing guarantees.

Learn more

USE CASES

For all your analytics and AI workloads

Build and manage reliable data pipelines

Managed tables act as both batch tables and a streaming source and sink. Streaming data ingest, batch historic backfill and interactive queries all work out of the box and directly integrate with Spark Structured Streaming.

Learn more