Unity Catalog

Unified governance for all data and AI assets

Sign up for Public PreviewContact us

Unity Catalog is a unified governance solution for all data and AI assets including files, tables and machine learning models in your lakehouse on any cloud.

Centralized governance for data and AI

With a common governance model based on ANSI SQL, centrally govern files, tables, dashboards and ML models on any cloud.

Built-in data search and discovery

Quickly find, understand and reference data from across your data estate, boosting productivity.

Performance and scale

Benefit from enhanced query performance with low-latency metadata serving and auto-tuning of the tables.

Automated lineage for all workloads

Create a unified, transparent view of your entire data ecosystem with automated and granular lineage for all workloads in SQL, R, Python, Scala and across all asset types — tables, notebooks, workflows and dashboards.

Integrated with your existing tools

Future-proof your data and AI governance with the flexibility to leverage your existing data catalogs and governance solutions.
background-image

How it works

Centrally manage and govern all data assets

With a common governance model based on open standard ANSI SQL, simplify governance for files, tables, dashboards and ML models on any cloud. Define access policies once at the account level and enforce across all workloads and workspaces. Unity Catalog also provides centralized fine-grained auditing by capturing an audit log of actions performed against the data and helps you meet your compliance and audit requirements.

Manage fine-grained access controls with ease

Use standard SQL functions to define row filters and column masks, allowing fine-grained access controls on rows and columns. Leverage Attribute-Based Access Controls to define access policies based on custom tags (attributes), further simplifying data stewardship.

Unified and secure data search experience

Quickly find, understand and reference relevant data from across your data estate with a unified data search experience for data analysts, data engineers and data scientists. Data search in Unity Catalog is secure by default, limiting search results based on the access privileges of users, and adding an additional layer of security for privacy considerations.

Enhanced query performance at any scale

Unity Catalog provides enhanced query performance with low-latency metadata serving and table auto-tuning, resulting in faster executions of queries at any scale. Asynchronous automatic data compaction optimizes file sizes and reduces input/output (I/O) latency automatically in the background.

Automated and real-time data lineage

Gain end-to-end visibility into how data flows in your lakehouse with automated and real-time data lineage across all workloads in SQL, Python, Scala and R. Quickly perform data quality checks, complete impact analysis of data changes, and debug any errors in your data pipelines. Benefit from lineage across tables, columns, notebooks, workflows and dashboards. Lineage graphs in Unity Catalog are privilege-aware, restricting access to lineage graphs based on the access privileges of users. Lineage can also be retrieved via REST API to support integrations with other catalogs.

Secure data sharing across organizations

Unity Catalog natively supports Delta Sharing, the world’s first open protocol for secure data sharing, enabling you to easily share existing data in Delta Lake and Apache Parquet formats to any computing platform. Consumers don’t have to be on the Databricks platform, same cloud or any cloud at all. You can share live data, without replicating or copying it to another system. Native integrations with Power BI, Tableau, Spark, pandas and Java allow recipients to consume shared data directly from the tools of their choice. You can centrally manage, govern, audit and track usage of the shared data on one platform.

background-image

Seamless integrations with your existing tools

Unity Catalog works with your existing data catalogs, data storage systems and governance solutions so you can leverage your existing investments and build a future-proof governance model without expensive migration costs.

Data and cloud storage

Data governance and catalog partners

Customers


logo graphic grammarly
“Unity Catalog significantly simplified our implementation of role-based access control, allowing us to secure data at a catalog, database, table and column level. We seamlessly provision the appropriate level of access to all users and can confidently manage and secure data access to meet our strict compliance and privacy policies.”
— Chris Locklin, Engineering Manager of Data Platforms, Grammarly


logo graphic milliman
“Unity Catalog has allowed Milliman to tie its expansive data assets directly to its users, simplifying access management and reducing storage complexity. Unity Catalog allows Milliman to meet stringent health care standards, provide collaborative data exploration and reduce storage complexity. With Unity Catalog, we can quickly scale to meet our clients’ needs.”
— Dan McCurley, Cloud Solutions Architect, Milliman


logo graphic warner bros
“Unity Catalog has been a game changer for us. Our analysts can access AWS external tables in real time in Databricks workspaces and can join them with Delta Lake tables. It gives us the performance benefits of Databricks SQL without migrating Glue tables. With Unity Catalog, various teams can collaborate with each other without moving the data from one place to another.”
— Rajat Gupta, Engineering Manager, Warner Bros. Discovery


Sign up for public preview

Ready to get
started?