Trust You Can Measure: Data Quality Standards in the Lakehouse
Overview
Experience | In Person |
---|---|
Type | Breakout |
Track | Data and AI Governance |
Industry | Enterprise Technology, Professional Services |
Technologies | Apache Spark, Unity Catalog |
Skill Level | Beginner |
Duration | 40 min |
Do you trust your data? If you’ve ever struggled to figure out which datasets are reliable, well-governed, or safe to use, you’re not alone. At Databricks, our own internal lakehouse faced the same challenge—hundreds of thousands of tables, but no easy way to tell which data met quality standards. In this talk, the Databricks Data Platform team shares how we tackled this problem by building the Data Governance Score—a way to systematically measure and surface trust signals across the entire lakehouse. You’ll learn how we leverage Unity Catalog, governed tags, and enforcement to drive better data decisions at scale. Whether you're a data engineer, platform owner, or business leader, you’ll leave with practical ideas on how to raise the bar for data quality and trust in your own data ecosystem.
Session Speakers
Amit Pahwa
Staff Software Engineer
Databricks
Sergiy Kanyshchev
Staff Software Engineer
Databricks