Session

Learning from Goldman Sachs' Legend Lakehouse for Data Governance

Overview

Wednesday

June 11

1:50 pm

ExperienceIn Person
TypeBreakout
TrackData Strategy
IndustryFinancial Services
TechnologiesApache Spark, Apache Iceberg, Unity Catalog
Skill LevelIntermediate
Duration40 min

Data is the backbone of modern decision-making, but centralizing it is only the tip of the iceberg. Entitlements, secure sharing and just-in-time availability are critical challenges to any large-scale platform. Join Goldman Sachs as we reveal how our Legend Lakehouse, coupled with Databricks, overcomes these hurdles to deliver high-quality, governed data at scale. By leveraging an open table format (Apache Iceberg) and open catalog format (Unity Catalog), we ensure platform interoperability and vendor neutrality. Databricks Unity Catalog then provides a robust entitlement system that aligns with our data contracts, ensuring consistent access control across producer and consumer workspaces. Finally, Legend functions, integrating with Databricks User Defined Functions (UDF), offer real-time data enrichment and secure transformations without exposing raw datasets. Discover how these components unite to streamline analytics, bolster governance and power innovation.

Session Speakers

George Wu

/Vice President
Goldman Sachs

Abhishek Narang

/Managing Director & Technology Fellow
Goldman Sachs