Session

Building the Commercial Lakehouse: AI-Driven, Open and Interoperable

Overview

ExperienceIn Person
TrackGovernance & Security
IndustryHealthcare & Life Sciences
TechnologiesDelta Sharing, Lakeflow, Unity Catalog
Skill LevelIntermediate

AstraZeneca modernized its Commercial data estate, transitioning from legacy EMR to a high-performance global commercial lakehouse. At the core lies Unity Catalog (UC), established as the unified governance layer. We demonstrate how UC enables a true open architecture, orchestrating seamless federation for legacy Glue workloads and utilizing Iceberg REST integration to provide zero-copy access for iceberg clients e.g., Snowflake.The transformation was powered by a multi-agent AI migration system, utilizing LLMs and testing agents to automate the conversion of 250+ pipelines into a rigorous Medallion architecture. Key components include Databricks Asset Bundles for versioned CI/CD, Liquid Clustering for storage optimization.Processing complex external data sources (Claims, CRM, Digital), this platform delivered 50% cost savings. This session covers the multi-agent migration architecture, UC-centric interoperability design patterns and lessons from modernizing commercial data at scale.

Session Speakers

Speaker placeholderIMAGE COMING SOON

Paul Kuntz

/Commercial Data Lake Capability Lead
AstraZeneca Pharmaceuticals