Keynotes: Lakehouse Data Architecture, Data Engineering, and Analytics
Wednesday, May 26, 08:00 AM (PT)
Hear from Databricks co-founders and the original creators of popular projects Apache Spark, Delta Lake and MLflow on how the open source community is tackling the biggest challenges in data.
They’ll also reveal some of the latest innovations in data engineering and data analytics to simplify and scale your work. We’ll also be joined by data leaders from Atlassian and Microsoft, as well as the Nobel Laureate Malala Yousafzai, an inspiring human rights advocate.
Future is Open. Lakehouse is Here | Ali Ghodsi | Keynote Data + AI Summit NA 2021
Ali Ghodsi, Co-founder & CEO Original Creator of Apache Spark, Databricks
Databricks CEO Ali Ghodsi kicks off Summit, live from the Lakehouse. Ali talks about momentum in open source data technologies and the growing adoption of the data lakehouse architecture, combining the best of data warehouses and data lakes. The lakehouse is one platform to unify all your data, analytics and AI workloads.
Ali Ghodsi and Bill Inmon | Fireside Chat | Keynote Data + AI Summit NA 2021
Ali Ghodsi, Co-founder & CEO Original Creator of Apache Spark, Databricks • Bill Inmon, Computer scientist, author, and technology pioneer, best known as the Father of Data Warehousing
Databricks CEO Ali Ghodsi interviews Bill Inmon, the “Father of the Data Warehouse,” about the industry’s evolution to the lakehouse architecture. Bill discusses the need for an open lakehouse architecture built on top of data lakes that natively supports data warehousing and machine learning. Bill says enterprises that don’t build a lakehouse will have a mountain of data that goes to waste. The lakehouse will unlock the data and present opportunities we’ve never seen before.
Announcing Delta Lake 1.0 | Michael Armbrust | Keynote Data + AI Summit NA 2021
Ali Ghodsi, Co-founder & CEO Original Creator of Apache Spark, Databricks • Michael Armbrust, Distinguished Engineer, Databricks
Delta Lake co-creator and Databricks Distinguished Engineer Michael Armbrust announces the Delta Lake 1.0 milestone and key features including: generated columns, querying from data federated across multiple clouds, standalone Delta Lake in Python and more. He also introduces a set of new open source committers.
Building the lakehouse at Atlassian | Rohan Dhupelia | Keynote Data + AI Summit NA 2021
Michael Armbrust, Distinguished Engineer, Databricks • Rohan Dhupelia, Data Platform Senior Manager, Atlassian
Rohan Dhupelia of Atlassian talks about the evolution of their internal data architecture to the lakehouse as the “sweet spot” in between the data warehouse and data lake.
Announcing Delta Sharing with Demo | Matei Zaharia | Keynote Data + AI Summit NA 2021
Ali Ghodsi, Co-founder & CEO Original Creator of Apache Spark, Databricks • Matei Zaharia, Assistant Professor of Computer Science; Original Creator of Apache Spark & MLflow, Databricks
“Data needs to flow beyond the borders of individual organizations,” says Databricks CEO Ali Ghodsi. He announces Delta Sharing, the industry’s first open protocol for secure data sharing, as open source under the Linux Foundation. Databricks Chief Technologist Matei Zaharia dives into the goals and the details of being a data provider or a data recipient.
Ali Ghodsi and Matt Garman (SVP, AWS) | Fireside Chat | Keynote Data + AI Summit NA 2021
Ali Ghodsi, Co-founder & CEO Original Creator of Apache Spark, Databricks • Matt Garman, Senior Vice President, AWS WW Sales and Marketing, AWS
Matt Garman, SVP at AWS, talks about some of the early lessons of scale at Amazon Web Services. Matt also addresses the trends around lakehouse adoption that AWS has observed, and the advantages that Delta Sharing can bring to data sharing.
Announcing Delta Live Tables with Demo | Michael Armbrust | Keynote Data + AI Summit NA 2021
Ali Ghodsi, Co-founder & CEO Original Creator of Apache Spark, Databricks • Michael Armbrust, Distinguished Engineer, Databricks
Distinguished Engineer Michael Armbrust announces Delta Live Tables, making it possible to do production-quality ETL using only SQL queries. The Live Tables runtime takes care of operational, governance and quality concerns, allowing you to spend more time getting value from the data. You can even mix Python with SQL to do advanced analytics and AI. Learn more at databricks.com.
Announcing the Unity Catalog | Matei Zaharia | Keynote Data + AI Summit NA 2021
Ali Ghodsi, Co-founder & CEO Original Creator of Apache Spark, Databricks • Matei Zaharia, Assistant Professor of Computer Science; Original Creator of Apache Spark & MLflow, Databricks
Databricks CEO Ali Ghodsi announces the Unity Catalog, the industry’s first unified catalog for the Lakehouse. It allows organizations to standardize on one security model based on ANSI SQL. Chief Technologist Matei Zaharia then dives into the details on governance challenges solved by the Unity Catalog.
SQL Analytics & Photon Updates with Demo | Reynold Xin | Keynote Data + AI Summit NA 2021
Ali Ghodsi, Co-founder & CEO Original Creator of Apache Spark, Databricks • Reynold Xin, Co-founder & Chief Architect, Databricks
Databricks Chief Architect Reynold Xin talks about the performance improvements and simplified administration now available in Photon and SQL Analytics. Get a first-class SQL development experience backed by an engine with improved concurrent querying capabilities.
Ali Ghodsi and Rohan Kumar (CVP, Microsoft) | Fireside Chat | Keynote Data + AI Summit NA 2021
Ali Ghodsi, Co-founder & CEO Original Creator of Apache Spark, Databricks • Rohan Kumar, Corporate Vice President, Azure Data, Microsoft
Rohan Kumar, CVP of Azure Data at Microsoft, shares what customers like Grab and ABN AMRO are able to achieve with data and AI using Azure Databricks, and how innovations like Photon and Delta Sharing are coming to life on Azure.
Malala Yousafzai and Ali Ghodsi | Fireside Chat | Keynote Data + AI Summit NA 2021
Ali Ghodsi, Co-founder & CEO Original Creator of Apache Spark, Databricks • Malala Yousafzai, Co-Founder of Malala Fund and Nobel Laureate
Malala, an internationally recognized activist, joins to share her work to enable every girl around the globe to have access to high-quality education.

Ali Ghodsi, Co-founder and CEO at Databricks
In addition to leading Databricks, Ali is an original creator of Apache Spark and an Adjunct Professor at the University of California, Berkeley. Ali will be leading the morning keynotes, kicking them off by talking about the opportunities recent innovations enable around simplifying and scaling data.

Bill Inmon, Father of Data Warehousing
Computer scientist, author and technology pioneer Bill Inmon will join a fireside chat with Databricks Co-founder and CEO Ali Ghodsi. They’ll talk about the evolution of data infrastructure – from data warehouses to data lakes to data lakehouses – and give a preview of Bill’s upcoming book.

Michael Armbrust, Original Creator of Spark SQL and PMC Member
Michael Armbrust, a Distinguished Engineer and leader of the Delta Lake and Streaming efforts at Databricks, will review the momentum of the Delta Lake open source project and share some of the latest initiatives the team has been working on.

Rohan Dhupelia, Leader of Analytics Platform at Atlassian
Atlassian, makers of the popular Jira, Trello and Bitbucket, made the journey to the lakehouse to enable data democratization at scale. Rohan will discuss the cost and challenges of their data warehouse origins, including data duplication, data latency and concurrency issues. Rohan will also dive into the benefits of their latest data lakehouse, which reduced cost, simplified access and governance and increased the pace of innovation with greater autonomy for teams.

Matei Zaharia, Original creator of MLflow and Apache Spark
Matei, Co-founder and Chief Technologist of Databricks and an Assistant Professor at Stanford University, will talk about the latest features in both open source and the Databricks Lakehouse Platform.

Matt Garman, SVP of Amazon Web Services
Matt will discuss his experience launching Amazon EC2 and chat with Databricks Co-founder and CEO Ali Ghodsi about their perspectives on the emergence of the Lakehouse. They’ll also wrap up the conversation with insights into the new keynote announcements.

Reynold Xin, Top contributor to Apache Spark
Reynold, Co-founder and Chief Architect at Databricks, will share updates in open source and Databricks that improve performance and scale of SQL analytics using a lakehouse architecture.

Rohan Kumar, Corporate Vice President, Azure Data, Microsoft
As the Corporate Vice President of Azure Data, Rohan is the engineering leader responsible for the product strategy, technical vision, long-range planning, design, development/implementation, and engineering process involving the certification and release of SQL Server and all Azure Data Services, including SQL DB, Cosmos DB, Database for MySQL, Database for PostgreSQL, Database for MariaDB, SQL Data Warehouse, Azure Databricks, Azure Data Lake, HDInsight, Azure Stream Analytics, Azure Data Factory, Azure Data Catalog and Microsoft’s Analytics Platform System (APS).

Malala Yousafzai, Nobel Laureate and Co-founder of Malala Fund
Malala, an internationally recognized activist, joins to share her work to enable every girl around the globe to have access to high-quality education.