SESSION

Data Warehouse Performance on the Data Lakehouse

Accept Cookies to Play Video

OVERVIEW

EXPERIENCEIn Person
TYPELightning Talk
TRACKData Lakehouse Architecture
INDUSTRYMedia and Entertainment
TECHNOLOGIESApache Spark, Delta Lake, SQL Analytics / BI / Visualizations
SKILL LEVELIntermediate
DURATION20 min
DOWNLOAD SESSION SLIDES

Data lakehouses promise flexibility, scalability, and cost-effectiveness but often fail to deliver these benefits due to the shortcomings of query engines. This has forced users to copy their data from the lakehouse into proprietary data warehouses to achieve their desired query performance—through a complex, costly ingestion pipeline that undermines data governance and freshness. In this talk, we will dive into the latest developments in data lakehouse querying and how you can ensure your data lakehouse realizes its full potential. This talk will cover:

  • Why you should avoid using proprietary data warehouses purely for accelerating queries
  • The latest technical developments in query engines that will empower data lakehouse performance
  • Coinbase's data architecture with Databricks Lakehouse and StarRocks

SESSION SPEAKERS

Sida Shen

/Product Manager
CelerData

Eric Sun

/Senior Engineering Manager
Coinbase