Data + AI Summit 2022

So Fresh and So Clean: Learn How to Build Real-Time Warehouses on Lakehouse

On Demand

Type

  • Session

Format

  • Hybrid

Track

  • Data Lakes, Data Warehouses and Data Lakehouses

Difficulty

  • Intermediate

Room

  • Moscone South | Level 2 | 202

Duration

  • 80 min

Overview

Warehouses? Where we are going, we won't need warehouses! Join Dillon, Franco, and Shannon as they take TPC-DI, an industry-standard data warehouse integration benchmark that represents a typical 80s-style data warehouse, and bring it into the future. We will review how to implement standard data warehousing practices on Lakehouse, show you how to deliver optimal price/performance in the cloud, and keep your data so fresh and so clean. We will take an assortment of structured, semi-structured, and unstructured data in the form of CSV, TXT, XML, and fixed-width files and transform it warehouse-style into Lakehouse, with a historical load and incremental CDC loads.

Session Speakers

Franco Patano

Product Specialist DBSQL

Databricks

Dillon Bostwick

Sr. Solutions Architect

Databricks

Shannon Barrow

Sr. Solutions Architect

Databricks
