JUNE 26-29, 2023
SAN FRANCISCO + VIRTUAL
지금 등록하기

Building a Lakehouse for Data Science at DoorDash

On Demand

Type

  • Session

Format

  • Hybrid

Track

  • 데이터 레이크, 데이터 웨어하우스 및 데이터 레이크하우스

업종

  • 소매 및 소비재

Difficulty

  • Beginner

Room

  • Moscone South | Level 2 | 202

Duration

  • 35 min
Download session slides

개요

DoorDash was using a data warehouse but found that they needed more data transparency, lower costs, and the ability to handle streaming data as well as batch data. With an engineering team rooted in big data backgrounds at Uber and LinkedIn, they moved to a Lakehouse architecture intuitively, without knowing about the term. In this session, learn more about how they arrived at that architecture, the process of making the move, and the results they have seen. While addressing both data analysts and data scientists from their lakehouse, this session will focus on their machine learning operations, and how their efficiencies are enabling them to tackle more advanced use cases such as NLP and image classification.

Session Speakers

Headshot of Hien Luu

Hien Luu

Sr. Engineering Manager

DoorDash

Headshot of Brian Dirking

Brian Dirking

Sr. Director Partner Marketing

Databricks

Data+AI Summit 하이라이트 보기

Watch on demand