ホームData + AI Summit 2022 のロゴ
Watch on demand

Radical Speed on the Lakehouses: Photon under the hood

On Demand

Type

  • Session

フォーマット

  • Hybrid

Track

  • データレイク、データウェアハウス、データレイクハウス

Room

  • Moscone South | Level 3 | 306

Duration

  • 35 min
Download session slides

概要

Many organizations are standardizing on the lakehouse, however, this new architecture poses challenges with an underlying query execution engine for accessing structured and unstructured data. The execution engine needs to provide the performance of a data warehouse and the scalability of data lakes. To ensure optimum performance, the Databricks Lakehouse Platform offers Photon. This next-gen vectorized query execution engine outperforms existing data warehouses in SQL workloads and implements a more general execution framework for efficient processing of data with support of the Apache Spark™ API. With Photon, analytical queries are seeing a 3 to 5x speed increase, with a 40% reduction in compute hours for ETL workloads. In this session, we will dive into Photon, describe its integration with the Databricks Platform and Apache Spark™ runtimes, talk through customer use cases, and show how your SQL and DataFrame workloads can benefit from the performance of Photon.

Session Speakers

Headshot of Sriram Krishnamurthy

Sriram Krishnamurthy

Databricks

Headshot of Justin Breese

Justin Breese

Sr. Product Manager

Databricks

Data+AI サミットの様子をご覧いただけます

Watch on demand