How Photon Unlocks New Performance Gains for Streaming
Overview
| Experience | In Person |
|---|---|
| Track | Data Engineering & Streaming |
| Industry | Enterprise Technology, Communications, Media & Entertainment, Financial Services |
| Technologies | Lakeflow |
| Skill Level | Intermediate |
Databricks’ Photon engine has revolutionized Spark execution, delivering state-of-the-art query performance and cost efficiency. While many Structured Streaming workloads already benefit from this vectorized execution engine, a subset of streaming-specific operators has historically relied on the standard JVM execution model, leaving potential optimization on the table. Until now.
In this session, we reveal our recent engineering initiatives to expand Photon coverage to these critical streaming operators. We will dive into the technical journey of “Photonizing” these components, detailing the complex engineering challenges we overcame to bring vectorized execution to stateful and streaming-specific logic.
Attendees will see side-by-side performance benchmarks demonstrating how these enhancements translate into lower latency, higher throughput, and reduced Total Cost of Ownership (TCO) for streaming pipelines
Session Speakers
Siying Dong
/Senior Staff Software Engineer
Databricks
Arsenii Krasikov
/Staff Software Engineer
Databricks