Session

Introducing Indexes for the Lakehouse

Overview

Experience: In Person
Track: Governance & Security
Industry: Enterprise Technology
Technologies: Databricks SQL, Unity Catalog
Skill Level: Intermediate

Substring searches that take hours. Point lookups that scan terabytes. Pattern-matching queries that force you to maintain separate tools alongside your lakehouse. These are problems that traditional databases solved decades ago, and that the lakehouse is now solving too.

In this session, we'll introduce two new indexing capabilities coming to Databricks:

  • Full-Text Search indexes for fast substring, regex, and IP-address matching, with up to 20x query speedup
  • Standard (Secondary) indexes for needle-in-a-haystack point lookups, with 30-60x speedup on non-clustered columns

We'll cover how each index type works under the hood, when to use which, and how they complement Liquid Clustering and data skipping. Then we'll show a live demo: creating indexes on real tables and running queries that previously took minutes in seconds.

If you're running log analytics, SIEM workloads, UUID lookups, or any query pattern where you're scanning far more data than you need to — this session is for you.

Session Speakers

Sirui Sun

Sr. Manager, Product Management
Databricks