Session

Introducing Indexes for the Lakehouse

Overview

Experience	In Person
Track	Governance & Security
Industry	Enterprise Technology
Technologies	Databricks SQL, Unity Catalog
Skill Level	Intermediate

Substring searches that take hours. Point lookups that scan terabytes. Pattern-matching queries that force you to maintain separate tools alongside your lakehouse. These are problems that traditional databases solved decades ago, and that the lakehouse is now solving too.

In this session, we'll introduce two new indexing capabilities coming to Databricks:

Full-Text Search indexes for fast substring, regex, and IP-address matching, with up to 20x query speedup;
Standard (Secondary) indexes for needle-in-a-haystack point lookups, with 30-60x speedup on non-clustered columns.

We'll cover how each index type works under the hood, when to use which, and how they complement Liquid Clustering and data skipping. Then we'll show a live demo: creating indexes on real tables and running queries in seconds that previously took minutes.

If you're running log analytics, SIEM workloads, UUID lookups, or any query pattern where you're scanning far more data than you need to — this session is for you.

Session Speakers

Ivan Vezilić

Staff Software Engineer
Databricks