Session
Petabytes in Plain Sight: Extending Databricks to the On-Prem Data You Already Have
Overview
| Experience | In Person |
|---|---|
Every enterprise building a lakehouse hits the same wall: the most valuable data lives on-premises. You can mask it. Tokenize it. Replicate a fraction. But data bound by HIPAA, GDPR, or sovereignty mandates cannot leave at all. Either way, Databricks never sees the full picture.

Until now.

MinIO and Databricks have solved this by embedding open sharing protocols directly into the data foundation. When your data foundation natively speaks open sharing protocols, data movement becomes unnecessary. Databricks queries on-prem tables directly, in place, with full governance retained by the data owner.

MinIO CTO Ugur Tigli shows how customer demand drove MinIO and Databricks to make it possible to run analytics on full-fidelity production data on-prem through open protocols like Delta Sharing and the Apache Iceberg™ REST API. He covers what happens to your security posture when pipelines disappear, and what it means when petabytes of on-prem data become first-class Databricks assets.