Industry's first open protocol for sharing data, models, agents, and skills across any cloud, any vendor, and any format
by Huey Han, Harish Gaur, Akram Chetibi and Mengxi Chen
When Databricks pioneered Delta Sharing in 2021, we set out to solve a problem that every data team knew too well: sharing live data across organizational boundaries was slow, fragile, and full of compromise. You either copied data — creating stale replicas and compliance headaches — or you constrained yourself to only sharing with partners on the same platform as you, thereby significantly restricting innovation.
Delta Sharing changed that. A single open protocol. No data copying. No platform silos. And in the five years since, it has become the most widely adopted open zero-copy data-sharing protocol — with 28,000+ data recipients and 33% of shares flowing across platforms via open connectors. Leading companies such as SAP, Atlassian, Mercedes-Benz, The Trade Desk, LSEG, S&P Global, and many more have adopted Delta Sharing to share and collaborate on data.
But the world has moved on. The rise of agentic AI has fundamentally changed what enterprises need to share. Today, we're taking the next step.
We're excited to announce OpenSharing — the next evolution of Delta Sharing, and the industry's first open protocol built for the agentic era. OpenSharing advances Delta Sharing into an independent open-source project, expanding its scope from data sharing to the full AI stack: models, agents — across any cloud, any vendor, and any format.

Delta Sharing was built for a world of tables and files. But organizations must now exchange semantic context, AI skills, unstructured data, and autonomous agents across cloud, vendor, and company boundaries. Today's sharing protocols remain locked into vendor-specific formats, can't handle AI logic, and depend on brittle networking that takes weeks to configure for each new partner.
The result: collaboration slows, data silos persist, and the value locked inside enterprise data goes unrealized.
OpenSharing solves this. It's a single open protocol that shares data and AI across any format, any cloud, and any organizational boundary — natively supporting Delta Lake, Apache Iceberg, and Parquet so data stays where it lives and flows to whoever needs it.
"Delta Sharing proved the industry would choose open over locked-in. OpenSharing extends that principle to the full AI stack, while expanding the cross-platform ecosystem to Iceberg recipients and on-premises providers. The agentic era deserves an open foundation, and OpenSharing delivers it." — Matei Zaharia, Co-founder and CTO of Databricks.
OpenSharing exists at two layers. The open-source protocol — now hosted by the Linux Foundation — is the published spec that any vendor or community member can implement. Databricks OpenSharing is the enterprise implementation of the open protocol, built on top of other Databricks features such as Unity Catalog for governance and audit logging, Marketplace for discoverability, and more.
We are excited to launch a suite of features for OpenSharing on Databricks.
For the first time, organizations can share governed AI experiences — not just datasets — across organizational boundaries.
Genie Agents are Databricks' AI-powered conversational analytics environments. With OpenSharing, a provider can now share Genie Agents — including their underlying semantic context, business metrics, and reusable AI logic — with any partner or customer, with governance end-to-end via Unity Catalog. Optionally, providers can control how recipients access data — including hiding proprietary Genie instructions, restricting data access to the Genie Agent only, setting daily prompt quotas, and capping row export limits. These controls unlock new monetization opportunities for data providers, such as usage-based pricing instead of a full data license.

Cross-cloud data sharing has always had two distinct problems. OpenSharing on Databricks now solves both.
The first is networking. When provider storage sits behind a private network — which is almost always the case for sensitive data exchanges or regulated industries — onboarding a new recipient can take weeks of manual IP allowlisting, firewall coordination, and back-and-forth with cloud admins. For providers with dozens or hundreds of recipients, this doesn't scale. SecureConnect solves this problem: a Databricks-managed proxy that routes storage access on behalf of all recipients. Configure it once — no per-recipient firewall changes needed, ever. Read the announcement blog.

The second is egress cost. Cross-cloud queries generate egress fees that compound at scale, becoming a significant and unpredictable cost that makes broad multi-cloud sharing economically impractical. Global Distribution solves this with automatic cross-region and cross-cloud replication. Recipients query a local replica — fast, with no egress fees. Providers get a predictable cost structure. Global teams get low-latency access regardless of where the source data lives.
OpenSharing is built on the conviction that data ecosystems thrive when they are truly open — not just in name, but in practice. That means supporting the formats, storage systems, and clients your partners already use.
Storage Ecosystem: Govern everything, wherever it lives
Not all enterprise data can — or should — move to the cloud. Regulatory mandates, data gravity, edge latency, and sheer economics mean that some of the world's most valuable data will stay on-premises. OpenSharing reaches it.
The Databricks Storage Ecosystem brings the Databricks Data Intelligence Platform directly to on-premises, private cloud, and edge environments — powered by OpenSharing. Storage partners implement the OpenSharing server, connecting their data estates to Unity Catalog without moving a single byte. No migration. No duplication. Read the announcement.
Launch partners include MinIO (GA), Everpure (Private Preview), Qumulo (Private Preview Soon), and VAST Data (Private Preview Soon) — with Cohesity, Commvault, NetApp, and Nutanix coming by the end of the year. Collectively, these partners manage hundreds of exabytes of enterprise data.
Iceberg interoperability
Delta Sharing is already supported in a wide range of platforms and connectors, including Databricks, Tableau, Power BI, Apache Spark, and Snowflake. OpenSharing has now added support for Apache Iceberg REST Catalog API — making it possible to share data with any Iceberg-compatible client. Providers can also share tables from external catalogs including AWS Glue, Hive Metastore, and Snowflake Horizon — bringing external data into the governed OpenSharing ecosystem without replication.

Building on the same simplicity that made Delta Sharing successful, OpenSharing extends the protocol to support the full AI asset stack:
For enterprise deployments on Databricks, SecureConnect and Global Distribution layer on top of this flow — handling cross-cloud networking and replication automatically, with no changes to how providers or recipients interact with their shares.
Subscribe to our blog and get the latest posts delivered to your inbox.