As organizations scale real-time operational intelligence, traditional streaming architectures have become costly bottlenecks. Managing message buses like Kafka and handling schema registries and connector frameworks creates a significant “complexity tax” that diverts high-value engineering resources from strategic business initiatives. Meanwhile, duplicate storage inflates cloud bills and multi-hop architectures delay critical insights. Finally, data in transit often sits outside centralized governance frameworks, creating compliance risks and lineage blind spots.
Today, we’re excited to announce the General Availability of Zerobus Ingest, part of Lakeflow Connect. Zerobus Ingest is a fully managed, serverless service that streams data directly into governed Delta tables, removing intermediate layers to deliver a simplified, high-performance architecture.
By enabling data to flow directly from producers to the lakehouse, Zerobus Ingest slashes costs and eliminates tool sprawl. It also delivers high performance at scale, supporting thousands of concurrent connections and achieving over 10GB/second of aggregate throughput to a table in under 5 seconds.
Traditional message buses like Kafka were designed as multi-sink architectures: universal hubs that route data to dozens of independent consumers. However, this flexibility can come at a steep cost when your sole destination is the lakehouse. Zerobus Ingest uses a fundamentally different approach, with a single-sink architecture optimized for a single job: pushing data directly to the lakehouse.
This architectural choice eliminates complexity and drastically reduces cost:
With Zerobus Ingest, there’s a single, managed Databricks endpoint. Create your table in Unity Catalog, start writing data with the API or SDK, and you’re done. That’s it, nothing else to set up. The serverless architecture automatically scales up to support gigabytes-per-second ingestion without any configuration changes.

Zerobus Ingest allows data producers to bypass the message bus and push events directly into managed Delta tables in your Lakehouse.
Zerobus Ingest simplifies the traditional streaming architecture from five managed systems down to two components, eliminating multiple failure points, reducing operational overhead and removing the need for specialized expertise.
By eliminating the intermediate message bus, you remove two major cost centers: the compute and storage for the bus itself, and the dedicated engineering time needed to manage it. Zerobus Ingest offers ingestion at a fraction of the cost per gigabyte compared to running and maintaining a self-managed Kafka cluster.
Zerobus Ingest offers ingestion at a fraction of the cost per gigabyte compared to running and maintaining a self-managed Kafka cluster.
Learn more about how Zerobus works in this deep dive Databricks Community blog or in the documentation.
Developers can integrate via gRPC and REST APIs, or use language-specific SDKs. Zerobus Ingest provides a broad set of push-based interfaces for industry-specific integrations, making it a flexible, single tool that simplifies ingestion.
Learn more about the differences between REST and gRPC.
Also, since every write is governed by Unity Catalog, you get automatic lineage tracking and fine-grained access control from the moment data is created—ensuring your streaming data has unified governance with the rest of your lakehouse.
Toyota sought a unified solution to instantly process telemetry from thousands of factory devices, without the latency and complexity of traditional IoT architectures.
"Zerobus Ingest allows us to detect overheating factory conditions in minutes rather than hours, directly supporting our carbon-neutrality strategy and operational efficiency. But it's not just about tracking temperature telemetry; having Zerobus Ingest as an additional data ingestion option gives us the ability to collect diverse factory data in near real-time and trigger immediate countermeasures that have transformed our operations.” —Kento Izumi, General Manager, Digital Transformation Promotion Division, Toyota Motor Corporation
Instead of stitching together multiple cloud services, Toyota uses Zerobus Ingest, integrated with global IoT connectivity from Soracom, to mitigate the high maintenance costs of real-time operations, transform its manufacturing operations, and support its sustainability goals.
IoT Data Pipeline Architecture: From Edge to Analytics Platform with Zerobus Ingest and Soracom Beam
Izumi also explained that they are able to accelerate their operational efficiency, “When combined with 'vista,' our unified data and AI platform powered by Databricks, we aren't just collecting data faster; we are optimizing our data operations."
An early adopter of Zerobus Ingest, Joby Aviation streams gigabytes of aircraft telemetry every minute directly to the lakehouse, enabling their engineering teams to analyze flight performance in near real-time. Read the Joby Aviation case study.
"Zerobus Ingest reduced our telemetry resolution latency from days to minutes. This allows our engineering teams to analyze flight performance in near real-time and accelerate our mission to transform transportation." —Dominik Müller, Factory Systems Lead, Joby Aviation
Traditional infrastructure slows down real-time operations. By removing the complexity of intermediate message buses, Zerobus Ingest creates a direct, sub-5-second path to value across industries.
Accelerate your digital transformation by pushing data from any source across industries directly to your lakehouse.
Manufacturing: Maximize factory floor efficiency. Use the Zerobus Ingest SDKs to build custom forwarding agents that stream massive sensor volumes to the Lakehouse. This optimizes machine performance by eliminating heavy network infrastructure overhead.
Telecommunications and IoT: Monitor global networks at scale. Deployed at the edge, Zerobus Ingest pipes telemetry from your network to the lakehouse to track your network load in near real-time. Our partnership with Soracom expands integration with secure, reliable global IoT data ingestion through cellular, satellite, and LPWAN networks.
IT and Cybersecurity: Identify threats without the ETL delay. Bypass complex pipelines by streaming logs and behavioral events directly to the Lakehouse. This enables threat detection within seconds, adaptive model retraining, and faster incident response.
Commerce and Clickstream: Personalize experiences in near real-time. Capture high-volume clickstream data from apps and devices with minimal infrastructure overhead. This enables instant data availability to power personalization engines, A/B testing, and conversion optimization.
Zerobus Ingest is now Generally Available on AWS, Microsoft Azure, and Google Cloud platform support coming soon. Pricing is volume-based under the Lakeflow Jobs Serverless SKU.
As part of the GA launch, we are introducing a 6-month promotional pricing period. Learn more at the Lakeflow Connect pricing page.
Ready to eliminate streaming infrastructure complexity? With just a few lines of code, you can begin streaming data directly to your Unity Catalog-governed tables, ensuring your data is ready the moment it arrives to help deliver insights.
Check out the following Zerobus Ingest resources to get started today:
Product
November 21, 2024/3 min read

