For years, a data wall has separated the world of application development from the world of analytics. Developers have been forced to bridge this gap with brittle ETL pipelines just to move data from an operational PostgreSQL instance to a data lake. This fragmentation does more than just slow down delivery; it creates a data tax of duplicated storage and a persistent lag between reality and insight.
Today, we are breaking down that wall with the General Availability (GA) of Azure Databricks Lakebase, a milestone also announced by Microsoft.
Lakebase is managed, serverless Postgres that’s optimized for the Databricks Platform on Azure. It introduces a new category of database architecture that separates compute from storage, allowing you to write operational data directly to lakehouse storage. By collapsing the space between transactional systems and analytics, Azure Databricks Lakebase provides the final piece of the puzzle for a unified data architecture. Lakebase is a first-party service in the Microsoft ecosystem, built to complement your existing Azure investments while dramatically accelerating developer productivity. Through features like instant branching and zero-copy clones, teams can now iterate against production-grade data without the infrastructure delays that traditionally stall innovation.
“Azure Databricks Lakebase gave us one governed foundation for apps, analytics, and AI, so we stopped duplicating data and shipped real‑time features faster.” — Simon Gilles Fassot, Head of Global Data and Analytics, Hafnia
While traditional cloud databases act as isolated islands, Lakebase is natively integrated into the Azure ecosystem. Because Lakebase and the lakehouse share the same storage layer, you no longer need to worry about building and maintaining complex data pipelines or having your data jobs being out of sync. You can also get insights from your operational database system without impacting the performance on your operational workload.
Lakebase delivers an enterprise-ready Postgres experience with the efficiency of a serverless model. The platform automatically scales to handle heavy application traffic and scales to zero when idle, ensuring your compute resources match your actual demand. This usage-based pricing ensures the lowest TCO, as you only pay for the compute you actually use while Azure manages the underlying infrastructure and availability.
Modern development requires agentic speed and safety. Lakebase supports instant clones and data branching, allowing teams to create zero-copy branches of production data in seconds. This lets you work in a safe, isolated environment to test schema migrations or debug queries, without impacting live users. For added resiliency, Lakebase includes instant Point-in-Time Recovery (PITR), allowing you to immediately restore your database to a precise moment to recover from errors or incidents.
Lakebase is built on standard Postgres, ensuring full compatibility with the tools and libraries you already use. It supports dozens of popular extensions, including pgvector for AI-driven search and PostGIS for advanced geospatial analysis. By supporting the standard Postgres ecosystem, Lakebase ensures that developers can leverage the latest open-source innovations while Azure handles the security, identity, and networking requirements.
Security should not be fragmented across different database engines. With Lakebase, your operational data lives under the same Unity Catalog umbrella as your analytical and AI workloads. This provides a single governance model across your entire Azure Databricks data estate, enabling consistent access control, automated lineage, and enterprise-grade auditing.
"Azure Databricks Lakebase gives enterprise teams a clear path from Lakehouse to relational, governed data without a costly migration. As AI agents start operating directly on investment data, that foundation matters. We've already seen what it does to the speed and quality of traditional analysis at Quantum." — Ian Brown, Head of Digital Engineering, Quantum Capital Group
By unifying the database and the lakehouse, Lakebase unlocks new scenarios for developers building the next generation of intelligent software:
Azure Databricks Lakebase lets developers can continue using familiar tools and libraries like pgAdmin, DBeaver, and the PostgREST API while Azure handles security, identity, networking, and compliance. By integrating with Microsoft Entra ID and Azure networking protections, Lakebase accelerates application delivery while simplifying the underlying DevOps burden.
“The data platform we’ve built with Azure Databricks Lakebase gives us a treasure trove of usable, enriched data that sets us apart from anyone else in the industry. Lakebase is the intelligent foundation powering our ability to solve problems no one else can.” — Grant Veazey, CTO, Ensemble
The General Availability of Azure Databricks Lakebase offers a new foundation for the pace and complexity of modern data systems. It is the simplest path for Azure customers to build intelligent, real-time applications directly on their lakehouse foundation.
Ready to build? Azure Databricks Lakebase is integrated into the Azure Databricks experience and can be provisioned directly within your workspaces. Create your first project today and see how collapsing the wall between apps and analytics can accelerate your innovation.
Product
November 21, 2024/3 min read

