Skip to main content

Introducing Lakebridge: Free, Open Data Migration to Databricks SQL

AI-powered tooling for fast, predictable migrations to modernize your data warehouse

Announcing Lakebridge

Summary

  • Lakebridge is a free tool designed to automate the migration from legacy data warehouses to Databricks.
  • It provides end-to-end support for the migration process, including profiling, assessment, SQL conversion, validation, and reconciliation.
  • Lakebridge can automate up to 80% of migration tasks, accelerating implementation speed by up to 2x.

We’re excited to introduce Lakebridge, a free migration tool that simplifies and accelerates enterprise data warehouse (EDW) migrations to Databricks SQL.

Modernizing from legacy, siloed data warehouses is critical for many organizations to unlock faster insights, reduce costs, and consolidate analytics and AI workloads on Databricks' open, unified platform. However, migrations are often seen as high-risk: legacy SQL code is complex, dependencies are hidden, and documentation is scarce. Validating translated logic, ensuring data quality, and reconciling thousands of SQL scripts and stored procedures can slow down even the most committed teams.

Lakebridge helps overcome these challenges by automating up to 80% of the migration process—including profiling, SQL conversion, validation, and reconciliation—so teams can confidently move faster.

Modernize while you migrate with Lakebridge

Databricks SQL is the fastest-growing data warehouse in the industry, recognized for its price/performance, efficiency, and AI-powered insights. As more organizations choose Databricks as the foundation for their open and flexible data architecture, Lakebridge ensures that migrations are no longer a bottleneck but a strategic advantage.

  • Clear insight into migration scope and complexity
  • Automated, high-fidelity code conversion and validation
  • Easy reconciliation of migrated workloads for data accuracy
  • A proven path to accelerate your journey to Databricks SQL

Lakebridge supports lift-and-shift and hybrid migration approaches. With the hybrid approach, you don’t have to modernize everything on day one—you can build on a stable foundation while aligning with future requirements. 

Fast, predictable migrations to Databricks 

Built by experienced field engineers and practitioners who understand the challenges of large-scale migrations, Lakebridge addresses the complexity of legacy systems with a modern, automation-first approach. Additionally, Lakebridge is extensible, enabling Databricks partners to collaborate, contribute features, and evolve the tool to meet diverse migration needs.

Lakebridge delivers a comprehensive, end-to-end migration experience through three key components:

  • Analyzer: Performs a detailed assessment of your legacy data warehouse environment.
  • Converter: Intelligently converts legacy ETL workflows and SQL scripts--including stored procedures--into performant, compatible Databricks SQL or Spark SQL code.
  • Validator: Ensures data accuracy and correctness with built-in reconciliation tools.

Lakebridge also provides built-in dashboards and reports directly within Databricks to support transparency and control during migrations. These views allow teams to track progress, validate results, and better understand their evolving data landscape, speeding up adoption and delivering value faster.

With these features, Lakebridge automates up to 80% of migration tasks, helping teams accelerate project timelines by up to 2x. Today, Ladebridge supports more than 10 legacy data warehouses, with many more coming soon.

Lakebridge components: Analyzer, Converter, Validator

Better migration planning

Lakebridge reduces uncertainty in your migration project by providing early, detailed insight into your legacy environment through its built-in Analyzer tool. It scans metadata and legacy code to generate multi-tabbed reports that inventory all objects requiring migration, such as tables, views, ETL jobs, and stored procedures, and classify workloads by complexity from low to medium to complex to very complex.

These insights allow teams to scope projects accurately, prioritize by business impact, and confidently plan phased migrations. By identifying potential challenges early, Lakebridge helps ensure smoother execution, better resource planning, and more effective user acceptance testing—ultimately making the transition to Databricks more predictable and manageable.

Lakebridge analyzer sample report
Sample report from Lakebridge Analyzer showing analysis of complexity, jobs, components, and platforms

Comprehensive migration solution

Lakebridge is designed to handle complex enterprise migrations in the real world, offering robust capabilities across tooling, source coverage, and automation.

  • Broad source support: Migrate from more than 10 leading data warehouses and ETL tools, including Teradata, Snowflake, Oracle, SQL Server, Informatica, and more. Additional connectors are available in Private Preview and are continuously added. Contact your account team to learn more.
  • Battle-tested toolkit: Lakebridge integrates proven technology from BladeBridge—a migration engine trusted by system integrators like Accenture, Capgemini, Celebal Technologies, and Tredence. These tools have powered hundreds of successful migrations, including advanced SQL parsing, code conversion, and validation.
  • End-to-end workflow: From workload discovery and assessment to code conversion and data validation, Lakebridge covers every stage of the migration lifecycle. Tools like Code Converter and Data Validator ensure integrity and performance throughout the process.
  • Proven, predictable results: Built on technology adopted by leading enterprises and partners, Lakebridge helps teams stay on schedule and target modernization goals.

lakebridge migration sources

Partners and customers can also build migration tooling on top of Lakebridge. For example, DoorDash built its migration factory with Lakebridge

Data interoperability is essential in a modern lakehouse architecture. It enables seamless data access across different engines and teams without compromising performance, consistency, or developer velocity. SQL translators like Transaxle (built on Lakebridge) are critical in making this interoperability a reality. They bridge dialect gaps, reduce human error, and unlock faster, more collaborative use of shared data assets. 
—Harsha Venkat Annapa Reddy, Data Foundations Engineering Manager, DoorDash

Lakebridge also empowers you to modernize on open standards by migrating your data and logic to open formats, catalogs, and technologies. The solution converts proprietary syntax like BTEQ (Teradata), T-SQL (Microsoft), and PL/SQL (Oracle)  into open, ANSI-compliant SQL that runs seamlessly on Databricks SQL. This approach frees you from vendor lock-in and positions your business to take full advantage of Databricks’ open, interoperable, and future-ready data ecosystem.

Boost business confidence

Migrating data warehouses and ETL workloads to Databricks is a transformational business decision, but the business needs to be confident in the data quality, compatibility, and project risk. Lakebridge addresses these concerns by providing clear visibility for every step of your data warehouse migration. With real-time dashboards, automated validation, and end-to-end traceability, Lakebridge ensures that business leaders can move forward with clarity and assurance throughout the various phases of the migration, turning migration from a source of uncertainty into a strategic advantage.

What’s next

Databricks is committed to enhancing the migration process, making it faster, smarter, and more predictable. In the coming months, a next-generation migration powered by advanced AI will be introduced.

Coming soon, Lakebridge will incorporate:

  • Mosaic AI–powered code conversion, leveraging reinforcement learning to improve translation accuracy.
  • A dedicated Data Migration module to automate and optimize data movement workflows.
  • A graphical user interface (GUI) for intuitive navigation, migration tracking, and validation insights.

These enhancements are part of our broader vision to deliver a GenAI-powered migration solution built for enterprise needs that applies large language models with guardrails in place. Unlike general-purpose AI tools, Lakebridge AI is purpose-built to ensure all generated output is validated, traceable, and grounded in source logic.

Customers can opt into GenAI-powered features at their own pace, ensuring greater confidence, speed, and control throughout the migration journey.

Get started

Lakebridge Foundations is now free for Databricks customers and partners. Whether you’re planning your first migration or scaling to hundreds of workloads, Lakebridge provides the automation, intelligence, and reliability you need to modernize with confidence.

Never miss a Databricks post

Subscribe to the categories you care about and get the latest posts delivered to your inbox