For over a year, Databricks has been partnering with Capital One to build a managed Disaster Recovery offering for the lakehouse. During a vendor outage, this collaboration paid off when Capital One was able to quickly fail over interactive analytics in their Databricks platform to a secondary region.
Technology outages are a matter of “when”, not “if”. The Capital One and Databricks partnership shows that with a solid data platform, guided by a robust disaster recovery strategy, even large-scale outages can be weathered with minimal disruption to the business.
Partnering with Databricks on multi-region resilience helps keep critical analytical workloads working through events like a regional disruption. —Shehzad Mevawalla, Executive Vice President of Enterprise Data Technology, Capital One
Traditional backup tools only protect data. Traditional data warehouses may only protect data in their own proprietary format.
In contrast, the modern open data lakehouse is much more. It includes:
It is critical that all of these components be resilient to failure and can seamlessly resume operations in case of regional failures.
When it comes to disaster recovery, this presents a new set of challenges, including the ability to replicate all mission-critical elements of the lakehouse to a secondary cloud region with low latency and across a wide variety of asset types.
In collaboration with Capital One, Databricks has developed a managed Disaster Recovery solution to help tackle these challenges. It includes:
This successful collaboration highlights a few key ingredients for mission-critical workloads:
Capital One continues to push the envelope when it comes to cloud resiliency - expanding their Disaster Recovery coverage and pushing to reduce their Recovery Time Objective even lower.
Databricks is planning further improvements to the Managed Disaster Recovery solution, building on the lessons learned from past outages. Stay tuned for more details.
