Skip to main content

Databricks Repos

Ship code faster with repository-level Git operations in Databricks

Illustration node

How does it work?

databricks-repos-01

Standardize development across data projects

Git workflows and operations in Databricks help integrate data projects into larger software operations at your company. Pull changes, commit, compare and more, from the Databricks Repos UI or API.

Automate Git workflows

Automate Git workflows

The Repos REST API enables you to integrate data projects into CI/CD pipelines. You can use popular CI/CD tools to call the Repos API and update a repo to the latest version of a specific Git branch.

Use your existing Git provider

Use your existing Git provider

Native integration with your preferred Git provider. Clone remote repos, manage branches, pull and push changes, and visually compare differences in Databricks.

databricks-repos-04

Repo-level access control

In addition to platform-level encryption, Repos features allow lists to control access to critical Git repositories, and secrets detection to scan for access key leaks.

Resources

eBook

2021-11-EB-Big-Book-of-Machine-Learning-TY-tile-362x190-2

Discover the most exciting new ML use cases with code samples, how-tos and commentary

Documentation

git-integration-resource-tile

Repos for Git integration | Databricks on AWS

Blog

productionize-data-science-repos-resource-tile

Find out how Repos enables a better developer experience in Databricks

Ready to get started?