mlops-databricks

Here is a diagram on how to do feature-branching with Azure Databricks + Azure DevOps.

To start, define a master branch folder in Databricks workspace - this is the one that only Azure DevOps pipeline would write to, and different feature branches. Here's a reference on syncing Databricks notebook to Azure DevOps repo: https://docs.microsoft.com/en-us/azure/databricks/notebooks/azure-devops-services-version-control

You should have a notebook synced to a feature branch, so the notebook .py script will be present in the feature branch
Then run the clone_to_master.py script provided here in Azure DevOps Pipeline, to automatically clone the .py script to the master branch notebook.
Once you approve and complete a pull request merging the feature branch into master, the notebook .py script in master will be cloned to the master notebook in Azure Databricks

Please note this is done at this time at a per notebook level. If you have multiple notebooks, you will need some orchestrating logic in the Pipeline to clone multiple notebooks.

For creating a new feature branch, create the feature branch in Azure DevOps, clone the notebook from master folder in Databricks, and sync it to the newly created branch.

Here is a sample pipeline definition in Azure DevOps (using the classic editor), though of course it can look very different depending on the scenario and pipeline experience (e.g. with YAML file definition).

Name		Name	Last commit message	Last commit date
Latest commit History 32 Commits
image		image
old		old
README.md		README.md
adb-ado.PNG		adb-ado.PNG
clone_to_master.py		clone_to_master.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

mlops-databricks

About

Releases

Packages

Languages

hudua/mlops-databricks

Folders and files

Latest commit

History

Repository files navigation

mlops-databricks

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages