I am trying to implement DevOps on ADF, and it was successful for pipelines whose activities fetch data from ADLS locations and SQL.

But now I have a pipeline in which one of the activities runs a jar file from a DBFS location, as shown below.

[Screenshot: pipeline activity that runs a jar file from a DBFS location]

This pipeline runs a jar file that sits in the DBFS location and then proceeds.

The connection parameters for the cluster are as shown below.

[Screenshot: cluster connection parameters]

While deploying the ARM template from the dev ADF to the UAT instance, which uses the UAT instance of Databricks, I was not able to override any of the cluster connection details from the arm_template_parameter.json file.

  1. How do I configure the workspace URL and cluster ID for the UAT/PROD environments at the time of ARM deployment? There is no entry for any of the cluster details in the arm_template_parameter.json file.

  2. As shown in the first picture, there is an activity which picks up the jar file from the DEV instance's DBFS location, using a system-generated jar file name. Will it fail when the ARM template for this pipeline is deployed to other environments? If so, how do I deploy the same jar file with the same name to the DEV/PROD Databricks DBFS locations?

Any leads appreciated!

2 Answers

1 vote

What you have to do here is customize the parameterization template to fit your needs. This template controls which ARM template parameters are generated when you publish the factory, and it can be edited in the Parameterization template tab in the management hub.

By default, the workspace name and URL should already be generated in the ARM template. To include your existing cluster ID as well, add existingClusterId (the JSON field name in the linked service) to the template under Microsoft.DataFactory/factories/linkedServices, as in the sketch below.
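As a rough illustration only (not a drop-in file: a custom template replaces the default one entirely, so merge this section into the full default template you can copy from that tab), the linkedServices section could look like the following. The "*" applies the rule to every linked service, and "=" tells ADF to generate a parameter and keep the current value as its default:

```json
{
    "Microsoft.DataFactory/factories/linkedServices": {
        "*": {
            "properties": {
                "typeProperties": {
                    "existingClusterId": "="
                }
            }
        }
    }
}
```

After the next publish, the generated ARM template should then expose a parameter along the lines of <linkedServiceName>_properties_typeProperties_existingClusterId, which you can override per environment.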

While I don't like sharing documentation on this forum, we actually have this exact use case demoed at https://docs.microsoft.com/azure/data-factory/continuous-integration-deployment#example-parameterizing-an-existing-azure-databricks-interactive-cluster-id
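To give a concrete (and purely illustrative) idea of the override step, assuming a linked service named AzureDatabricks_LS and a made-up cluster ID, the parameters file passed to the UAT deployment (or the "Override template parameters" setting of an Azure DevOps release) might contain something like:

```json
{
    "$schema": "https://schema.management.azure.com/schemas/2015-01-01/deploymentParameters.json#",
    "contentVersion": "1.0.0.0",
    "parameters": {
        "factoryName": {
            "value": "adf-uat"
        },
        "AzureDatabricks_LS_properties_typeProperties_existingClusterId": {
            "value": "0412-123456-abc123"
        }
    }
}
```

The exact parameter name depends on your linked service name, so check the generated ARMTemplateParametersForFactory.json after publishing.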

0 votes

In my experience this is not implemented very well or intuitively at the moment. The best way I have personally found to achieve this is to parameterise your linked service and then either use references to a Key Vault that holds the correct value for the given environment, or manipulate the parameters.json file (which will now hold those parameters) in a DevOps pipeline using the File Transform task; a rough sketch of the Key Vault approach follows.
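For illustration only (the linked service name, Key Vault linked service name, secret name, workspace URL, and cluster ID below are all made up), a Databricks linked service that pulls its access token from a per-environment Key Vault might look roughly like this; the non-secret fields such as domain and existingClusterId are the ones you would still swap per environment via the parameterization template or the File Transform task:

```json
{
    "name": "AzureDatabricks_LS",
    "properties": {
        "type": "AzureDatabricks",
        "typeProperties": {
            "domain": "https://adb-1111111111111111.1.azuredatabricks.net",
            "existingClusterId": "0412-123456-abc123",
            "accessToken": {
                "type": "AzureKeyVaultSecret",
                "store": {
                    "referenceName": "KeyVault_LS",
                    "type": "LinkedServiceReference"
                },
                "secretName": "databricks-access-token"
            }
        }
    }
}
```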

Neither is very elegant, and ideally you would be able to reference Key Vault secrets using some syntax in the parameter expressions, but alas we are not there yet.