Import python module to python script in databricks

Question

I am working on a project in Azure DataFactory, and I have a pipeline that runs a Databricks python script. This particular script, which is located in the Databricks file system and is run by the ADF pipeline, imports a module from another python script located in the same folder (both scripts are located in in dbfs:/FileStore/code).

The code below can import the python module into a Databricks notebook but doesn't work when is imported into a python script.

sys.path.insert(0,'dbfs:/FileStore/code/')
import conn_config as Connect

In the cluster logs, I get: Import Error: No module named conn_config

I guess that the problem is related to the inability of the python file of recognizing the Databricks environment. Any help?

That really took a while 😉 Well, thanks anyway 😊 PS: You should still go on the tour ... — Wolf

Alex Ott Alex Ott · Accepted Answer · 2021-05-28T15:02:40

You can't use path with dbfs: in it - Python doesn't know anything about this file system. You have two choices:

Replace dbfs:/ with /dbfs/ (won't work on Community edition)
Copy file(s) from DBFS to local file system with dbutils.fs.cp("dbfs:/FileStore/code", "file:/tmp/code", True), and refer to that local file name: /tmp/code

Import python module to python script in databricks

3 Answers