Disclaimer: Not a code query, but directly related to it.
I find it difficult in Databricks to handle such scenarios where there's no shell prompt; just the notebooks. I have two clusters on Azure dev & prod. The database & tables can be accessed via Databricks Notebooks of separate environments.
The problem arises when I want to:
- Query data in dev, but from prod environment & vice-versa. On a sql prompt, it just seems impossible to achieve this.
- If I want to populate dev table from prod table; there's no way to establish a connection from within the dev notebook to query the table of prod environment.
The workaround I've established for now to copy the prod data into dev is:
- Download full dump from production in
csvin my local machine. - Upload to
DBFSin dev environment. - Create temp table/directly insert the
csvin the dev table.
Any comments on how I remove this download-upload process & query prod directly from dev notebook?