0
votes

I'm approaching to Azure environment and watching tutorials/reading documents, but I'm trying to figure out how to setup a flow that enables the process that I will describe hereunder. The starting point are reports in .xlsx format produced monthly by Mktg Dept: the requirements are to bring them in Azure SQL DB so that data can be stored and analysed. Sofar I managed to put those files (previously manually converted in .csv format) in a BLOB storage and build an ADF pipeline that copy each file in a table on the SQL DB. The problem is that as far as I understood with ADF it's not possible to directly manage xlsx files, and I'm wondering how to set up an automated procedure that enables the conversion from .xlsx to .csv and save them on BLOB storage. I was thinking about adding to the pipeline a python script/Databricks notebook to convert format, but I'm not sure this could be the best solution. Any hint/reference to existing tutorial or resources would be very appreciated

1
Databricks can be pricy can you not manipulate the files before they go into blob storage? See this answer stackoverflow.com/questions/52271088/…Umar.H

1 Answers

0
votes

I found a tutorial which uses Logic Apps to do the conversion.

Datanovice indirectly suggested using a Custom activity to run either a C# or Python application to do the conversion for you.

The least expensive solution would be to do the conversion before uploading to blob, like Datanovice said.