0
votes

I have established a data factory pipeling to download zip file(open https linked source Url reference: https://clinicaltrials.gov/AllPublicXML.zip). Unzip the same files to a folder that has multiple files of xml format.Copy and Extracting Zipped XML files from HTTP Link Source to Azure Blob Storage using Azure Data Factory Now I want to merge all XML files in the folder to a CSV file. I am using copy activity inside the data factory. While importing schema I am getting the following error message

Input XML file 'input/ALLPublicXML.zip' is invalid with parsing error 'Data at the root level is invalid. Line 1, position 1.'. Data at the root level is invalid. Line 1, position 1. . Activity ID: 358b3545-f49c-4bba-8adc-13fb3de9bb20 enter image description here

Not exactly sure what is going wrong, but it would be really helpful if someone can guide me with the procedure.

1

1 Answers

0
votes

This seems to be an encoding issue, can you check the file once for encoding, it should follow the below encoding formats:

enter image description here

There might be some extra characters in file which are not allowing to read, more information can be found here