We have JSON files totaling ~100 GB in Azure Data Lake Store. We need to convert them to CSV files and save them to a different folder in the same Azure Data Lake Store. What options are available?
2 Answers
You have a couple of choices for this. It is typically a simple two-step process: extract and output.
A. You can run an Azure Data Lake Analytics (ADLA) U-SQL job to do this. Here is an example of a JSON extractor in U-SQL: https://github.com/Azure/usql/tree/master/Examples/DataFormats/Microsoft.Analytics.Samples.Formats (a minimal U-SQL sketch is shown after this list).
B. Another choice is to create an HDInsight cluster to transform the data, using whichever application you prefer. Here is an example of someone doing this in Pig: https://acadgild.com/blog/converting-json-into-csv-using-pig/
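For option A, a minimal U-SQL sketch might look like the following. It assumes the sample formats assembly (Microsoft.Analytics.Samples.Formats) and Newtonsoft.Json have already been registered in your ADLA database, and the field names (id, name, value) and paths are placeholders for your actual schema and folders:

```
// Reference the sample JSON formats assembly and Newtonsoft.Json
// (both must be registered in the ADLA database beforehand).
REFERENCE ASSEMBLY [Newtonsoft.Json];
REFERENCE ASSEMBLY [Microsoft.Analytics.Samples.Formats];

USING Microsoft.Analytics.Samples.Formats.Json;

// Extract fields from the JSON input; the field names and types here
// (id, name, value) are placeholders for your real schema.
@json =
    EXTRACT id string,
            name string,
            value double
    FROM "/input/data.json"
    USING new JsonExtractor();

// Write the rowset out as CSV to a different folder in the same store.
OUTPUT @json
TO "/output/data.csv"
USING Outputters.Csv(outputHeader : true);
```

If the ~100 GB is spread across many files, the EXTRACT can point at a file-set pattern (for example "/input/{*}.json") instead of a single path.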
I have tried this with Azure Data Factory and it's straightforward, with zero coding. The source and sink were both ADLS. There was nothing to change in the pipeline beyond a simple one-to-one mapping. We were not concerned with performance since it's a batch job for us; below are quick stats on the performance.
> Data Read: 42.68 GB
> Data Written: 12.97 GB
> Data volume: 42.68 GB
> Rows: 54,520,950
> Throughput: 3.97 MB/s
> Billed duration for data movement: 03:03:41