Azure Data Factory: Schema Change

Question

I have a blob with below format. First row gives header details and next 2 rows as data record and final row as trailer record which contains data records count. While record the file I want to define my schema as single field and once I remove my trailer record I want to convert it into proper schema format with "|" as delimiter. Could you let me know how can I achieve this please.

DeptID|DeptNAme
1|A
2|B
2

Thanks in Advance Kumar

Joseph Xu Joseph Xu · Accepted Answer · 2021-02-01T02:24:30

update:

After SurrogateKey1 activity mentioned previously at Step4, we can use Select activity to select the column DeptID|DeptNAme.
Then we can use DerivedColumn1 activity, expressions split({DeptID|DeptNAme},'|')[1] and split({DeptID|DeptNAme},'|')[2] to generate new columns manually.
The data preview is as follows:

@Kumar G we can use data-flow in ADF to achieve that.
For example, I created a simple test.

I created a bolb in Azure Data Lake Gen2 as follows:
I created a data source of this blob , select Pipe (|) as Column delimiter and First row as header. The schema is as follows:
I created a mapping data flow in ADF and the source data preview is as follows:
In SurrogateKey1, type in Row_No as Key Column, 1 as Start Value. The data preview is as follows:
In Conditional split1, use Row_No < 3 to exclude the last line.
In Select1, not select Row_No column, The data preview is as follows:

That's all!

Azure Data Factory: Schema Change

1 Answers