
We would like to do incremental loading of files from our on-premises file server to Azure Data Lake using Azure Data Factory v2.

Files are stored in the on-premises file server on a daily basis. We will have to run the ADFv2 pipeline at regular intervals during the day, and only the new, unprocessed files in the folder should be captured.

2 Answers

Our recommendation is to put the set of files for daily ingestion into /YYYY/MM/DD directories. You can refer to this example of how to use the system variable @trigger().scheduledTime to read files from the corresponding directory:

https://docs.microsoft.com/en-us/azure/data-factory/how-to-read-write-partitioned-data
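
As a rough sketch of that approach (the trigger, pipeline, parameter, and folder names below are illustrative assumptions, not taken from the linked doc), a schedule trigger can pass @trigger().scheduledTime into a pipeline parameter, and the dataset's folderPath can be an expression that formats that parameter into the /YYYY/MM/DD path:

    {
        "name": "DailyFilesTrigger",
        "properties": {
            "type": "ScheduleTrigger",
            "typeProperties": {
                "recurrence": {
                    "frequency": "Hour",
                    "interval": 4,
                    "startTime": "2018-01-01T00:00:00Z",
                    "timeZone": "UTC"
                }
            },
            "pipelines": [
                {
                    "pipelineReference": {
                        "type": "PipelineReference",
                        "referenceName": "IncrementalCopyPipeline"
                    },
                    "parameters": { "windowStart": "@trigger().scheduledTime" }
                }
            ]
        }
    }

In the source dataset, folderPath can then resolve to the matching date partition:

    "folderPath": {
        "value": "@concat('dailyfiles/', formatDateTime(pipeline().parameters.windowStart, 'yyyy/MM/dd'))",
        "type": "Expression"
    }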

In the source dataset, you can apply a file filter. You can filter by time, for example (calling a datetime function in the expression language), or by anything else that identifies a new file.

https://docs.microsoft.com/en-us/azure/data-factory/control-flow-expression-language-functions

Then, with a schedule trigger, you can execute the pipeline n times during the day.
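
For instance (a minimal sketch; the file naming pattern and the windowStart parameter are assumptions about how the incoming files might be named, with windowStart passed in from the trigger as in the previous answer), the dataset's fileName can be an expression built with formatDateTime so that each run only reads the file for its own window:

    "fileName": {
        "value": "@concat('export_', formatDateTime(pipeline().parameters.windowStart, 'yyyyMMdd'), '.csv')",
        "type": "Expression"
    }

A schedule trigger with a recurrence such as frequency "Hour" and interval 6 would then run the pipeline four times a day, each run picking up only the files that match its expression.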