I have stored CSV files in AWS s3 daily basis. Bellow is my S3 file path structure:
s3://data-dl/abc/d=2019-09-19/2019-09-19-data.csv
In this structure, the date part of the s3 file path will be generated every day.
Now I want to use AWS glue for ETL to ship data from S3 to Redshift. To use this how can I add S3 path in the data catalog? I want to sync recent folders CSV file only.
Also for Job part how can I declare this dynamic path in Glue Pyspark script?