I have a million record DynamoDB table. I am using the Data Pipeline to export the DynamoDb table to S3. But the Data Pipeline exports the table as a group of raw json files in DynamoDB JSON format. After the Data Pipeline ran for an hour, the EMR failed due to timeout exception.
Is there any way to export the DynamoDB table as a CSV and increase the EMR timeout configuration in Data Pipeline?