I have Avro files (compressed using BZIP2) stored in HDFS and S3 and I want to load them into Amazon Redshift. The copy command gives an error:
error: Invalid AVRO file
code: 8001
context: Cannot init avro reader from s3 file File header contains an unknown codec
Does Redshift not support compressed Avro files?
If that's the case, what is the next best option to load this data into Redshift (without converting them back into Avro files without compression).
Can I use sqoop?