0
votes

Edit: Need to identify which one is better for exporting huge data volume from Teradata - Sqoop, TPT, or fexp

OP: I am already aware that teradata's fast export and TPT cannot be used to export data directly to Hadoop. I can bring data to local environment and move it to hadoop parallely.

I want to know which tool extracts the data from Teradata in the most efficient way.

I have to extract dataset having huge data volume (almost 25 billion records ~ 15 TB in size). Of course the data in Teradata is well partitioned and I am going to split my extraction strategy based on partitions and Unique PI.

I was not able to find enough content which would provide direct comparison between Teradata utilities and Sqoop.

Which tool would make least impact on currently running jobs in Teradata environment and extract the data in most optimized way.

1
TPT can be used to export directly to Hadoop, read the links I posteddnoeth

1 Answers

1
votes

Of course Teradata's FastExport can't be used to export to Hadoop directly, it's an old legacy tool which is not enhanced anymore.

Any new development should be done using TPT, e.g.

Using the DataConnector Operator to Write Files and Tables in Hadoop

Common Data Movement Jobs