I have json files, volume is approx 500 TB. I have loaded complete set into hive data warehouse.
How would I validate or test the data
that was loaded into hive warehouse. What should be my testing strategy
?
Client want us to validate the json data. Whether the data loaded into hive is correct ot not. Is there any miss? If yes, which field it was?
Please help.