I wanted to create one flat hive table from json data which his coming from another hive table(placed in one column-event_data). Below is the json data structure. I have created hive table by using later view but now i wanted to use pyspark with some UDF's to create the hive table.
'{"callId":"0000000","journey":{"channel":{"out":"[email protected]","outbound":"EMAIL"}},"application":{"componentId":"23456","name":"dfgt-ghy-svc","applicationReferenceId":"SRFC98756RD"},"servicingDetail":{"offerAttributes":{"id":"ADLC0110000"},"offerCommunicationAttributes":{"id":"CFRGTV10098","status":"SUCCESS"}},"customerInfo":{"calledInAccount":"ERFCVDG9801"},"correlationId":"9845-sd76-sdfr87","fulfiller":{"id":"DEFC1009","category":"TST","entity":"colleague"},"platform":{"name":"v-generation","id":"37664859"}}'
I need only 4 columns need to extract which are callId, correlationId, servicingDetail -offerAttributes-id,fulfiller-id
Please help me to do the same.