xml data coming in text files. ingesting them flume and kafka to hdfs and saving them in .txt file format.
exiting use case : xml files are ingesting through flume →kafka→flumeintercepter (to verify the valid schema or not ) —-> sends to valid or invalid kafka tpoic →hdfs sink (valid and invalid ) saving in .txt files
new one is:
i need to take the valid kafka topic and need to write own flume interceptor to convert the xml data to avro format and send to →hdfs sink (hdfs valid location ) final output need to be in avro file format..
any help would be appreciated
Thanks in advance ...