Till now my architecture includes Storm/Spark streaming for data pre-processing but I have read somewhere that integration of Flume with Kafka has many benefits. I am doing real time streaming of 5 sources including sensor, video, structured, telematics and social media data.
Can anybody let me know why one should use kafka and flume together?