I am new to Apache Flink and am trying to understand some best practices regarding scaling Flink streaming jobs along side with Kafka. Some questions I am not able to find suitable answers for include:
- How many streaming jobs can/should you be running? Is there a scalability issue with running too many streams? How much is too many?
- If we do run let's say 2,000 streams to meet a business need, what is the best way to manage these streams?
- What is the preferred way to read stream data from one stream to another? Can we join streams, perform continuous queries, etc...?
Thanks in advance for any support and apologize if these questions seems somewhat basic, but I'm trying to get a better handle on this technology. I've read through much of the documentation, but admittedly might not be putting some concepts together due to my lack of experience in this area. thanks for any help!