Say I have a Flink application to filter, transform, and process stream.
How to break this application into two jobs and communicate b/w them without using the intermittent store.
Refer to the below image for dataflow.
Reason for the use case :
Event size : 2KB, Event lite : 200B, TPS: 1M
For effective usage of Java Heap to store more events at any given time transformation is required. Doing all three on single TaskManager has a disadvantage of storing the ingested events as well, where nearly 80% of events are not required.
Running these jobs on different task managers will give great flexibility in scaling the processing function.
Need help in achieving this, any suggestion is welcome. Also trying to understand how multiple jobs can be submitted via a single Flink Application.