Flink consumer lag after union streams updated in different frequency

Question

We are using Flink 1.2.1 and consuming from two Kafka streams by unioning one stream into the other and processing the unioned stream, e.g. stream1.union(stream2). However, stream2 has more than 100 times the volume of stream1, and we are seeing a huge consumer lag (more than 3 days of data) for stream2 but very little lag for stream1. We already have 9 partitions, but a parallelism of 1. Would increasing parallelism solve the consumer lag for stream2, or should we not use union in this case at all?

What's the TimeCharacteristic for the execution environment? — kkrugler
We are using the default TimeCharacteristic, ProcessingTime. — user3285517

kkrugler · Accepted Answer · 2019-03-15T15:10:31

The .union() shouldn't be contributing to the time lag, AFAIK.

And yes, increasing parallelism should help, if in fact the lag in processing is due to your consuming operators (or sink) being CPU constrained.

If the problem is with something at the sink end which can't be helped by higher parallelism (e.g. you are writing to a DB, and it's at its maximum ingest rate), then increasing the sink parallelism won't help, of course.
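
To make the parallelism point concrete for the 9-partition setup in the question, here is a minimal sketch of how partitions spread across source subtasks. It assumes a simple round-robin rule (partition i goes to subtask i % parallelism), which is an illustration rather than Flink's actual assignment code. The class and method names are hypothetical.

```java
import java.util.Arrays;

class PartitionSpread {
    // Counts how many partitions each source subtask would read under the
    // assumed rule "partition i -> subtask (i % parallelism)".
    // This is a simplified model, not the Flink Kafka connector's own logic.
    static int[] partitionsPerSubtask(int partitions, int parallelism) {
        int[] counts = new int[parallelism];
        for (int i = 0; i < partitions; i++) {
            counts[i % parallelism]++;
        }
        return counts;
    }

    public static void main(String[] args) {
        // 9 partitions, as in the question, at a few candidate parallelism values.
        for (int p : new int[] {1, 3, 9, 12}) {
            System.out.println("parallelism " + p + " -> "
                    + Arrays.toString(partitionsPerSubtask(9, p)));
        }
    }
}
```

Under this model, a parallelism of 1 means a single subtask reads all 9 partitions, while any parallelism above 9 leaves the extra source subtasks with no partition to read. So for the source itself, going beyond the partition count adds nothing; downstream operators may still benefit if they are the CPU-bound part.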

