I'm writing a Beam data pipeline that reads from an unbounded source such as Kafka. I am not performing any analytic functions. I would like to transform the elements and write them to the sink only once the record count of the PCollection reaches a certain threshold, in order to throttle the data being sent to the sink.
I looked at the existing triggers but couldn't figure out whether they are a good fit.
AfterPane.elementCountAtLeast(threshold), which appears in the Composite Triggers section of the Beam Programming Guide (beam.apache.org/documentation/programming-guide/…). – Jeff Klukas
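For illustration, here is a minimal sketch of how that trigger might be wired up in the Java SDK. It assumes the elements stay in the global window and that a pane should fire repeatedly each time roughly `threshold` elements have arrived. The class name `CountTriggerSketch` and the helper `batchByCount` are hypothetical names made up for this example, not part of Beam.

```java
import org.apache.beam.sdk.transforms.windowing.AfterPane;
import org.apache.beam.sdk.transforms.windowing.GlobalWindows;
import org.apache.beam.sdk.transforms.windowing.Repeatedly;
import org.apache.beam.sdk.transforms.windowing.Window;
import org.apache.beam.sdk.values.PCollection;
import org.joda.time.Duration;

// Hypothetical helper class for illustration only.
public class CountTriggerSketch {

    // Re-windows the input into the global window and attaches a trigger that
    // fires every time at least `threshold` elements have been buffered.
    static <T> PCollection<T> batchByCount(PCollection<T> input, int threshold) {
        return input.apply(
            "TriggerEveryNElements",
            Window.<T>into(new GlobalWindows())
                // Repeatedly.forever re-arms the count trigger after each firing.
                .triggering(Repeatedly.forever(AfterPane.elementCountAtLeast(threshold)))
                // A non-default trigger requires an explicit allowed lateness.
                .withAllowedLateness(Duration.ZERO)
                // Emit each element in only one pane instead of accumulating.
                .discardingFiredPanes());
    }
}
```

Two caveats apply to this sketch. The trigger only takes effect at the next aggregation step (for example a GroupByKey or Combine), so it does nothing for a pipeline that maps elements straight into the sink. The count is also a lower bound evaluated per key and window, so a pane may contain more than `threshold` elements. If fixed-size batches per key are what you need, `GroupIntoBatches.ofSize(...)` may be a closer fit.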