I am currently using the stopit library (https://github.com/glenfant/stopit) to set per-element processing timeouts in batch jobs. These jobs work on the direct runner, and I am able to time out functions that take too long.
What is the Beam way of setting a per-element processing timeout for a batch job?
Is there a way to set a processing timeout with a trigger for a Dataflow batch job?
My use case is extracting named entities from text. The NER step sometimes takes too long when the document being processed is very long.
It would be nice to get rid of the stopit dependency and move to a Beam-native solution.
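To make the behaviour I am after concrete, here is a minimal sketch of a per-element timeout. It is not my actual pipeline code: `extract_entities` is a hypothetical stand-in for the NER call, `process_with_timeout` is an illustrative helper name, and it uses the standard library's `concurrent.futures` rather than stopit or any Beam API so that it runs on its own. I assume the same wrapper could be called from a DoFn's `process` method, once per element.

```python
import concurrent.futures
import time


def extract_entities(text):
    """Hypothetical stand-in for the NER step; longer texts take longer."""
    time.sleep(0.001 * len(text))
    # Toy "entity" rule for illustration only: keep capitalized words.
    return [word for word in text.split() if word.istitle()]


def process_with_timeout(text, timeout_seconds, executor):
    """Return the entities, or None if extraction exceeds timeout_seconds."""
    future = executor.submit(extract_entities, text)
    try:
        return future.result(timeout=timeout_seconds)
    except concurrent.futures.TimeoutError:
        # The worker thread is not killed here; it keeps running in the
        # background until extract_entities returns on its own.
        return None


executor = concurrent.futures.ThreadPoolExecutor(max_workers=2)

# A short document finishes well within its one-second budget.
short_result = process_with_timeout("Alice met Bob", 1.0, executor)

# A long document (1000 characters, about one second of simulated work)
# exceeds its 50 ms budget and is skipped.
long_result = process_with_timeout("word " * 200, 0.05, executor)

# Do not block on the still-running slow task.
executor.shutdown(wait=False)
```

One limitation of this sketch is that the timed-out call is only abandoned, not interrupted, so it still occupies a thread until it finishes. That is part of why I am asking whether Beam or Dataflow offers something built in.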