I have a Dataflow setup with multiple pipelines that fetch data from Pub/Sub topics. Since these pipelines fan out and merge through chains of transforms and DoFns, I need to trace each ingested Pub/Sub message throughout the pipeline.
What would be the right way to do this? Some thoughts:
- Side inputs
- Wrapping each input to a ParDo in a context object that carries tracing IDs etc. (a bit unintuitive)
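
To make the second idea concrete, here is a rough sketch of what I have in mind. It is plain Python rather than actual Beam code, and the names `Traced`, `start_trace`, and `traced_step` are made up for illustration. The assumption is that every element is wrapped in an envelope at ingestion and that each step only touches the payload while the trace context rides along.

```python
import uuid
from dataclasses import dataclass, replace
from typing import Any, Callable, Tuple


@dataclass(frozen=True)
class Traced:
    """Hypothetical envelope: a payload plus the tracing context that travels with it."""
    payload: Any
    trace_id: str
    path: Tuple[str, ...] = ()  # names of the steps this element has passed through


def start_trace(payload: Any, message_id: str = "") -> Traced:
    # Reuse the message ID when one is available, otherwise generate a random one.
    return Traced(payload=payload, trace_id=message_id or uuid.uuid4().hex)


def traced_step(name: str, fn: Callable[[Any], Any]) -> Callable[[Traced], Traced]:
    """Wrap a payload-only function so the trace context is carried through unchanged."""
    def wrapper(element: Traced) -> Traced:
        return replace(
            element,
            payload=fn(element.payload),
            path=element.path + (name,),
        )
    return wrapper


# Two illustrative steps; in a pipeline these would be the bodies of the DoFns.
parse = traced_step("parse", lambda raw: raw.decode("utf-8"))
shout = traced_step("shout", str.upper)

msg = start_trace(b"hello", message_id="msg-123")
result = shout(parse(msg))
print(result.trace_id, result.path, result.payload)
```

My assumption is that each wrapped function could then be applied with something like `beam.Map`, so the DoFn logic itself stays unaware of tracing. What I don't like is that every transform has to accept and return the envelope instead of the raw payload, which is why it feels unintuitive.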
Thanks!