In our application we are reading data from PubSub, using PubsubIO, in dataflow pipeline. Below is the code.
PCollection<String> pubsubMsg = pipeline
.apply(PubsubIO.readStrings().fromSubscription(options.getInputSubscription1()))
PCollection<String> groupByBigqueryResult = pubsubMsg.apply("Read from bigquery table",
ParDo.of(new ReadRawdataFromBiqueryTable()));
But when we attach the BigQuery read in this pipeline, since BigQuery read is slow as we are doing it in ParDo, seems there is some default flow control settings implemented in PubSub Subscriber, hence i can see the PubSub message flow rate is very slow, but if i comment out the BigQuery read implementation done in ReadRawdataFromBiqueryTable, then it is fast. How to override the flow control settings. Attached both the dataflow jobs 1.with bigquery read 2. commented out bigquery read part