It's really confusing that every Google document for dataflow is saying that it's based on Apache Beam now and directs me to Beam website. Also, if I looked for github project, I would see the google dataflow project is empty and just all goes to apache beam repo. Say now I need to create a pipeline, from what I read from Apache Beam, I would do : from apache_beam.options.pipeline_options
However, if I go with google-cloud-dataflow, I'll have error: no module named 'options'
, turns out I should use from apache_beam.utils.pipeline_options
. So, looks like google-cloud-dataflow is with an older beam version and is going to be deprecated?
Which one should I pick do develop my dataflow pipeline?