I am planning to prepare a server-less data pipeline with Google Cloud Platform. My plan is to use Dataflow/ Dataproc for batch processing data from three different sources.
My input sources are:
- Cloud SQL (MySQL)
- Cloud SQL (PostgreSQL)
- MongoDB
But after reading their documentation I got they don't have any input for cloud SQL or MongoDB.
Also I have checked their custom driver section but this is only for Java, but I am planning to use Python.
Is there any idea how I can ingest those 3 different sources with Data Flow/ Dataproc ?