I would like to write an ETL Transform job, to
- query elements from BigQuery
- apply some transformation on them
- serialise into the
TFRecords
format - dump the
.tfrecords
in a Cloud Storage bucket
The solution can possibly use Google Cloud products.
I tried using App Engine cron jobs, along with Google Cloud Python APIs but App Engine cannot be deployed with tensorflow (which is necessary for TFRecords
serialisation).
Any recommendation on how this could be done smoothly?