0
votes

How can I get the list of tables from a Google BigQuery dataset using apache beam with DataflowRunner?

I can't find how to get the tables from a specified dataset. I want to migrate tables from a dataset located in the US to one in the EU using Dataflow's parallel processing programming model.

Please tag whether you are using Java or Python. Thanks! – Haris Nadeem
Using Java; Dataflow with Python still has some open issues... – user2291521

3 Answers

0
votes

Import the library

from google.cloud import bigquery

Create a BigQuery client

client = bigquery.Client(project='your_project_name')

Create a reference to the dataset

dataset_ref = client.dataset('your_data_set_name')

Make API request

tables = list(client.list_tables(dataset_ref))
if tables:
    for table in tables:
        print('\t{}'.format(table.table_id))

Reference: https://googlecloudplatform.github.io/google-cloud-python/latest/bigquery/usage.html#datasets

0
votes

You can try the google-cloud-examples Maven repo. It contains a class named BigQuerySnippets that makes an API call to fetch table metadata, from which you can retrieve the schema. Note that the API quota limit is a maximum of 6 concurrent requests per second.
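For reference, fetching a table's metadata and schema with the BigQuery Java client library looks roughly like this (a sketch; the project, dataset, and table names are placeholders, and application-default credentials are assumed):

```java
import com.google.cloud.bigquery.BigQuery;
import com.google.cloud.bigquery.BigQueryOptions;
import com.google.cloud.bigquery.Schema;
import com.google.cloud.bigquery.Table;
import com.google.cloud.bigquery.TableId;

public class TableSchemaExample {
  public static void main(String[] args) {
    // Uses application-default credentials; names below are placeholders.
    BigQuery bigquery = BigQueryOptions.getDefaultInstance().getService();

    // Fetch the table's metadata with a single API call.
    Table table = bigquery.getTable(TableId.of("my_project", "my_dataset", "my_table"));

    // The schema lives on the table's definition.
    Schema schema = table.getDefinition().getSchema();
    System.out.println(schema);
  }
}
```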

0
votes

The purpose of Dataflow is to run pipelines, so making arbitrary API requests is not part of its model. You have to use the BigQuery Java client library to list the tables yourself and then feed the result into your Apache Beam pipeline.

import com.google.api.gax.paging.Page;
import com.google.cloud.bigquery.*;
import com.google.cloud.bigquery.BigQuery.TableListOption;

BigQuery bigquery = BigQueryOptions.getDefaultInstance().getService();
DatasetId datasetId = DatasetId.of(projectId, datasetName);
Page<Table> tables = bigquery.listTables(datasetId, TableListOption.pageSize(100));
for (Table table : tables.iterateAll()) {
  // e.g. collect table.getTableId().getTable() into a List<String>
}
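Putting it together, one way to wire the listed tables into a Beam pipeline is a read/write pair per table. This is a sketch under assumptions: the project and dataset names are placeholders, and CREATE_NEVER assumes the destination tables already exist with matching schemas (otherwise use CREATE_IF_NEEDED together with .withSchema(...)):

```java
import java.util.ArrayList;
import java.util.List;
import com.google.cloud.bigquery.BigQueryOptions;
import com.google.cloud.bigquery.DatasetId;
import com.google.cloud.bigquery.Table;
import org.apache.beam.sdk.Pipeline;
import org.apache.beam.sdk.io.gcp.bigquery.BigQueryIO;
import org.apache.beam.sdk.options.PipelineOptions;
import org.apache.beam.sdk.options.PipelineOptionsFactory;

public class MigrateTables {
  public static void main(String[] args) {
    // List the source tables with the BigQuery client library (outside the pipeline).
    List<String> tableNames = new ArrayList<>();
    for (Table t : BigQueryOptions.getDefaultInstance().getService()
            .listTables(DatasetId.of("my_project", "us_dataset")).iterateAll()) {
      tableNames.add(t.getTableId().getTable());
    }

    PipelineOptions options = PipelineOptionsFactory.fromArgs(args).create();
    Pipeline p = Pipeline.create(options);

    // One read/write branch per table; Dataflow runs the branches in parallel.
    for (String name : tableNames) {
      p.apply("Read " + name,
              BigQueryIO.readTableRows().from("my_project:us_dataset." + name))
       .apply("Write " + name,
              BigQueryIO.writeTableRows()
                  .to("my_project:eu_dataset." + name)
                  // Assumes the EU tables already exist with matching schemas;
                  // otherwise use CREATE_IF_NEEDED plus .withSchema(...).
                  .withCreateDisposition(BigQueryIO.Write.CreateDisposition.CREATE_NEVER)
                  .withWriteDisposition(BigQueryIO.Write.WriteDisposition.WRITE_TRUNCATE));
    }
    p.run();
  }
}
```

Note that BigQueryIO stages data through a GCS temp location, so check the regions of your temp bucket when copying between US and EU.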