I have 6,500 csv files with ~250 different schema's. i.e. These files are from the F.D.I.C (USA bank regulator) dataset. They have been uploaded to a google cloud storage bucket:
Each financial quarter has ~250 different csv's. Each csv, within a financial quarter, has a different schema:
There are ~250 unique schemas. The schema's repeat themselves, each financial quarter. The csv files go back 100 financial quarters to 1992:
Multiple CSV's,with the same schema, can be uploaded using a wild card. e.g. gs/path/*.csv
. However each table name is not being auto generated from the file name. The UI requires a table name as an input:
How does one load multiple csv files with different schemas into bigquery?
bq
command line tool to achieve that. - khan