Google BigQuery: Slow streaming inserts performance

Question

We are using BigQuery as an event logging platform.

The problem we face is very slow insertAll POST requests (https://cloud.google.com/bigquery/docs/reference/v2/tabledata/insertAll). It does not matter where they are fired from, server or client side.

The minimum is 900ms and the average is 1500ms, of which nearly 1000ms is connection time. This holds even at 1 request per second (so no throttling is involved).

We also use the Google Analytics Measurement Protocol, and its timings from the same machines are 50-150ms.

The solution described in "BigQuery streaming 'insertAll' performance with PHP" suggested using queues, but that seems to be overkill because we send no more than 10 requests per second.

The question is whether 1500ms is normal for streaming inserts and, if not, how to make them faster.

Additional information: If we send malformed JSON, the response arrives in 50-100ms.

Sjuul Janssen · Accepted Answer · 2014-11-16T10:47:27

In my experience, any request to BigQuery takes a long time. We tried using it as a database for performance data but are eventually moving away from it because of slow response times. As far as I can see, BQ is built for handling big requests within a 1 to 10 second response time; these are the requests BQ categorizes as interactive. BQ doesn't get faster by doing less. We stream quite a lot of records to BQ, but we always batch them up (per table) and run all requests asynchronously (or, if you have to, in another thread).
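A minimal sketch of the batching approach described above, not the code we actually run. It assumes a hypothetical `send_batch(table, rows)` callable that performs the real insertAll request (for example a thin wrapper around `Client.insert_rows_json` from the google-cloud-bigquery package); the class name `BatchingStreamer` and the parameters `max_rows` and `flush_interval` are made up for illustration. Rows are buffered per table and sent from a background thread, so the code that logs an event never waits on the slow request.

```python
import queue
import threading
from collections import defaultdict


class BatchingStreamer:
    """Buffers rows per table and sends them from a background thread.

    A table's buffer is flushed when it reaches max_rows, or when no new
    row has arrived for flush_interval seconds, or on close().
    send_batch(table, rows) is a hypothetical callable supplied by the
    caller; it is where the real insertAll request would happen.
    """

    def __init__(self, send_batch, max_rows=500, flush_interval=1.0):
        self._send_batch = send_batch
        self._max_rows = max_rows
        self._flush_interval = flush_interval
        self._queue = queue.Queue()
        self._stop = object()  # sentinel that tells the worker to exit
        self._worker = threading.Thread(target=self._run, daemon=True)
        self._worker.start()

    def add(self, table, row):
        """Returns immediately; the row is sent later by the worker."""
        self._queue.put((table, row))

    def close(self):
        """Flushes everything still buffered and stops the worker."""
        self._queue.put(self._stop)
        self._worker.join()

    def _run(self):
        buffers = defaultdict(list)
        while True:
            try:
                item = self._queue.get(timeout=self._flush_interval)
            except queue.Empty:
                # The queue went quiet: send whatever has accumulated.
                self._flush_all(buffers)
                continue
            if item is self._stop:
                self._flush_all(buffers)
                return
            table, row = item
            buffers[table].append(row)
            if len(buffers[table]) >= self._max_rows:
                self._send_batch(table, buffers.pop(table))

    def _flush_all(self, buffers):
        for table in list(buffers):
            rows = buffers.pop(table)
            if rows:
                self._send_batch(table, rows)
```

Usage would look like `streamer = BatchingStreamer(send_batch)`, then `streamer.add("events", {"user": 1})` wherever an event occurs, and `streamer.close()` on shutdown. Note that in this sketch an exception raised by `send_batch` would end the worker thread, so a real version needs error handling around the send.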

PS. I can confirm what Pentium10 says about failures in BQ. Make sure you retry whatever fails, and if it fails again, log it to a file so you can retry it later.
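The retry-then-log idea could be sketched roughly as follows. This is only an illustration under assumptions: `send_batch(table, rows)` is again a hypothetical callable that raises an exception when the insert fails, and the function name, the `attempts` and `delay` parameters, and the `failed_rows.jsonl` file name are invented for the example.

```python
import json
import time


def insert_with_retry(send_batch, table, rows, attempts=2, delay=0.5,
                      failed_log="failed_rows.jsonl"):
    """Tries send_batch up to `attempts` times.

    If every attempt raises, the batch is appended to failed_log as one
    JSON line so it can be replayed later. Returns True on success and
    False if the batch ended up in the log file.
    """
    for attempt in range(attempts):
        try:
            send_batch(table, rows)
            return True
        except Exception:
            # Wait briefly before the next attempt, but not after the last.
            if attempt + 1 < attempts:
                time.sleep(delay)
    with open(failed_log, "a", encoding="utf-8") as f:
        f.write(json.dumps({"table": table, "rows": rows}) + "\n")
    return False
```

A separate job can later read the log file line by line and feed each record back through the same function.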
