I've been using BigQuery for about 2 months. During that time I've used streaming insertion to add thousands of entries every minute. I've been able to then query over that data within a few minutes, if not practically instantly.
Starting a few days ago, though, one of my tables suddenly started showing delays in data availability ranging from 20 to 60 minutes. This only occurs with one of my tables. Data inserted into other tables remains available nearly instantly.
Is this kind of data availability delay normal for BigQuery?
The table experiencing this problem is accuAudience.trackPlays. I will gladly provide project ID and other info to a Google team member.
The results of the streaming inserts into the problematic table are:
{'kind': 'bigquery#tableDataInsertAllResponse'}
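As far as I understand, an insertAll response reports rejected rows under an `insertErrors` key, so a response that contains only `kind` should mean every row was accepted. Below is a minimal sketch of the check I am relying on; the helper name `failed_rows` is my own and not part of any BigQuery client library, and it assumes the response has already been parsed into a Python dict.

```python
def failed_rows(response):
    """Map row index -> list of error reasons from a parsed insertAll response.

    `failed_rows` is a hypothetical helper. It only reads the `insertErrors`
    field, which is absent when no rows were rejected.
    """
    failures = {}
    for entry in response.get("insertErrors", []):
        reasons = [err.get("reason", "unknown") for err in entry.get("errors", [])]
        failures[entry["index"]] = reasons
    return failures


# The response I get back for the problematic table:
response = {'kind': 'bigquery#tableDataInsertAllResponse'}
print(failed_rows(response))  # an empty dict means no rows were rejected
```

So the inserts themselves look clean; the delay appears to be between a successful insert and the rows showing up in query results.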
Example query from the problematic table, accuAudience.trackPlays (ordered by date desc):
ROW  DATE                     COUNT
1    2015-03-30 12:35:32 UTC  67
2    2015-03-30 12:35:31 UTC  65
3    2015-03-30 12:35:30 UTC  56
4    2015-03-30 12:35:29 UTC  45
5    2015-03-30 12:35:28 UTC  60
The same query run seconds later against a different table (accuAudience.trackSkips). Note that the latest dates are about 30 minutes ahead of those in the earlier query.
ROW  DATE                     COUNT
1    2015-03-30 13:04:03 UTC  1
2    2015-03-30 13:04:02 UTC  1
3    2015-03-30 13:04:01 UTC  3
4    2015-03-30 13:04:00 UTC  3
5    2015-03-30 13:03:59 UTC  6
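To put a number on the gap, here is a small sketch that subtracts the newest timestamp in each result set. It assumes the two queries ran close enough together that the difference approximates the availability delay; the function names `parse_utc` and `lag_seconds` are just for illustration.

```python
from datetime import datetime, timezone


def parse_utc(text):
    """Parse a timestamp in the form shown in the query results above."""
    parsed = datetime.strptime(text, "%Y-%m-%d %H:%M:%S UTC")
    return parsed.replace(tzinfo=timezone.utc)


def lag_seconds(delayed_latest, fresh_latest):
    """Seconds by which the delayed table's newest row trails the fresh one."""
    return int((parse_utc(fresh_latest) - parse_utc(delayed_latest)).total_seconds())


# Newest row in trackPlays versus newest row in trackSkips
lag = lag_seconds("2015-03-30 12:35:32 UTC", "2015-03-30 13:04:03 UTC")
print(lag, "seconds, roughly", round(lag / 60), "minutes")
```

For the rows above this works out to 1711 seconds, a little under 29 minutes.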
If there's other information needed, please let me know!
select * queries helped find the data. But why does it sit in the streaming buffer for hours? A couple of similar issues, for reference: stackoverflow.com/questions/39407558/… stackoverflow.com/questions/22867090/… – Anton Tarasenko