Counting Values in a repeated field in BigQuery

Question

I want to select rows that have more thank k values in a repeated field. (consider for example selecting user that have more than 3 email addresses)

In Standard SQL I know I can use

SELECT * FROM dataset.users
WHERE array_length(email_address) > 3

But what is the way to do this in BigQuery legacy SQL?

Elliott Brossard Elliott Brossard · Accepted Answer · 2016-09-19T16:19:47

No need for a subquery; you should be able to filter with OMIT RECORD IF directly:

SELECT *
FROM dataset.users
OMIT RECORD IF COUNT(email_address) <= 3;

Do you mind commenting on why you want to use legacy SQL, though? If you encountered a problem with standard SQL I'd like to understand what it was so that we can fix it. Thanks!

Counting Values in a repeated field in BigQuery

2 Answers