This query:
SELECT x
FROM dataset.table_a
WHERE x NOT IN (SELECT x FROM dataset.table_b)
returns zero records even though:
Field
x
intable_a
contains 1,326,932 distinct string valuesField
x
intable_b
contains 18,885 distinct string values
I do not understand why. Moreover, in BigQuery legacy SQL, this query returns the right answer.