My recent experience seems to point out that BigQuery with join performs much slower than that with only one table. Normally most queries will finish in no more than a few minutes. But the following join query takes more than 6000 seconds to finish with table1 having ~ 1 billion rows and table2 ~ 1 million rows.
SELECT a.*, coalesce(b.field4, a.field3)
FROM `table1` a left outer join `table2` b
on a.field1 = b.field1 and a.field2 = b.field2
Just want to ask the community if this is expected or need further investigation?
Computepart of the last stage (Stage5)READ $80, $81, $82, $83, $84, ... FROM __SHUFFLE0 WRITE $80, $81, $82, $83, $84, ... TO __output- bignano