I have two hive tables -
Table1: col1 col2 --------- 1 b 2 c 3 d
Table2: col1 col2 col3 ---------------- 1 b x 1 b y 1 b z 2 c x 3 d x
Here as you can see in Table 2 I have duplicate values for col1 and col2. I want to remove these duplicates when I do select. I dont care about col3.
I am looking for select query which will give me -
col1 col2 col3 ---------------- 1 b y 2 c x 3 d x
I tried using different JOINs but end up getting duplicate values.