What's the difference between NOT EXISTS vs. NOT IN vs. LEFT JOIN WHERE IS NULL?

Question

It seems to me that you can do the same thing in a SQL query using either NOT EXISTS, NOT IN, or LEFT JOIN WHERE IS NULL. For example:

SELECT a FROM table1 WHERE a NOT IN (SELECT a FROM table2)

SELECT a FROM table1 WHERE NOT EXISTS (SELECT * FROM table2 WHERE table1.a = table2.a)

SELECT table1.a FROM table1 LEFT JOIN table2 ON table1.a = table2.a WHERE table2.a IS NULL

I'm not sure if I got all the syntax correct, but these are the general techniques I've seen. Why would I choose to use one over the other? Does performance differ...? Which one of these is the fastest / most efficient? (If it depends on implementation, when would I use each one?)

Many common SQL engines give you the ability to see an execution plan. You can often spot significant differences in efficiency for logically equivalent queries in this way. The success of any method depends on factors such as table size, what indexes are present, and others. — Chris Farmer
@wich: no database cares about what exactly you return inside the EXISTS clause. You may return *, NULL or whatever: all this will be optimized away. — Quassnoi
@wich - why? Both here: techonthenet.com/sql/exists.php and here: msdn.microsoft.com/en-us/library/ms188336.aspx seem to use *... — froadie
@wich: this is not about "expressing interest". The query parser demands that you put something between SELECT and FROM, and * is just easier to type. Yes, SQL does bear some resemblance to a natural language, but it is parsed and executed by a machine, a programmed machine. It's not as if it will ever suddenly break into your cubicle and shout "stop demanding the extra fields in an EXISTS query because I'm f**g sick of parsing them and then throwing them away!". It's OK with a computer, really. — Quassnoi
@Quassnoi if you wrote code for the sole purpose of a machine interpreting it the code would look horrible, and unfortunately quite a few people work like that. If however you write code in another optic, writing code to express what you want the machine to do as a communiqué to your peers you will write better and more maintainable code. Be smart, write code for people, not for the computer. — wich

Quassnoi · Accepted Answer · 2010-02-11T18:42:38

In a nutshell:

NOT IN behaves differently from the other two: it never matches if the list contains even a single NULL.
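To make the NULL behaviour concrete, here is a minimal sketch that runs all three forms against the same data. It uses Python's built-in sqlite3 module with an in-memory database purely for convenience; SQLite is not one of the engines compared in this answer, and the sample rows are made up for illustration. The LEFT JOIN form here filters on table2.a IS NULL, which is what makes it an anti-join.

```python
import sqlite3

# In-memory SQLite database; the table contents are illustrative only.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE table1 (a INTEGER);
    CREATE TABLE table2 (a INTEGER);
    INSERT INTO table1 VALUES (1), (2), (3);
    INSERT INTO table2 VALUES (1), (NULL);  -- note the single NULL
""")

queries = {
    "NOT IN": (
        "SELECT a FROM table1 "
        "WHERE a NOT IN (SELECT a FROM table2)"
    ),
    "NOT EXISTS": (
        "SELECT a FROM table1 "
        "WHERE NOT EXISTS (SELECT * FROM table2 WHERE table1.a = table2.a)"
    ),
    "LEFT JOIN / IS NULL": (
        "SELECT table1.a FROM table1 "
        "LEFT JOIN table2 ON table1.a = table2.a "
        "WHERE table2.a IS NULL"
    ),
}

# Run each query and collect the returned values in sorted order.
results = {
    name: sorted(row[0] for row in conn.execute(sql))
    for name, sql in queries.items()
}

for name, rows in results.items():
    print(f"{name}: {rows}")
```

With this data, NOT IN returns no rows at all, because comparing any value against the NULL yields UNKNOWN rather than TRUE, while NOT EXISTS and LEFT JOIN / IS NULL both return 2 and 3. If the column in table2 is declared NOT NULL, or the subquery filters out NULLs, the three forms return the same rows.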

In MySQL, NOT EXISTS is a little bit less efficient.
In SQL Server, LEFT JOIN / IS NULL is less efficient.
In PostgreSQL, NOT IN is less efficient.
In Oracle, all three methods are the same.
