select(), recv() and EWOULDBLOCK on non-blocking sockets

10

votes

I would like to know if the following scenario is real?!

select() (RD) on non-blocking TCP socket says that the socket is ready
following recv() would return EWOULDBLOCK despite the call to select()

c++c sockets

6

votes

For recv() you would get EAGAIN rather than EWOULDBLOCK, and yes it is possible. Since you have just checked with select() then one of two things happened:

Something else (another thread) has drained the input buffer between select() and recv().
A receive timeout was set on the socket and it expired without data being received.

4

votes

It's possible, but only in a situation where you have multiple threads/processes trying to read from the same socket.

4

votes

On Linux it's even documented that this can happen, as I read it.

See this question:

Spurious readiness notification for Select System call

3

votes

I am aware of an error in a popular desktop operating where O_NONBLOCK TCP sockets, particularly those running over the loopback interface, can sometimes return EAGAIN from recv() after select() reports the socket is ready for reading. In my case, this happens after the other side half-closes the sending stream.

For more details, see the source code for t_nx.ml in the NX library of my OCaml Network Application Environment distribution. (link)

1

votes

Though my application is a single-threaded one, I noticed that the described behavior is not uncommon in RHEL5. Both with TCP and UDP sockets that were set to O_NONBLOCK (the only socket option that is set). select() reports that the socket is ready but the following recv() returns EAGAIN.

1

votes

Yes, it's real. Here's one way it can happen:

A future modification to the TCP protocol adds the ability for one side to "revoke" information it sent provided it hasn't been received yet by the other side's application layer. This feature is negotiated on the connection. The other side sends you some data, you get a select hit. Before you can call recv, the other side "revokes" the data using this new extension. Your read gets a "would block" error because no data is available to be read.

The select function is a status-reporting function that does not come with future guarantees. Assuming that a hit on select now assures that a subsequent operation won't block is as invalid as using any other status-reporting function this way. It's as bad as using access to try to ensure a subsequent operation won't fail due to incorrect permissions or using statfs to try to ensure a subsequent write won't fail due to a full disk.

0

votes

It is possible in a multithreaded environment where two threads are reading from the socket. Is this a multithreaded application?

-1

votes

If you do not call any other syscall between select() and recv() on this socket, then recv() will never return EAGAIN or EWOULDBLOCK.

I don't know what they mean with recv-timeout, however, the POSIX standard does not mention it here so you can be safe calling recv().

select(), recv() and EWOULDBLOCK on non-blocking sockets

8 Answers