NAN propagation and IEEE 754 standard

Question

I am designing a new microprocessor instruction set (www.forwardcom.info) and I want to use NAN propagation for tracing errors. However, there are a number of oddities in the IEEE 754 floating point standard that prevent this.

First, the reason why I want to use NAN propagation rather than error trapping is that I have vector registers with variable length. If, for example, I have a float vector with 8 elements and I have 1/0 in the first element and 0/0 in the sixth element, then I get only one trap, but if I run the same program on a computer with half the vector length then I get two traps: one for infinity and one for NAN. I want the result to be independent of the vector length so I need to rely on the propagation of NAN and INF rather than trapping. The NAN and INF values will propagate through the calculations so that they can be checked in the final result. The NAN representation contains some bits called payload that can be used for information about the source of error.

However, there are two problems in the IEEE 754 floating point standard that prevent reliable propagation of NAN values.

The first problem is that the combination of two NANs with different payloads is just one of the two values. For example NAN1 + NAN2 gives NAN1. This violates the fundamental principle that a+b = b+a. The compiler can swap the operands so that you get different results on different compilers or with different optimization options. I prefer to get the bitwise OR combination of the two payloads. This will work if you have one bit for each error condition, but of course not if the payload contains more complex information (such as NAN boxing in languages with dynamic types). The standards committee actually discussed the OR'ing solution (see http://grouper.ieee.org/groups/754/email/msg01094.html). I don't know why they rejected this proposal.

The second problem is that the min and max functions do not propagate the NAN if only one of the inputs is a NAN. In other words, min(1,NAN) = 1. Reliable NAN propagation would of course require that min(1,NAN) = NAN. I have no idea why the standard says this.

In the new microprocessor system, named ForwardCom, I want to avoid these unfortunate quirks and specify that NAN1 + NAN2 = NAN1 | NAN2, and min(1,NAN) = NAN.

And now to my questions: First, do I need an option switch to change between strict IEEE conformance and the reliable NAN propagation? Quoting the standard:

Quiet NaNs should, by means left to the implementer’s discretion, afford retrospective diagnostic information inherited from invalid or unavailable data and results. To facilitate propagation of diagnostic information contained in NaNs, as much of that information as possible should be preserved in NaN results of operations.

Note that the standard says "should" here, where it has "shall" elsewhere. Does that mean that my deviation from the recommendation is permissible?

And the second question: I cannot find any examples where NAN propagation is actually used for tracing errors. Maybe this is because of the weaknesses in the standard. I want to define different payload bits for different error conditions, for example:

0/0, 0*∞, ∞/∞, modulo(1,0), modulo(∞,1), ∞-∞, and other errors involving infinity and division by zero.
sqrt(-1), log(-1), pow(-1,0.1), and other errors deriving from logarithms and powers.
asin(2) and other mathematical functions.
explicit assignment. This can be useful when a variable is initialized to a NAN.

There are plenty of vacant bits for user-defined error codes.

Has this been done before, or do I have to invent everything from scratch? Are there any problems that I have to consider (other than NAN boxing in certain languages)

Any comparson involving NAN returns false. Even x==x returns false when x is NAN. min(x,y) can be implemented as min(x,y) = x < y ? x : y This will return y if any of the inputs is NAN, so min(1,NAN) = NAN, and min(NAN,1) = 1. This is illogical. We would expect min(x,y) and min(y,x) to be the same — A Fog
@NicolBolas: NaN does not exist solely to record an invalid operation. Payloads are used to convey information by some users, and the IEEE 754 did and does consider that in its deliberations. — Eric Postpischil
@NicolBolas: It is not generally a goal of the IEEE 754 committee to make an “intrinsic” out of “spelled out” code. It is certainly a goal to standardize useful operations in beficial ways. However, in this case, the “spelled out” code is not informative as min might be formed from x < y ? x : y or x > y ? y : x. These would produce different results for x = 1, y = NaN, and choosing one would be arbitrary. It would be preferable if the operation were commutative, making it not subject to changes when order of computation changes. — Eric Postpischil
@AFog: The current draft for the next IEEE 754 revision contains both NaN-favoring minimum and maximum and number-favoring minimumNumber and maximumNumber. This means an application would be able to choose what suits it, but your instruction set would have to support both if you intend it to provide conformance. — Eric Postpischil

Simon Byrne Simon Byrne · Accepted Answer · 2018-02-27T19:31:55

Yes, you are allowed to deviate from the "should"s. From the spec (§1.6):

― may indicates a course of action permissible within the limits of the standard with no implied preference (“may” means “is permitted to”)

― shall indicates mandatory requirements strictly to be followed in order to conform to the standard and from which no deviation is permitted (“shall” means “is required to”)

― should indicates that among several possibilities, one is recommended as particularly suitable, without mentioning or excluding others; or that a certain course of action is preferred but not necessarily required; or that (in the negative form) a certain course of action is deprecated but not prohibited (“should” means “is recommended to”).

Regarding the behaviour of min, the Intel implementation also differs from the IEEE spec. From the Intel instruction set reference for MINSD:

If a value in the second source operand is an SNaN, then SNaN is returned unchanged to the destination (that is, a QNaN version of the SNaN is not returned).

If only one value is a NaN (SNaN or QNaN) for this instruction, the second source operand, either a NaN or a valid floating-point value, is written to the result. If instead of this behavior, it is required that the NaN source operand (from either the first or second source) be returned, the action of MINSD can be emulated using a sequence of instructions, such as, a comparison followed by AND, ANDN and OR.

In other words, it corresponds to x < y ? x : y. (See Argument order to std::min changes compiler output for floating-point for more details: this is C++ std::min, not the C math library fmin that wraps the IEEE-754 NaN-propagating minimum operation.)

I'm not actually sure what particular sequence they have in mind, but there is an alternative approach suggested here https://github.com/JuliaLang/julia/issues/7866#issuecomment-51845730.

NAN propagation and IEEE 754 standard

4 Answers