What is the category-theoretical basis for the requirement that the Haskell "id" function must return the same value as passed in?

Question

How can the following all be true?

In the Hask category, the Objects are Haskell types and the Morphisms are Haskell functions. Values play no role in Hask.
The identity Morphism is defined as an arrow originating at an Object A and terminating at the same Object A.
The role of the identity Morphism is played by the Haskell id function.
The value returned from the Haskell id function must be identical to the value of the argument passed in.

If the identity morphism is defined in category theory as an arrow from an Object A back to the same Object A, isn't that description satisfied by any and every Haskell function of type f :: A -> A ?

There is another question whose answers might also perhaps cover this topic, but they seem to assume a level of familiarity with category theory that I unfortunately do not possess.

This seems to me a very basic beginner-level question. So can someone supply an answer using only language, symbols and notional constructs that a beginner can understand?

I think I understand what you're asking. The reason we know id must be the identity function follows from the polymorphic type, and <hand waving>has to do with parametricity and free theorems </hand waving> which are things I don't completely understand. — jberryman
Yes, looks like you've read the post I linked above. I do have a more specific question than the one posed there, so I'm hoping the answers here can be more directed. — nclark
Parametricity is not relevant here. There are plenty of categories that have multiple "parametric functions forall a. a -> a" (natural endomorphisms of the identity functor), but the identity on an object A is always determined by the identities id_A . f = f, g . id_A = g. An example of the former would be a version of Hask that incorporates bottom, and notId :: a -> a; notId _ = undefined. — Reid Barton

chi chi · Accepted Answer · 2015-06-29T18:37:55

I'm not sure I really understood the point of your question.

But identity in categories must satisfy

id . f = f
g . id = g

for any f,g of the correct types. So id is not just any random function A -> A, it is the one satisfying the requirements above.

Note that, in Hask, we have that for any value a :: A

id . (const a) = const a

hence

id (const a ()) = const a ()

hence

id a = a

So id is really what we expect.

What is the category-theoretical basis for the requirement that the Haskell "id" function must return the same value as passed in?

3 Answers