Consider the Functor
type class in Haskell, where f
is a higher-kinded type variable:
class Functor f where
fmap :: (a -> b) -> f a -> f b
What this type signature says is that fmap changes the type parameter of an f
from a
to b
, but leaves f
as it was. So if you use fmap
over a list you get a list, if you use it over a parser you get a parser, and so on. And these are static, compile-time guarantees.
I don't know F#, but let's consider what happens if we try to express the Functor
abstraction in a language like Java or C#, with inheritance and generics, but no higher-kinded generics. First try:
interface Functor<A> {
Functor<B> map(Function<A, B> f);
}
The problem with this first try is that an implementation of the interface is allowed to return any class that implements Functor
. Somebody could write a FunnyList<A> implements Functor<A>
whose map
method returns a different kind of collection, or even something else that's not a collection at all but is still a Functor
. Also, when you use the map
method you can't invoke any subtype-specific methods on the result unless you downcast it to the type that you're actually expecting. So we have two problems:
- The type system doesn't allow us to express the invariant that the
map
method always returns the same Functor
subclass as the receiver.
- Therefore, there's no statically type-safe manner to invoke a non-
Functor
method on the result of map
.
There are other, more complicated ways you can try, but none of them really works. For example, you could try augment the first try by defining subtypes of Functor
that restrict the result type:
interface Collection<A> extends Functor<A> {
Collection<B> map(Function<A, B> f);
}
interface List<A> extends Collection<A> {
List<B> map(Function<A, B> f);
}
interface Set<A> extends Collection<A> {
Set<B> map(Function<A, B> f);
}
interface Parser<A> extends Functor<A> {
Parser<B> map(Function<A, B> f);
}
// …
This helps to forbid implementers of those narrower interfaces from returning the wrong type of Functor
from the map
method, but since there is no limit to how many Functor
implementations you can have, there is no limit to how many narrower interfaces you'll need.
(EDIT: And note that this only works because Functor<B>
appears as the result type, and so the child interfaces can narrow it. So AFAIK we can't narrow both uses of Monad<B>
in the following interface:
interface Monad<A> {
<B> Monad<B> flatMap(Function<? super A, ? extends Monad<? extends B>> f);
}
In Haskell, with higher-rank type variables, this is (>>=) :: Monad m => m a -> (a -> m b) -> m b
.)
Yet another try is to use recursive generics to try and have the interface restrict the result type of the subtype to the subtype itself. Toy example:
/**
* A semigroup is a type with a binary associative operation. Law:
*
* > x.append(y).append(z) = x.append(y.append(z))
*/
interface Semigroup<T extends Semigroup<T>> {
T append(T arg);
}
class Foo implements Semigroup<Foo> {
// Since this implements Semigroup<Foo>, now this method must accept
// a Foo argument and return a Foo result.
Foo append(Foo arg);
}
class Bar implements Semigroup<Bar> {
// Any of these is a compilation error:
Semigroup<Bar> append(Semigroup<Bar> arg);
Semigroup<Foo> append(Bar arg);
Semigroup append(Bar arg);
Foo append(Bar arg);
}
But this sort of technique (which is rather arcane to your run-of-the-mill OOP developer, heck to your run-of-the-mill functional developer as well) still can't express the desired Functor
constraint either:
interface Functor<FA extends Functor<FA, A>, A> {
<FB extends Functor<FB, B>, B> FB map(Function<A, B> f);
}
The problem here is this doesn't restrict FB
to have the same F
as FA
—so that when you declare a type List<A> implements Functor<List<A>, A>
, the map
method can still return a NotAList<B> implements Functor<NotAList<B>, B>
.
Final try, in Java, using raw types (unparametrized containers):
interface FunctorStrategy<F> {
F map(Function f, F arg);
}
Here F
will get instantiated to unparametrized types like just List
or Map
. This guarantees that a FunctorStrategy<List>
can only return a List
—but you've abandoned the use of type variables to track the element types of the lists.
The heart of the problem here is that languages like Java and C# don't allow type parameters to have parameters. In Java, if T
is a type variable, you can write T
and List<T>
, but not T<String>
. Higher-kinded types remove this restriction, so that you could have something like this (not fully thought out):
interface Functor<F, A> {
<B> F<B> map(Function<A, B> f);
}
class List<A> implements Functor<List, A> {
// Since F := List, F<B> := List<B>
<B> List<B> map(Function<A, B> f) {
// ...
}
}
And addressing this bit in particular:
(I think) I get that instead of myList |> List.map f
or myList |> Seq.map f |> Seq.toList
higher kinded types allow you to simply write myList |> map f
and it'll return a List
. That's great (assuming it's correct), but seems kind of petty? (And couldn't it be done simply by allowing function overloading?) I usually convert to Seq
anyway and then I can convert to whatever I want afterwards.
There are many languages that generalize the idea of the map
function this way, by modeling it as if, at heart, mapping is about sequences. This remark of yours is in that spirit: if you have a type that supports conversion to and from Seq
, you get the map operation "for free" by reusing Seq.map
.
In Haskell, however, the Functor
class is more general than that; it isn't tied to the notion of sequences. You can implement fmap
for types that have no good mapping to sequences, like IO
actions, parser combinators, functions, etc.:
instance Functor IO where
fmap f action =
do x <- action
return (f x)
-- This declaration is just to make things easier to read for non-Haskellers
newtype Function a b = Function (a -> b)
instance Functor (Function a) where
fmap f (Function g) = Function (f . g) -- `.` is function composition
The concept of "mapping" really isn't tied to sequences. It's best to understand the functor laws:
(1) fmap id xs == xs
(2) fmap f (fmap g xs) = fmap (f . g) xs
Very informally:
- The first law says that mapping with an identity/noop function is the same as doing nothing.
- The second law says that any result that you can produce by mapping twice, you can also produce by mapping once.
This is why you want fmap
to preserve the type—because as soon as you get map
operations that produce a different result type, it becomes much, much harder to make guarantees like this.
IMonad<T>
and then cast it back to e.g.IEnumerable<int>
orIObservable<int>
when you're done? Is this all just to avoid casting? – lobsterismreturn
would work since that really belongs to the monad type, not a particular instance so you wouldn't want to put it in theIMonad
interface at all. – Leebind
akaSelectMany
etc too. Which means someone could use the API tobind
anIObservable
to anIEnumerable
and assume it would work, which yeah yuck if that's the case and there's no way around that. Just not 100% sure there's no way around it. – lobsterism