Why do we need "Algebraic Data Types"?

Question

I've read some explanation of Algebraic Data Types:

These articles give very detail description and code samples.

At first I was thinking Algebraic Data Type was just for defining some types easily and we can match them with pattern matching. But after reading these articles, I found "pattern matching" is not even mentioned there, and the content looks interesting but far more complex than I expected.

So I have some questions (which are not answered in these articles):

Why do we need it, say, in Haskell or Scala?
What we can do if we have it, and what we can't do if we don't have it?

You want tuples to return more than one result and you want sum-types to make the choices you have explicit. (of course this is only the first drop in the ocean - you can write books about this stuff). The link you gave is maybe misleading - you might get the idea that you need this stuff only for ivory-tower-arkane-magic stuff but it's really just a way to manage your data in a sane way (especially if you don't have classes and inheritance and mutating references at your disposal) — Random Dev
@Carsten I feel you understand what I'm confused with. Appreciated your comment and hope you can tell something more :) — Freewind
Just to express the warning that the acronym "ADT" is used also to mean "Abstract Data Type", which is an opposed concept to "Algebraic Data Type". The acronym "ADT" should thus be used with caution, i.e., not at all. — pigworker
If you want to know why people need numbers, don't start with a course in number theory. It won't mention things like balancing your checkbook or counting sheep. I exagerrate but only a little. — n. 1.8e9-where's-my-share m.

dk14 dk14 · Accepted Answer · 2015-08-01T08:23:40

We should start with the Haskell wiki article Algebraic Data Types

And here, shortly, just my vision:

we need them to model business logic in old object oriented way (or actually in old categories-based way), and make it more typesafe as compiler can check that you've matched all possible choices. Or, in other words, that your function is total, not partial. Eventually, it gives the compiler an ability to proof the correctness of your code (that's why sealed traits are recommended). So, the more different types you have the better - btw, generic programming helps you here as it produces more types.
standard features: we can represent type as a "set" of objects, we can match object with type (using pattern matching), or even deconstruct (analyze) it with matchers. We can also dynamically add behavior (in compile-time) to such type with type classess. It's possible for regular class as well, but here it gives us an ability to separate algebraic model (types) from behavior (functions)
we can construct types as products/coproducts of different objects/types. You can think of algebraic type system as a set (or more generally - as a cartesian-closed category). type Boolean = True | False means that Boolean is a union (coproduct) of True and False. Cat(height: Height, weight: Weight) is a "tuple" (more generally - product) of Height and Weight. Product (more-less) represents "part of" relationship from OOP, coproduct - "is" (but in the opposite way).

It also gives us a way to dispatch behaviour in runtime in multimethod-style (like it was in CLOS):

  sealed trait Animal
  case class Cat extends Animal
  case class Dog extends Animal

 def move(c: Animal) = c match {
   case Cat(...) => ...
   case Dog(...) =>
   case a: Animal => ...//if you need default implementation
 }

Haskell-like:

 data Animal = Dog | Cat //coproduct

 let move Dog d = ...
 let move Cat c = ...

Instead of:

trait Animal {
  ...
  def move = ...
}

class Cat(val ...) extends Animal {
  override def move = ...
}

class Dog(val ...) extends Animal {
  override def move = ...
}

P.S. Theoretically, if you're modeling the world in algebraic way and your functions are total and pure - you can proof that your application is correct. If it compiles - it works :).

P.S.2. I should also mention that Scala's type hierarchy which has Any in it isn't so good for typesafety (but good for interop with Java and IO) as it breaks the nice structure defined by GADTs. More than that case class might be both GADT (algebraic) and ADT (abstract) as well, which also reduces the guarantees.

Why do we need "Algebraic Data Types"?

2 Answers