Yes, in particular GHC performs strictness analysis, which can drastically reduce space usage of a program with unintended laziness from O(n) to O(1).
For example, consider this trivial program:
$ cat LazySum.hs
main = print $ sum [1..100000]
Since sum
does not assume that the addition operator is strict, (it might be used with a Num
instance for which (+)
is lazy), it will cause a large number of thunks to be allocated. Without optimizations enabled, strictness analysis is not performed.
$ ghc --make LazySum.hs -rtsopts -fforce-recomp
[1 of 1] Compiling Main ( LazySum.hs, LazySum.o )
Linking LazySum ...
$ ./LazySum +RTS -s
./LazySum +RTS -s
5000050000
22,047,576 bytes allocated in the heap
18,365,440 bytes copied during GC
6,348,584 bytes maximum residency (4 sample(s))
3,133,528 bytes maximum slop
15 MB total memory in use (0 MB lost due to fragmentation)
Generation 0: 23 collections, 0 parallel, 0.04s, 0.03s elapsed
Generation 1: 4 collections, 0 parallel, 0.01s, 0.02s elapsed
INIT time 0.00s ( 0.00s elapsed)
MUT time 0.01s ( 0.03s elapsed)
GC time 0.05s ( 0.04s elapsed)
EXIT time 0.00s ( 0.00s elapsed)
Total time 0.06s ( 0.07s elapsed)
%GC time 83.3% (58.0% elapsed)
Alloc rate 2,204,757,600 bytes per MUT second
Productivity 16.7% of total user, 13.7% of total elapsed
However, if we compile with optimizations enabled, the strictness analyzer will determine that since we're using the addition operator for Integer
, which is known to be strict, the compiler knows that it is safe to evaluate the thunks ahead of time, and so the program runs in constant space.
$ ghc --make -O2 LazySum.hs -rtsopts -fforce-recomp
[1 of 1] Compiling Main ( LazySum.hs, LazySum.o )
Linking LazySum ...
$ ./LazySum +RTS -s
./LazySum +RTS -s
5000050000
9,702,512 bytes allocated in the heap
8,112 bytes copied during GC
27,792 bytes maximum residency (1 sample(s))
20,320 bytes maximum slop
1 MB total memory in use (0 MB lost due to fragmentation)
Generation 0: 18 collections, 0 parallel, 0.00s, 0.00s elapsed
Generation 1: 1 collections, 0 parallel, 0.00s, 0.00s elapsed
INIT time 0.00s ( 0.00s elapsed)
MUT time 0.01s ( 0.02s elapsed)
GC time 0.00s ( 0.00s elapsed)
EXIT time 0.00s ( 0.00s elapsed)
Total time 0.01s ( 0.02s elapsed)
%GC time 0.0% (2.9% elapsed)
Alloc rate 970,251,200 bytes per MUT second
Productivity 100.0% of total user, 55.0% of total elapsed
Note that we can get constant space even without optimizations, if we add the strictness ourselves:
$ cat StrictSum.hs
import Data.List (foldl')
main = print $ foldl' (+) 0 [1..100000]
$ ghc --make StrictSum.hs -rtsopts -fforce-recomp
[1 of 1] Compiling Main ( StrictSum.hs, StrictSum.o )
Linking StrictSum ...
$ ./StrictSum +RTS -s
./StrictSum +RTS -s
5000050000
9,702,664 bytes allocated in the heap
8,144 bytes copied during GC
27,808 bytes maximum residency (1 sample(s))
20,304 bytes maximum slop
1 MB total memory in use (0 MB lost due to fragmentation)
Generation 0: 18 collections, 0 parallel, 0.00s, 0.00s elapsed
Generation 1: 1 collections, 0 parallel, 0.00s, 0.00s elapsed
INIT time 0.00s ( 0.00s elapsed)
MUT time 0.00s ( 0.01s elapsed)
GC time 0.00s ( 0.00s elapsed)
EXIT time 0.00s ( 0.00s elapsed)
Total time 0.00s ( 0.01s elapsed)
%GC time 0.0% (2.1% elapsed)
Alloc rate 9,702,664,000,000 bytes per MUT second
Productivity 100.0% of total user, 0.0% of total elapsed
Strictness tends to be a bigger issue than tail calls, which aren't really a useful concept in Haskell, because of Haskell's evaluation model.