I am running Flink on only one node with Parallelism = 1 in order to compare its performance with a single-threaded application. I'm wondering if Flink is still using a Shuffle although it does not run in parallel. So if e.g. the following command is executed:
var counts = text.flatMap { _.toLowerCase.split("\\W+") filter { _.nonEmpty } }
.map { (_, 1) }
.groupBy(0)
.sum(1)
Will a Shuffle be used before the groupBy? And is there a way to check this?
(In the output of the Interactive Scala Shell there is a FlatMap, Map, Combine and finally a Reduce to observe. The same applies when running with Parallelism > 1.)