Zip two flows in Akka Streams

Question

I have two flows:

val a: Flow[Input, Data, NotUsed] =...
val b: Flow[Input, Unit, NotUsed] =...

The first flow is a stream of data events I care about, the second is a stream of "signals", that is I really only want to send a Data downstream when an element is received in b.

I thought of using something like a.zipWith(b)((fromA, fromB) => fromA) but that seems to work only between a flow and a source (despite the Akka documentation implying that it supports zipping flows too).

What am I missing?

Thanks

Leo C Leo C · Accepted Answer · 2019-07-04T16:31:33

If you look at the signatures of zip and zipWith:

def zip[U](that: Graph[SourceShape[U], _]): Repr[(Out, U)]

def zipWith[Out2, Out3](that: Graph[SourceShape[Out2], _])(combine: (Out, Out2) => Out3): Repr[Out3]

both methods expect a Source.

Zipping a Flow with another Flow would not be as trivial as one might think (e.g. the 2nd Flow might be producing multiple elements per input element with mapConcat).

You could consider building a custom GraphStage as shown in the following trivialized example:

case class DataIn(id: Int)
case class DataOut(content: String)
case class Signal(s: Int)

class ZipperFlow extends GraphStage[FlowShape[(DataIn, Signal), DataOut]] {

  val in = Inlet[(DataIn, Signal)]("ZipperFlow.in")
  val out = Outlet[DataOut]("ZipperFlow.out")

  override val shape = FlowShape.of(in, out)

  override def createLogic(attr: Attributes): GraphStageLogic =
    new GraphStageLogic(shape) {
      setHandler(in, new InHandler {
        override def onPush(): Unit = {
          push(out, DataOut("content-" + grab(in)._1.id))
        }
      })
      setHandler(out, new OutHandler {
        override def onPull(): Unit = {
          pull(in)
        }
      })
    }
}

Testing ZipperFlow:

implicit val system = ActorSystem("system")
implicit val materializer = ActorMaterializer()
implicit val ec = system.dispatcher

val dataSource = Source(1 to 5).map(DataIn(_))
val signalSource = Source(1 to 5).map(Signal(_))

val sink: Sink[DataOut, Future[Done]] = Sink.foreach(println)

dataSource.zip(signalSource).via(new ZipperFlow).runWith(sink)

// DataOut(content-1)
// DataOut(content-2)
// DataOut(content-3)
// DataOut(content-4)
// DataOut(content-5)

Zip two flows in Akka Streams

3 Answers