How to perform asynchronous streaming computations

How to perform asynchronous streaming computations

In certain situations you need an asynchronous operation with back pressure handled. For those it's as easy as adding mapAsync or mapAsyncUnordered depending on whether ordering for the elements is required or not. You will need to first provide a parallelism parameter to specify the maximum number of simultaneous operations. Then you pass the operation logic in via a function which returns a Future.

  implicit val system = ActorSystem()
  implicit val materializer = ActorMaterializer()
  implicit val ec = system.dispatcher

  def writeToKafka(batch: Seq[Int]): Future[Unit] = Future {
    println(s"Writing ${batch.size} elements to Kafka via thread '${Thread.currentThread().getName}'")
  }

  val mapAsyncStage = Source(1 to 1000000)
    .grouped(100)
    .mapAsync(10)(writeToKafka)
    .runWith(Sink.ignore)
    .onComplete(_ => system.terminate())

For mapAsync - See more in the Java or Scala documentation.

For mapAsyncUnordered - See more in the Java or Scala documentation.


    • Related Articles

    • How to implement batching logic in Akka Streams

      A common request we see with streaming data is the need to take the stream of elements and group them together (i.e. committing data to a database, a message queue or disk). Batching is usually a more efficient and performant solution than writing a ...
    • How to do Rate Limiting in Akka Streams

      In certain scenarios it is important to limit the number of concurrent requests to other services. For example, to avoid overwhelming the services and avoid performance degradation, or to maintain service level agreements. This is particularly ...
    • How to do Throttling in Akka Streams

      When building a streaming application you may find the need to throttle the upstream so as to avoid exceeding a specified rate. Akka Stream's provides the capability to either fail the stream or shape it by applying back pressure. This is simply done ...
    • How to stream multiple stream actions concurrently

      To construct efficient, scalable and low-latency data streams, it is often important to perform tasks concurrently. However Akka Streams executes stages sequentially on a single thread, so you must explicitly request this concurrency. Inserting these ...
    • Error handling and recovery in Akka Streams

      When developing applications you should assume that there will be unexpected issues. For this, Akka provides a set of supervision strategies to deal with errors within your actors. Akka streams is no different, in fact its error handling strategies ...