This repository contains a set of stream processing applications taken from the literature written in Apache Spark which have been cleaned up properly. The applications can be run in a homogeneous manner and their execution collects statistics of throughput (bandwidth) and latency in different ways.
The main developer and maintainer of this repository is Bolot Kasybekov. The supervisors are Gabriele Mencagli and Patrizio Dazzi