Conference papers

AdCom: Adaptive Combiner for Streaming Aggregations

Abstract : Continuous applications such as device monitoring and anomaly detection often require real-time aggregated statistics over unbounded data streams. Existing stream processing systems such as Flink, Spark, and Storm support streaming aggregations, but their optimizations take limited account of the dynamic nature of the data and are therefore suboptimal when the workload changes or when the data is skewed. In this paper we present AdCom, an adaptive combiner for stream processing engines. Using AdCom in aggregation queries enables pre-aggregating tuples upstream (i.e., before data shuffling), followed by global aggregation downstream. In contrast to existing approaches, AdCom automatically adjusts the number of tuples to pre-aggregate depending on the data rate and available network. Our experimental study using real-world streaming workloads shows that using AdCom leads to 2.5-9× higher sustainable throughput without compromising latency.
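The two-phase pattern described in the abstract (partial aggregation upstream, global aggregation downstream) can be illustrated with a minimal Python sketch. This is not AdCom's implementation: the class `AdaptiveCombiner`, its `adapt` method, the doubling/halving rule, and the boolean `backpressure` signal are all hypothetical stand-ins, assumed here only to show how a combiner might change the number of tuples it pre-aggregates before shuffling.

```python
from collections import defaultdict


class AdaptiveCombiner:
    """Toy upstream combiner: sums values per key and emits partial sums."""

    def __init__(self, emit, batch_size=4, min_batch=1, max_batch=1024):
        self.emit = emit              # callback standing in for the shuffle
        self.batch_size = batch_size  # tuples to pre-aggregate before a flush
        self.min_batch = min_batch
        self.max_batch = max_batch
        self.partials = defaultdict(int)
        self.buffered = 0

    def process(self, key, value):
        # Fold the tuple into the per-key partial aggregate.
        self.partials[key] += value
        self.buffered += 1
        if self.buffered >= self.batch_size:
            self.flush()

    def flush(self):
        # Send one partial aggregate per key downstream, then reset.
        for key, partial in self.partials.items():
            self.emit(key, partial)
        self.partials.clear()
        self.buffered = 0

    def adapt(self, backpressure):
        # Hypothetical rule (an assumption, not the paper's policy):
        # combine more tuples when the network is congested, fewer otherwise.
        if backpressure:
            self.batch_size = min(self.batch_size * 2, self.max_batch)
        else:
            self.batch_size = max(self.batch_size // 2, self.min_batch)


def run_demo():
    totals = defaultdict(int)  # downstream global aggregation state
    shuffled = []              # records that crossed the simulated shuffle

    def emit(key, partial):
        shuffled.append((key, partial))
        totals[key] += partial

    combiner = AdaptiveCombiner(emit, batch_size=4)
    stream = [("a", 1), ("b", 2), ("a", 3), ("a", 4), ("b", 5), ("a", 6)]
    for key, value in stream:
        combiner.process(key, value)
    combiner.flush()  # drain whatever is still buffered
    return dict(totals), len(shuffled)
```

In this toy run, six input tuples cross the simulated shuffle as four partial aggregates, and the downstream totals match what a direct per-key sum would give. A larger `batch_size` sends fewer records over the network, at the cost of holding tuples longer before they are emitted.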

https://hal.archives-ouvertes.fr/hal-03156337
Contributor : Guillaume Pierre
Submitted on : Tuesday, March 2, 2021 - 1:57:00 PM
Last modification on : Tuesday, March 9, 2021 - 3:06:50 AM
Long-term archiving on : Monday, May 31, 2021 - 6:50:56 PM

File

main.pdf
Publisher files allowed on an open archive

Identifiers

  • HAL Id : hal-03156337, version 1

Citation

Felipe Gutierrez, Kaustubh Beedkar, Abel Souza, Volker Markl. AdCom: Adaptive Combiner for Streaming Aggregations. EDBT 2021 - 24th International Conference on Extending Database Technology, Mar 2021, Nicosia, Cyprus. ⟨hal-03156337⟩
