S. A. Noghabi, K. Paramasivam, Y. Pan, N. Ramesh, J. Bringhurst et al., Samza : stateful scalable stream processing at linkedin, Proceedings of the VLDB Endowment, vol.10, p.16341645, 2017.

A. Arasu, B. Babcock, S. Babu, J. Cieslewicz, K. Ito et al., Stream : The stanford data stream management system, 2004.

J. Xu, Z. Chen, J. Tang, and S. Su, T-storm : Trac-aware online scheduling in storm, Distributed Computing Systems (ICDCS), 2014.

, IEEE 34th International Conference on, p.535544, 2014.

. .. Overview-of-solutions,

, Design and implementation of autoscale and autoscale+, p.107

. Overview-of-apache and . .. Storm,

.. .. Experimental,

.. .. Results-on-the-microbenchmark,

.. .. Results,

.. .. Experimental,

. , Results on simple insensitive topology

. , Results on simple sensitive topology

. , Results on complex sensitive topology

. .. Discussion,

. Bibliography-[abadi, The design of the borealis stream processing engine, CIDR, p.277289, 2005.

[. Abadi, Aurora: A new model and architecture for data stream management, The VLDB Journal, vol.12, issue.2, p.120139, 2003.

A. , , 2005.

S. Zdonik, Distributed operation in the borealis stream processing engine, Proceedings of the 2005 ACM SIGMOD International Conference on Management of Data, SIGMOD '05, p.882884, 2005.

. Akidau, Millwheel: fault-tolerant stream processing at internet scale, Proceedings of the VLDB Endowment, vol.6, issue.11, p.10331044, 2013.

. Akidau, The dataow model: A practical approach to balancing correctness, latency, and cost in massive-scale, unbounded, out-of-order data processing, Proceedings of the VLDB Endowment, vol.8, p.17921803, 2015.

[. Amini, Spc: A distributed, scalable platform for data mining, Proceedings of the 4th international workshop on Data mining standards, services and platforms, p.2737, 2006.

. Andrzejak, Decision model for cloud computing under sla constraints, 2010 IEEE International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems, p.257266, 2010.
URL : https://hal.archives-ouvertes.fr/hal-00788868

[. Aniello, Adaptive online scheduling in storm, The 7th ACM International Conference on Distributed Event-Based Systems, DEBS '13, p.207218, 2013.

. Arasu, Stream: The stanford data stream management system, 2004.

, INSA Lyon, tous droits réservés, p.130, 2018.

[. Bibliography and . Arasu, The cql continuous query language: Semantic foundations and query execution, The VLDB Journal, vol.15, issue.2, p.121142, 2006.

[. Babcock, , 2002.

, Models and issues in data stream systems, Proceedings of the Twenty-rst ACM SIGMODSIGACT-SIGART Symposium on Principles of Database Systems, PODS '02, p.116

[. Babcock, Load shedding for aggregation queries over data streams, Proceedings. 20th International Conference on, p.350361, 2004.

W. Babu, S. Babu, and J. Widom, Continuous queries over data streams, ACM Sigmod Record, vol.30, issue.3, p.109120, 2001.

[. Backman, , 2012.

C. , Continuously executing mapreduce workows on multi-core processors, Proceedings of Third International Workshop on MapReduce and Its Applications Date, MapReduce '12, vol.18

[. Balazinska, Load management and high availability in the medusa distributed stream processing engine, Proc. ACM SIGMOD Int. Conf. on Management of Data, p.929930, 2004.

[. Biem, Ibm infosphere streams for scalable, real-time, intelligent transportation services, Proceedings of the 2010 ACM SIGMOD International Conference on Management of data, p.10931104, 2010.

[. Carbone, Apache ink: Stream and batch processing in a single engine, Bulletin of the IEEE Computer Society Technical Committee on Data Engineering, issue.4, p.36, 2015.

[. Cardellini, Elastic stateful stream processing in storm, High Performance Computing & Simulation (HPCS), 2016 International Conference on, p.583590, 2016.

[. Carter, J. L. Carter, and M. N. Wegman, Universal classes of hash functions, Journal of computer and system sciences, vol.18, issue.2, p.143154, 1979.

[. Fernandez, Integrating scale out and fault tolerance in stream processing using operator state management, Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data, SIGMOD '13, p.51, 2003.

[. Chambers, Flumejava: easy, ecient data-parallel pipelines, ACM Sigplan Notices, vol.45, p.363375, 2010.

. Chandrasekaran, Telegraphcq: Continuous dataow processing, Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data, SIGMOD '03, p.668668, 2003.

S. Chaudhuri, An overview of query optimization in relational systems, Proceedings of the Seventeenth ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, PODS '98, p.3443, 1998.

[. Chen, Niagaracq: A scalable continuous query system for internet databases, SIGMOD Rec, vol.29, issue.2, p.379390, 2000.

[. Cherniack, An improved data stream summary: the count-min sketch and its applications, CIDR 2003-First Biennial Conference on Innovative Data Systems Research, vol.55, p.5875, 2003.

. Das, Model-based and model-free approaches to autonomic resource allocation, IBM Ressearch Report, p.23802, 2005.

D. Matteis, T. Mencagli-;-de-matteis, and G. Mencagli, Keep calm and react with foresight: Strategies for low-latency and energy-ecient elastic data stream processing, Proceedings of the 21st ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPoPP '16, vol.13, 2016.

J. Ghemawat-;-dean and S. Ghemawat, Mapreduce: Simplied data processing on large clusters, Proceedings of the 6th Conference on Symposium on Opearting Systems Design & Implementation, vol.6, p.1010, 2004.

A. Deshpande, An initial study of overheads of eddies, SIGMOD Rec, vol.33, issue.1, p.4449, 2004.

. Ding, Optimal operator state migration for elastic data stream processing, 2015.

G. Garcia-molina and H. , Database systems: the complete book, 2008.

B. Gedik, Partitioning functions for stateful data parallelism in stream processing, The VLDB Journal, vol.23, issue.4, p.517539, 2014.

, INSA Lyon, tous droits réservés, p.132, 2018.

. Bibliography-[gedik, Spade: The system s declarative stream processing engine, Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data, SIGMOD '08, p.11231134, 2008.

[. Gedik, Elastic scaling for data stream processing, IEEE Trans. Parallel Distrib. Syst, vol.25, issue.6, p.14471463, 2014.

[. Golab, On Indexing Sliding Windows over Online Data Streams, p.712729, 2004.

G. Graefe-;-graefe, Query evaluation techniques for large databases, ACM Comput. Surv, vol.25, issue.2, p.73169, 1993.

[. Gulisano, Streamcloud: An elastic and scalable data streaming system, IEEE Transactions on Parallel and Distributed Systems, vol.23, issue.12, p.23512365, 2012.
URL : https://hal.archives-ouvertes.fr/lirmm-00748992

[. Heinze, Latencyaware elastic scaling for distributed data stream processing systems, Proceedings of the 8th ACM International Conference on Distributed Event-Based Systems, DEBS '14, p.1322, 2014.

[. Heinze, Autoscaling techniques for elastic data stream processing, Proceedings of the 8th ACM International Conference on Distributed Event-Based Systems, DEBS '14, p.318321, 2014.

[. Hirzel, A catalog of stream processing optimizations, ACM Comput. Surv, vol.46, issue.4, pp.46-146, 2014.

[. Hochreiner, Elastic stream processing for distributed environments, IEEE Internet Computing, vol.19, issue.6, p.5459, 2015.

[. Hummer, Elastic stream processing in the cloud, Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, vol.3, issue.5, p.333345, 2013.

S. Ishii, A. Ishii, and T. Suzumura, Elastic stream computing with clouds, Cloud Computing (CLOUD), 2011 IEEE International Conference on, pp.195-202, 2011.

[. Jain, Design, implementation, and evaluation of the linear road bnchmark on the stream processing core, Proceedings of the 2006 ACM SIGMOD international conference on Management of data, p.431442, 2006.

[. Jain, Towards a streaming sql standard. Proc. VLDB Endow, vol.1, p.13791390, 2008.

Q. Jiang and S. Chakravarthy, Queueing analysis of relational operators for continuous data streams, Proceedings of the Twelfth International Conference on Information and Knowledge Management, CIKM '03, p.271278, 2003.

V. V. Kalashnikov, Mathematical methods in queuing theory, vol.271, 2013.

[. Kang, Evaluating window joins over unbounded streams, Proceedings 19th International Conference on Data Engineering (Cat. No.03CH37405), p.341352, 2003.

R. M. Karp-;-karp, A survey of parallel algorithms for shared-memory machines, 1988.

D. G. Kendall-;-kendall, Stochastic processes occurring in the theory of queues and their analysis by the method of the imbedded markov chain. The Annals of Mathematical Statistics, p.338354, 1953.

[. Lei, C. Rundensteiner-;-lei, and E. A. Rundensteiner, Robust distributed query processing for streaming data, ACM Trans. Database Syst, vol.39, issue.2, 2014.

M. Li, W. Li, and A. Mcleod, Distribution of the residual autocorrelations in multivariate arma time series models, Journal of the Royal Statistical Society. Series B, p.231239, 1981.

. Lin, Pacica: Replication in log-based distributed storage systems, 2008.

. Logothetis, In-situ mapreduce for log processing, Proceedings of the 2011 USENIX Conference on USENIX Annual Technical Conference, USENIXATC'11, p.99, 2011.

[. Lohrmann, Elastic stream processing with latency guarantees, Distributed Computing Systems (ICDCS), p.399410, 2015.

. Lorido-botran, , 2014.

, A review of auto-scaling techniques for elastic applications in cloud environments, J. Grid Comput, vol.12, issue.4, p.559592

[. Madden, Continuously adaptive continuous queries over streams, Proceedings of the 2002 ACM SIGMOD International Conference on Management of Data, SIGMOD '02, p.4960, 2002.

[. Maurer, Enacting SLAs in Clouds Using Rules, p.455466, 2011.

D. Mehta, M. Dewitt, and D. J. , Managing intra-operator parallelism in parallel database systems, Proceedings of the 21th International Conference on, 1995.

, INSA Lyon, tous droits réservés 134 Bibliography Very Large Data Bases, VLDB '95, p.382394, 2018.

E. Mishra, P. Mishra, and M. H. Eich, Join processing in relational databases, ACM Comput. Surv, vol.24, issue.1, p.63113, 1992.
DOI : 10.1145/128762.128764

. Murugesan, S. Gangadharan-;-murugesan, and G. Gangadharan, Harnessing green IT: Principles and practices, 2012.

M. A. Nasir, Fault tolerance for stream processing engines, 2016.

[. Neumeyer, S4: Distributed stream computing platform, Data Mining Workshops (ICDMW), 2010 IEEE International Conference on, p.170177, 2010.
DOI : 10.1109/icdmw.2010.172

[. Nikolov, Cloudfarm: An elastic cloud platform with exible and adaptive resource management, Proceedings of the 2014 IEEE/ACM 7th International Conference on Utility and Cloud Computing, UCC '14, p.547553, 2014.
DOI : 10.1109/ucc.2014.84

[. Noghabi, Samza: stateful scalable stream processing at linkedin. Proceedings of the VLDB Endowment, vol.10, p.16341645, 2017.

[. Pearce, Quantifying the eectiveness of load balance algorithms, Proceedings of the 26th ACM International Conference on Supercomputing, ICS '12, p.185194, 2012.

. Peng, , 2015.

R. , Resource-aware scheduling in storm, Proceedings of the 16th Annual Middleware Conference, p.149161, 2015.

[. Petit, An algebric window model for data stream management, Proceedings of the Ninth ACM International Workshop on Data Engineering for Wireless and Mobile Access, MobiDE '10, p.1724, 2007.
DOI : 10.1145/1850822.1850826

URL : https://hal.archives-ouvertes.fr/hal-00953420

[. Qiao, Supporting sliding window queries for continuous data streams, 15th International Conference on Scientic and Statistical Database Management, p.8594, 2003.

[. Rivetti, Online scheduling for shue grouping in distributed stream processing systems research paper, ACM/IFIP/USENIX Middleware, 2016.

[. Rivetti, , 2015.

, Ecient key grouping for near-optimal load balancing in stream processing systems, Proceedings of the 9th ACM International Conference on Distributed Event-Based Systems, DEBS '15, p.8091

A. Rosenthal, U. S. Chakravarthy, B. Blaustein, and J. Blakely, INSA Lyon, tous droits réservés, 1989.

, Situation monitoring for active databases, Proceedings of the 15th International Conference on Very Large Data Bases, VLDB '89, p.455464

B. Sattler, K. Sattler, and F. Beier, Towards elastic stream processing: Patterns and infrastructure, BD3@VLDB, volume 1018 of CEUR Workshop Proceedings, 2013.

[. Satzger, Esc: Towards an elastic stream computing platform for the cloud, Cloud Computing (CLOUD), 2011 IEEE International Conference on, p.348355, 2011.

[. Schneider, , 2009.

, Elastic scaling of data parallel operators in stream processing, Parallel Distributed Processing, p.112, 2009.

[. Schreier, Alert: An architecture for transforming a passive dbms into an active dbms, Proceedings of the 17th International Conference on Very Large Data Bases, VLDB '91, p.469478, 1991.

[. Sedaghat, A virtual machine re-packing approach to the horizontal vs. vertical elasticity trade-o for cloud autoscaling, Proceedings of the 2013 ACM Cloud and Autonomic Computing Conference, CAC '13, vol.6, 2013.

A. Shukla, Y. Simmhan, and R. Stephens, A survey of stream processing, Toward reliable and rapid elasticity for streaming dataows on clouds, vol.34, p.491541, 1997.

[. Stonebraker, The 8 requirements of real-time stream processing, SIGMOD Rec, vol.34, issue.4, p.4247, 2005.

. Tatbul, Staying t: Ecient load shedding techniques for distributed stream processing, Proceedings of the 33rd International Conference on Very Large Data Bases, VLDB '07, p.159170, 2007.

. Tatbul, Load shedding in a data stream manager, Proceedings of the 29th International Conference on Very Large Data Bases, vol.29, p.309320, 2003.

N. Tatbul and S. Zdonik, Window-aware load shedding for aggregation queries over data streams, Proceedings of the 32Nd International Conference on Very Large Data Bases, VLDB '06, p.799810, 2006.

[. Tucker, Exploiting punctuation semantics in continuous data streams. Knowledge and Data Engineering, IEEE Transactions on, vol.15, issue.3, p.555568, 2003.

, INSA Lyon, tous droits réservés, p.136, 2018.

[. Bibliography and . Welsh, Seda: an architecture for wellconditioned, scalable internet services, PASCO'94First International Symposium on Parallel Symbolic Computation, vol.35, p.230243, 1994.

[. Wu, Challenges and experience in prototyping a multi-modal stream analytic and monitoring application on system s, Proceedings of the 33rd international conference on Very large data bases, p.11851196, 2007.

T. ;. Wu, Y. Wu, and K. Tan, Chronostream: Elastic stateful stream computation in the cloud, IEEE 31st International Conference on, p.723734, 2015.

P. Xu, . Xu, and G. Peng, Stela: Enabling stream processing systems to scale-in and scale-out on-demand, Proc. IEEE International Conference on Cloud Engineering (IC2E), 2016.

[. Xu, T-storm: Trac-aware online scheduling in storm, Distributed Computing Systems (ICDCS), p.535544, 2014.

Y. , Sonora: A platform for continuous mobile-cloud computing, 2012.

[. Zaharia, Resilient distributed datasets: A faulttolerant abstraction for in-memory cluster computing, Proceedings of the 9th USENIX conference on Networked Systems Design and Implementation, p.22, 2012.

[. Zaharia, Discretized streams: A fault-tolerant model for scalable stream processing, 2012.

Q. Zhu and G. Agrawal, Resource provisioning with budget constraints for adaptive applications in cloud environments, IEEE Transactions on Services Computing, vol.5, issue.4, p.497511, 2012.