,
, Cascading | Application Platform for Enterprise Big Data
,
,
,
,
, Nagios-the industry standard in it infrastructure monitoring
,
,
, Puma benchmarks and dataset downloads
,
,
,
, , 2016.
, , 2017.
, , 2017.
Conditional heteroscedasticity in time series of stock returns: Evidence and forecasts, Journal of business, pp.55-80, 1989. ,
Millwheel: faulttolerant stream processing at internet scale, Proceedings of the VLDB Endowment, vol.6, pp.1033-1044, 2013. ,
The dataflow model: A practical approach to balancing correctness, latency, and cost in massive-scale, unbounded, out-of-order data processing, Proceedings of the VLDB Endowment, vol.8, pp.1792-1803, 2015. ,
Osgi-the dynamic module system for java, 2009. ,
Photon: Fault-tolerant and scalable joining of continuous data streams, Proceedings of the 2013 ACM SIGMOD international conference on management of data, pp.577-588, 2013. ,
The Long Tail: Why the Future of Business Is Selling Less of More. Hyperion, 2006. ,
Adaptive online scheduling in storm, Proceedings of the 7th ACM International Conference on Distributed Event-based Systems, DEBS '13, pp.207-218, 2013. ,
A view of cloud computing, Communications of the ACM, vol.53, issue.4, pp.50-58, 2010. ,
Spark sql: Relational data processing in spark, Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, pp.1383-1394, 2015. ,
The best code is no code at all, 2007. ,
Towards automatic optimization of mapreduce programs, Proceedings of the 1st ACM symposium on Cloud computing, pp.137-142, 2010. ,
Dynasore: Efficient in-memory store for social applications, Middleware 2013-ACM/IFIP/USENIX 14th International Middleware Conference, Beijing, pp.425-444, 2013. ,
URL : https://hal.archives-ouvertes.fr/hal-00932468
Xen and the art of virtualization, ACM SIGOPS operating systems review, vol.37, pp.164-177, 2003. ,
The datacenter as a computer: An introduction to the design of warehouse-scale machines, Synthesis lectures on computer architecture, vol.8, issue.3, pp.1-154, 2013. ,
Learnable programming: Blocks and beyond, Commun. ACM, vol.60, issue.6, pp.72-80, 2017. ,
A survey on retail sales forecasting and prediction in fashion markets, Systems Science & Control Engineering, vol.3, issue.1, pp.154-161, 2015. ,
Qemu, a fast and portable dynamic translator, USENIX Annual Technical Conference, FREENIX Track, pp.41-46, 2005. ,
Machine learning strategies for time series forecasting, Business Intelligence, pp.62-77, 2013. ,
Time series analysis: forecasting and control. Holden-Day series in time series analysis, 1970. ,
Random forests, Machine Learning, vol.45, pp.5-32, 2001. ,
The fractal component model and its support in java. Software: Practice and Experience, vol.36, pp.1257-1284, 2006. ,
Geoscope: Online detection of geo-correlated information trends in social networks, Proc. VLDB Endow, vol.7, pp.229-240, 2013. ,
, ACM Queue, vol.14, pp.70-93, 2016.
Weather forecasting for weather derivatives, Journal of the American Statistical Association, vol.100, pp.6-16, 2005. ,
Online metrics prediction in monitoring systems, 2018 IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS), 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-02006574
Locality-aware routing in stateful streaming applications, Proceedings of the 17th International Middleware Conference, Middleware '16, vol.4, pp.1-4, 2016. ,
URL : https://hal.archives-ouvertes.fr/hal-01407457
Apache flink: Stream and batch processing in a single engine, Bulletin of the IEEE Computer Society Technical Committee on Data Engineering, vol.36, issue.4, 2015. ,
Distributed qosaware scheduling in storm, Proceedings of the 9th ACM International Conference on Distributed Event-Based Systems, DEBS '15, pp.344-347, 2015. ,
Failure prediction of data centers using time series and fault tree analysis, 2012 IEEE 18th International Conference on Parallel and Distributed Systems, pp.794-799, 2012. ,
, The Analysis of Time Series: An Introduction, Sixth Edition. Chapman & Hall/CRC Texts in Statistical Science, 2003.
Temperature prediction using fuzzy time series, IEEE Transactions on Systems, Man, and Cybernetics, vol.30, issue.2, pp.263-275, 2000. ,
Time series forecasting using rnns: an extended attention mechanism to model periods and handle missing values, 2017. ,
Predicting performance and quantifying corporate governance risk for latin american adrs and banks. Financial Engineering and Applications, 2004. ,
Schism: A workloaddriven approach to database replication and partitioning, Proc. VLDB Endow, vol.3, issue.1-2, pp.48-57, 2010. ,
Optimizing shuffle performance in spark, 2013. ,
25 years of time series forecasting, International journal of forecasting, vol.22, issue.3, pp.443-473, 2006. ,
Mapreduce: Simplified data processing on large clusters, OSDI, pp.137-150, 2004. ,
Orange: Data mining toolbox in python, Journal of Machine Learning Research, vol.14, pp.2349-2353, 2013. ,
Workload scheduling in distributed stream processors using graph partitioning, 2015 IEEE International Conference on Big Data, Big Data, pp.124-133, 2015. ,
The berkeley data analytics stack: Present and future, Big Data, 2013 IEEE International Conference on, pp.2-3, 2013. ,
, , 2013.
Financial time series prediction using least squares support vector machines within the evidence framework, IEEE Transactions on Neural Networks, vol.12, issue.4, pp.809-821, 2001. ,
A multivariate timeseries modeling approach to severity of illness assessment and forecasting in icu with sparse, heterogeneous clinical data, Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, AAAI'15, pp.446-453, 2015. ,
Noisy time series prediction using recurrent neural networks and grammatical inference, Machine learning, vol.44, issue.1, pp.161-183, 2001. ,
Graphx: Graph processing in a distributed dataflow framework, 11th USENIX Symposium on Operating Systems Design and Implementation (OSDI 14), pp.599-613, 2014. ,
, Javabeans. API Specification, 1997.
Time series analysis, vol.2, 1994. ,
Snap! (build your own blocks) (abstract only), Proceedings of the 45th ,
DOI : 10.1145/2445196.2445507
, ACM Technical Symposium on Computer Science Education, SIGCSE '14, pp.749-749, 2014.
Profiling, what-if analysis, and cost-based optimization of mapreduce programs, vol.4, pp.1111-1122, 2011. ,
Mesos: A platform for fine-grained resource sharing in the data center, NSDI, vol.11, pp.22-22, 2011. ,
Zookeeper: Wait-free coordination for internet-scale systems, USENIX annual technical conference, vol.8, 2010. ,
Automatic optimization for mapreduce programs, Proceedings of the VLDB Endowment, vol.4, pp.385-396, 2011. ,
Understanding the behavior of in-memory computing workloads, 2014 IEEE International Symposium on Workload Characterization (IISWC), pp.22-30, 2014. ,
, Occupy the cloud: Distributed computing for the 99%, 2017.
,
Jails: Confining the omnipotent root, Proceedings of the 2nd International SANE Conference, vol.43, p.116, 2000. ,
A fast and high quality multilevel scheme for partitioning irregular graphs, SIAM J. Sci. Comput, vol.20, issue.1, pp.359-392, 1998. ,
Cola: Optimizing stream processing applications via graph partitioning, Proceedings of the 10th ACM/IFIP/USENIX International Conference on Middleware, Middleware '09, vol.16, pp.1-16, 2009. ,
Literate programming, The Computer Journal, vol.27, issue.2, pp.97-111, 1984. ,
Twitter heron: Stream processing at scale, Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, SIGMOD '15, pp.239-250, 2015. ,
Cassandra: a decentralized structured storage system, ACM SIGOPS Operating Systems Review, vol.44, issue.2, pp.35-40, 2010. ,
Distribution, 1987. ,
Monitoring, prediction and prevention of sla violations in composite services, 2010 IEEE International Conference on Web Services, pp.369-376, 2010. ,
Thermocast: A cyber-physical forecasting model for datacenters, Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD '11, pp.1370-1378, 2011. ,
Big Data: Principles and Best Practices of Scalable Realtime Data Systems, 2015. ,
Regression and time series model selection, 1998. ,
The nist definition of cloud computing, 2011. ,
Mllib: Machine learning in apache spark, Journal of Machine Learning Research, vol.17, issue.34, pp.1-7, 2016. ,
Docker: lightweight linux containers for consistent development and deployment, Linux Journal, issue.239, 2014. ,
A digital signature based on a conventional encryption function, Conference on the Theory and Application of Cryptographic Techniques, pp.369-378, 1987. ,
Efficient computation of frequent and top-k elements in data streams, Proceedings of the 10th International Conference on Database Theory, ICDT'05, pp.398-412, 2005. ,
Predicting time series with support vector machines, International Conference on Artificial Neural Networks, pp.999-1004, 1997. ,
The power of both choices: Practical load balancing for distributed stream processing engines, 31st IEEE International Conference on Data Engineering, ICDE, pp.137-148, 2015. ,
When two choices are not enough: Balancing at scale in distributed stream processing, 32nd IEEE International Conference on Data Engineering, ICDE, 2015. ,
S4: Distributed stream computing platform, 2010 IEEE International Conference on Data Mining Workshops, pp.170-177, 2010. ,
, Samza: stateful scalable stream processing at linkedin. Proceedings of the VLDB Endowment, vol.10, pp.1634-1645, 2017.
Making sense of performance in data analytics frameworks, 12th USENIX Symposium on Networked Systems Design and Implementation (NSDI 15), pp.293-307, 2015. ,
Scikit-learn: Machine learning in Python, Journal of Machine Learning Research, vol.12, pp.2825-2830, 2011. ,
URL : https://hal.archives-ouvertes.fr/hal-00650905
R-storm: Resource-aware scheduling in storm, Proceedings of the 16th Annual Middleware Conference, Middleware '15, pp.149-161, 2015. ,
Kubernetes-Scheduling the Future at Cloud Scale, 2015. ,
Scratch: Programming for all, Commun. ACM, vol.52, issue.11, pp.60-67, 2009. ,
Streampipes: solving the challenge with semantic stream processing pipelines, Proceedings of the 9th ACM International Conference on Distributed Event-Based Systems, pp.330-331, 2015. ,
Efficient key grouping for near-optimal load balancing in stream processing systems, Proceedings of the 9th ACM International Conference on Distributed Event-Based Systems, DEBS '15, pp.80-91, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-01194518
A computer scientist's view of life, the universe, and everything, Foundations of computer science, pp.201-208, 1997. ,
Omega: flexible, scalable schedulers for large compute clusters, SIGOPS European Conference on Computer Systems (EuroSys), pp.351-364, 2013. ,
A cloud service architecture for analyzing big monitoring data, Tsinghua Science and Technology, vol.21, issue.1, pp.55-70, 2016. ,
Keystoneml: Optimizing pipelines for large-scale advanced analytics, 2017 IEEE 33rd International Conference on Data Engineering (ICDE), pp.535-546, 2017. ,
E-store: Fine-grained elastic partitioning for distributed transaction processing, Proc. VLDB Endow, vol.8, pp.245-256, 2014. ,
, The Apache Spark developers. ML Pipelines, 2017.
, The Apache Storm developers, 2017.
, Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data, SIGMOD '14, pp.147-156, 2014.
The nature of statistical learning theory, 1995. ,
Large-scale cluster management at Google with Borg, Proceedings of the European Conference on Computer Systems (EuroSys), 2015. ,
Corba: integrating diverse applications within distributed heterogeneous environments, IEEE Communications magazine, vol.35, issue.2, pp.46-55, 1997. ,
The UNIX-Haters Handbook. IDG books, 1994. ,
Backpropagation through time: what it does and how to do it, Proceedings of the IEEE, vol.78, issue.10, pp.1550-1560, 1990. ,
, , 2016.
Local higherorder graph clustering, Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD '17, pp.555-564, 2017. ,
Spark: Cluster Computing with Working Sets, Proceedings of the 2Nd USENIX Conference on Hot Topics in Cloud Computing, HotCloud'10, pp.10-10, 2010. ,
Discretized Streams: Fault-tolerant Streaming Computation at Scale, Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles, SOSP '13, pp.423-438, 2013. ,
The datacenter needs an operating system, Proceedings of the 3rd USENIX Conference on Hot Topics in Cloud Computing, HotCloud'11, pp.17-17, 2011. ,
Big data, for better or worse: 90% of world's data generated over last two years ,