N. Foto, J. D. Afrati, and . Ullman, Optimizing joins in a map-reduce environment, Proceedings of the International Conference on Extending Database Technology (EDBT), pp.99-110, 2010.

R. Agrawal, T. Imieli´nskiimieli´nski, and A. Swami, Mining Association Rules between Sets of Items in Large Databases, Proceedings of the ACM International Conference on Management of Data (SIGMOD), pp.207-216, 1993.

R. Agrawal and R. Srikant, Fast algorithms for mining association rules in large databases, Proceedings of the International Conference on Very Large Data Bases (VLDB), pp.487-499, 1994.

H. Alemdar, V. Leroy, A. Prost-boucle, and F. Pétrot, Ternary neural networks for resource-ecient ai applications, Proceedings of the International Joint Conference on Neural Networks (IJCNN), pp.1-7, 2017.
DOI : 10.1109/ijcnn.2017.7966166

J. F. Allen, Maintaining knowledge about temporal intervals, Communications of the ACM, vol.26, issue.11, pp.832-843, 1983.
DOI : 10.1145/182.358434

S. Amer-yahia, E. Gaussier, V. Leroy, J. Pilourdault, R. M. Borromeo et al., Task Composition in Crowdsourcing, 2016 IEEE International Conference on Data Science and Advanced Analytics (DSAA), pp.194-203, 2016.
DOI : 10.1109/DSAA.2016.27

URL : https://hal.archives-ouvertes.fr/hal-01407780

S. Amer-yahia, F. Bonchi, C. Castillo, E. Feuerstein, I. Méndez-díaz et al., Composite Retrieval of Diverse and Complementary Bundles, IEEE Transactions on Knowledge and Data Engineering, vol.26, issue.11, pp.2662-2675, 2014.
DOI : 10.1109/TKDE.2014.2306678

C. Anderson, The Long Tail: Why the Future of Business Is Selling Less of More. Hyperion, 2006.

A. Angel, S. Chaudhuri, G. Das, and N. Koudas, Ranking objects based on relationships and fixed associations, Proceedings of the 12th International Conference on Extending Database Technology Advances in Database Technology, EDBT '09, pp.910-921, 2009.
DOI : 10.1145/1516360.1516464

URL : http://ranger.uta.edu/~gdas/websitepages/preprints-papers/edbt09.pdf

D. Arthur and S. Vassilvitskii, K-means++: The advantages of careful seeding, Proceedings of the Eighteenth Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pp.1027-1035, 2007.

R. Baeza-yates, A. Gionis, and F. Junqueira, Vassilis Plachouras, and Luca Telloli. On the feasibility of multi-site web search engines, Proceedings of the ACM Conference on Information and Knowledge Management (CIKM), pp.425-434, 2009.

G. Barbon, V. Leroy, and G. Salaüun, Ternary neural networks for resource-ecient ai applications, Proceedings of the IPM International Conference on Fundamentals of Software Engineering (FSEN), pp.1-15, 2017.

C. James and . Bezdek, A convergence theorem for the fuzzy ISODATA clustering algorithms, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.2, issue.1, pp.1-8, 1980.

J. C. Bezdek, R. Ehrlich, and W. Full, FCM: The fuzzy c-means clustering algorithm, Computers & Geosciences, vol.10, issue.2-3, pp.191-203, 1984.
DOI : 10.1016/0098-3004(84)90020-7

R. Blanco, E. Bortnikov, F. Junqueira, R. Lempel, L. Telloli et al., Caching search engine results over incremental indices, Proceedings of the ACM Conference on Research and Development in Information Retrieval (SIGIR), pp.82-89, 2010.
DOI : 10.1145/1772690.1772806

URL : http://www.dc.fi.udc.es/%7Eroi/publications/sigir2010b.pdf

A. Bonifati, M. Goodfellow, I. Manolescu, and D. Sileo, Algebraic incremental maintenance of xml views, ACM Transactions on Database Systems, vol.3814, issue.3, pp.1-1445, 2013.
DOI : 10.1145/2508020.2508021

URL : https://hal.archives-ouvertes.fr/inria-00624986

H. Bota, K. Zhou, J. M. Jose, and M. Lalmas, Composite retrieval of heterogeneous web search, Proceedings of the 23rd international conference on World wide web, WWW '14, pp.119-130, 2014.
DOI : 10.1145/2566486.2567985

URL : http://www.dcs.gla.ac.uk/~mounia/Papers/fp237-bota.pdf

U. Brefeld, B. B. Cambazoglu, and F. P. Junqueira, Document assignment in multi-site search engines, Proceedings of the fourth ACM international conference on Web search and data mining, WSDM '11, pp.575-584, 2011.
DOI : 10.1145/1935826.1935907

E. A. Brewer, Lessons from giant-scale services, IEEE Internet Computing, vol.5, issue.4, pp.46-55, 2001.
DOI : 10.1109/4236.939450

URL : http://cs.ucsb.edu/~ravenben/classes/papers/giant-ieeeic01.pdf

S. Brin and L. Page, The anatomy of a large-scale hypertextual Web search engine, Computer Networks and ISDN Systems, vol.30, issue.1-7, pp.107-117, 1998.
DOI : 10.1016/S0169-7552(98)00110-X

A. Brodsky, S. M. Henshaw, and J. Whittle, CARD, Proceedings of the 2008 ACM conference on Recommender systems, RecSys '08, pp.171-178, 2008.
DOI : 10.1145/1454008.1454037

J. Calbimonte, J. Mora, and O. Corcho, Query Rewriting in RDF Stream Processing, pp.486-502, 2016.
DOI : 10.1007/978-3-642-41335-3_41

URL : https://infoscience.epfl.ch/record/218546/files/query-rewriting-rdf(3).pdf

B. B. Cambazoglu, E. Varol, E. Kayaaslan, C. Aykanat, and R. Baeza-yates, Query forwarding in geographically distributed search engines, Proceeding of the 33rd international ACM SIGIR conference on Research and development in information retrieval, SIGIR '10, pp.90-97, 2010.
DOI : 10.1145/1835449.1835467

URL : http://www.cs.bilkent.edu.tr/~aykanat/SIGIR-2010.pdf

B. B. Cambazoglu, H. Zaragoza, O. Chapelle, J. Chen, C. Liao et al., Early exit optimizations for additive machine learned ranking systems, Proceedings of the third ACM international conference on Web search and data mining, WSDM '10, pp.411-420, 2010.
DOI : 10.1145/1718487.1718538

URL : http://olivier.chapelle.cc/pub/wsdm2010.pdf

P. Carbone, A. Katsifodimos, S. Ewen, V. Markl, S. Haridi et al., Apache flink TM : Stream and batch processing in a single engine, IEEE Technical Committee on Data Engineering, vol.38, issue.4, pp.28-38, 2015.

B. Chawda, H. Gupta, S. Negi, T. A. Faruquie, L. Venkata et al., Processing interval joins on map-reduce, Proceedings of the International Conference on Extending Database Technology (EDBT), pp.463-474, 2014.

Y. Chen, S. Goldberg, D. Z. Wang, and S. , Ontological Pathfinding, Proceedings of the 2016 International Conference on Management of Data, SIGMOD '16, pp.835-846, 2016.
DOI : 10.14778/1687627.1687727

E. Chlamtác, M. Dinitz, C. Konrad, G. Kortsarz, and G. Rabanca, The densest k-subhypergraph problem, Proceedings of APPROX-RANDOM, pp.1-6, 2016.

M. Munmun-de-choudhury, S. Feldman, N. Amer-yahia, R. Golbandi, C. Lempel et al., Automatic construction of travel itineraries using social breadcrumbs, Proceedings of the 21st ACM conference on Hypertext and hypermedia, HT '10, pp.35-44, 2010.
DOI : 10.1145/1810617.1810626

E. F. Codd, The Relational Model for Database Management: Version 2, 1990.

C. Curino, E. Jones, Y. Zhang, and S. Madden, Schism, Proceedings of the International Conference on Very Large Data Bases (VLDB), pp.48-57, 2010.
DOI : 10.14778/1920841.1920853

W. W. Daniel, Applied Nonparametric Statistics, Houghton Mi?in, 1978.

J. Dean and S. Ghemawat, MapReduce, Proceedings of the USENIX Symposium on Operating Systems Design and Implementation (OSDI), pp.137-150, 2004.
DOI : 10.1145/1327452.1327492

A. Dignös, M. H. Böhlen, and J. Gamper, Overlap interval partition join, Proceedings of the 2014 ACM SIGMOD international conference on Management of data, SIGMOD '14, pp.1459-1470, 2014.
DOI : 10.1145/2588555.2612175

M. Drozdowski, Scheduling for Parallel Processing Computer Communications and Networks, 2009.

D. Dubois, A. Hadjali, and H. Prade, Fuzziness and uncertainty in temporal reasoning, J. UCS, vol.9, issue.9, p.1168, 2003.

J. Enderle, M. Hampel, and T. Seidl, Joining interval data in relational databases, Proceedings of the 2004 ACM SIGMOD international conference on Management of data , SIGMOD '04, pp.683-694, 2004.
DOI : 10.1145/1007568.1007645

R. Fagin, Combining Fuzzy Information from Multiple Systems, Proceedings of the Symposium on Principles of Database Systems (PODS), pp.216-226, 1996.
DOI : 10.1006/jcss.1998.1600

URL : https://doi.org/10.1006/jcss.1998.1600

R. Fagin, A. Lotem, and M. Naor, Optimal aggregation algorithms for middleware, Proceedings of the Symposium on Principles of Database Systems (PODS), pp.102-113, 2001.
DOI : 10.1145/375551.375567

URL : http://arxiv.org/abs/cs/0204046

J. Finger and N. Polyzotis, Robust and ecient algorithms for rank join evaluation, Proceedings of the ACM International Conference on Management of Data (SIGMOD), pp.415-428, 2009.
DOI : 10.1145/1559845.1559890

URL : http://www.cse.ucsc.edu/~alkis/papers/sigmod568-finger.pdf

L. Galárraga, C. Teflioudi, K. Hose, and F. M. Suchanek, Fast rule mining in ontological knowledge bases with AMIE $$+$$ +, The VLDB Journal, vol.5, issue.3, pp.707-730, 2015.
DOI : 10.14778/2735508.2735510

D. Gao, C. S. Jensen, R. T. Snodgrass, and M. D. Soo, Join operations in temporal databases, The VLDB Journal, vol.25, issue.1, pp.2-29, 2005.
DOI : 10.1109/MC.1986.1663327

URL : http://www.cs.auc.dk/research/DP/tdb/TimeCenter/TimeCenterPublications/TR-71.pdf

W. Gao, H. C. Lee, and Y. Miao, Geographically focused collaborative crawling, Proceedings of the 15th international conference on World Wide Web , WWW '06, pp.287-296, 2006.
DOI : 10.1145/1135777.1135822

F. Gaud, B. Lepers, J. Funston, M. Dashti, A. Fedorova et al., Challenges of memory management on modern NUMA systems, Communications of the ACM, vol.58, issue.12, pp.59-66, 2015.
DOI : 10.1145/2814328

URL : https://hal.archives-ouvertes.fr/hal-01242202

L. Geng and H. J. Hamilton, Interestingness measures for data mining, ACM Computing Surveys, vol.38, issue.3, 2006.
DOI : 10.1145/1132960.1132963

S. Goel, A. Broder, E. Gabrilovich, and B. Pang, Anatomy of the long tail, Proceedings of the third ACM international conference on Web search and data mining, WSDM '10, pp.201-210, 2010.
DOI : 10.1145/1718487.1718513

A. Graham, H. Garcia-molina, A. Paepcke, and T. Winograd, Time as essence for photo browsing through personal digital libraries, Proceedings of the second ACM/IEEE-CS joint conference on Digital libraries , JCDL '02, pp.326-335, 2002.
DOI : 10.1145/544220.544301

URL : http://www.cs.ucsd.edu/~rik/courses/cogs121-s03/proj/AO2198074PIX/time-essence.pdf

J. Han, J. Wang, Y. Lu, and P. Tzvetkov, Mining top-k frequent closed patterns without minimum support, Proceedings of the International Conference on Data Mining (ICDM), pp.211-218, 2002.

A. Heise, J. Quiané-ruiz, Z. Abedjan, A. Jentzsch, and F. Naumann, Scalable discovery of unique column combinations, Proceedings of the International Conference on Very Large Data Bases (VLDB), pp.301-312, 2013.
DOI : 10.14778/2732240.2732248

URL : http://www.vldb.org/pvldb/vol7/p301-heise.pdf

A. Bernardo, D. M. Huberman, F. Romero, and . Wu, Social networks that matter: Twitter under the microscope, First Monday, vol.14, issue.1, 2009.

O. Iegorov, Data mining approach to temporal debugging of embedded streaming applications, 2015 International Conference on Embedded Software (EMSOFT), 2016.
DOI : 10.1109/EMSOFT.2015.7318272

URL : https://hal.archives-ouvertes.fr/hal-01178782

F. Ihab, W. G. Ilyas, A. K. Aref, and . Elmagarmid, Supporting top-k join queries in relational databases, Proceedings of the International Conference on Very Large Data Bases (VLDB), pp.754-765, 2003.

F. Ihab, G. Ilyas, M. A. Beskales, and . Soliman, A survey of top-k query processing techniques in relational database systems, ACM Computing Surveys, vol.4011, issue.4, pp.1-1158, 2008.

A. Ja?e, M. Naaman, T. Tassa, and M. Davis, Generating summaries and visualization for large collections of geo-referenced photographs, Proceedings of the ACM SIGMM International Workshop on Multimedia Information Retrieval (MIR), pp.89-98, 2006.

K. Järvelin and J. Kekäläinen, Cumulated gain-based evaluation of IR techniques, ACM Transactions on Information Systems, vol.20, issue.4, pp.422-446, 2002.
DOI : 10.1145/582415.582418

P. Flavio, V. Junqueira, M. Leroy, and . Morel, Reactive index replication for distributed search engines, Proceedings of the ACM Conference on Research and Development in Information Retrieval (SIGIR), pp.831-840, 2012.

G. Karypis and V. Kumar, A Fast and High Quality Multilevel Scheme for Partitioning Irregular Graphs, SIAM Journal on Scientific Computing, vol.20, issue.1, pp.359-392, 1998.
DOI : 10.1137/S1064827595287997

URL : http://glaros.dtc.umn.edu/gkhome/fetch/papers/mlSIAMSC99.pdf

M. G. Kendall, A NEW MEASURE OF RANK CORRELATION, Biometrika, vol.30, issue.1-2, pp.81-93, 1938.
DOI : 10.1093/biomet/30.1-2.81

M. Kirchgessner, V. Leroy, S. Amer-yahia, and S. Mishra, Testing Interestingness Measures in Practice: A Large-Scale Analysis of Buying Patterns, 2016 IEEE International Conference on Data Science and Advanced Analytics (DSAA), pp.547-556, 2016.
DOI : 10.1109/DSAA.2016.53

URL : https://hal.archives-ouvertes.fr/hal-01407787

S. Kulkarni, N. Bhagat, M. Fu, V. Kedigehalli, C. Kellogg et al., Twitter Heron, Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, SIGMOD '15, pp.239-250, 2015.
DOI : 10.1145/2588555.2595641

V. Lavrenko and W. B. Croft, Relevance Models in Information Retrieval, pp.11-56, 2003.
DOI : 10.1007/978-94-017-0171-6_2

V. Leroy, M. Kirchgessner, A. Termier, and S. Amer-yahia, TopPI: An efficient algorithm for item-centric mining, Information Systems, vol.64, pp.104-118, 2017.
DOI : 10.1016/j.is.2016.09.001

URL : https://hal.archives-ouvertes.fr/hal-01354713

V. Leroy, S. Amer-yahia, E. Gaussier, and H. Mirisaee, Building Representative Composite Items, Proceedings of the 24th ACM International on Conference on Information and Knowledge Management, CIKM '15, pp.1421-1430, 2015.
DOI : 10.1145/1178677.1178692

URL : https://hal.archives-ouvertes.fr/hal-01180167

H. Li, Y. Wang, D. Zhang, M. Zhang, and E. Y. Chang, Pfp, Proceedings of the 2008 ACM conference on Recommender systems, RecSys '08, pp.107-114, 2008.
DOI : 10.1145/1454008.1454027

G. Liu, M. Feng, Y. Wang, L. Wong, S. Ng et al., Towards exploratory hypothesis testing and analysis, 2011 IEEE 27th International Conference on Data Engineering, pp.745-756, 2011.
DOI : 10.1109/ICDE.2011.5767907

URL : http://www.comp.nus.edu.sg/~wongls/psZ/v8-guimei-icde2011.pdf

H. Lu, B. C. Ooi, and K. Tan, On spatially partitioned temporal join, Proceedings of the International Conference on Very Large Data Bases (VLDB), pp.546-557, 1994.

L. Marujo, R. Ribeiro, A. Gershman, D. Martins-de-matos, J. P. Neto et al., Event-based summarization using a centrality-as-relevance model, Knowledge and Information Systems, vol.14, issue.4, pp.945-968, 2017.
DOI : 10.1145/564376.564398

A. Metwally, D. Agrawal, and . Abbadi, Ecient computation of frequent and top-k elements in data streams, Proceedings of the International Conference on Database Theory (ICDT), pp.398-412, 2005.

S. Minato, T. Uno, K. Tsuda, A. Terada, and J. Sese, A Fast Method of Statistical Assessment for Combinatorial Hypotheses Based on Frequent Itemset Enumeration, Machine Learning and Knowledge Discovery in Databases, pp.422-436, 2014.
DOI : 10.1007/978-3-662-44851-9_27

B. Négrevergne, A. Termier, J. Mhaut, and T. Uno, Discovering closed frequent itemsets on multicore: Parallelizing computations and optimizing memory accesses, 2010 International Conference on High Performance Computing & Simulation, pp.521-528, 2010.
DOI : 10.1109/HPCS.2010.5547082

L. Neumeyer, B. Robbins, A. Nair, and A. Kesari, S4: Distributed Stream Computing Platform, 2010 IEEE International Conference on Data Mining Workshops, pp.170-177, 2010.
DOI : 10.1109/ICDMW.2010.172

URL : http://www.cs.brown.edu/courses/cs227/papers/s4.pdf

N. Ntarmos, I. Patlakas, and P. Triantafillou, Rank join queries in NoSQL databases, Proceedings of the International Conference on Very Large Data Bases (VLDB), pp.493-504, 2014.
DOI : 10.14778/2732286.2732287

A. Okcan and M. Riedewald, Processing theta-joins using MapReduce, Proceedings of the 2011 international conference on Management of data, SIGMOD '11, pp.949-960, 2011.
DOI : 10.1145/1989323.1989423

N. Pasquier, Y. Bastide, R. Taouil, and L. Lakhal, Discovering Frequent Closed Itemsets for Association Rules, Proceedings of the International Conference on Database Theory (ICDT), pp.398-416, 1999.
DOI : 10.1007/3-540-49257-7_25

URL : https://hal.archives-ouvertes.fr/hal-00467747

G. Piatetsky-shapiro, Knowledge discovery in databases, ACM SIGKDD Explorations Newsletter, vol.1, issue.2, 1991.
DOI : 10.1145/846183.846197

J. Pilourdault, Scalable Algorithms for Monitoring Activity Traces, 2017.

J. Pilourdault, V. Leroy, and S. Amer-yahia, Distributed Evaluation of Top-k Temporal Joins, Proceedings of the 2016 International Conference on Management of Data, SIGMOD '16, pp.1027-1039, 2016.
DOI : 10.14778/2350229.2350238

URL : https://hal.archives-ouvertes.fr/hal-01266188

A. Prost-boucle, F. Pétrot, V. Leroy, and H. Alemdar, Ecient and versatile fpga acceleration of support counting for stream mining of sequences and frequent itemsets, ACM Transactions on Reconfigurable Technology and Systems, vol.1021, issue.3, pp.1-2125, 2017.

M. Josep, V. Pujol, G. Erramilli, X. Siganos, N. Yang et al., The little engine(s) that could: Scaling online social networks, IEEE/ACM Transactions on Networking, vol.20, issue.4, pp.1162-1175, 2012.

M. Rousset and F. Ulliana, Extracting bounded-level modules from deductive rdf triplestores, Proceedings of the Conference on Artificial Intelligence (AAAI), pp.268-274, 2015.
URL : https://hal.archives-ouvertes.fr/lirmm-01086951

S. Senjuti-basu-roy, A. Amer-yahia, G. Chawla, C. Das, and . Yu, Constructing and exploring composite items, Proceedings of the ACM International Conference on Management of Data (SIGMOD), pp.843-854, 2010.

L. T. Rodrygo, C. Santos, I. Macdonald, and . Ounis, Search result diversification, Found. Trends Inf. Retr, vol.9, issue.1, pp.1-90, 2015.

K. Schnaitter and N. Polyzotis, Evaluating rank joins with optimal cost, Proceedings of the twenty-seventh ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems , PODS '08, pp.43-52, 2008.
DOI : 10.1145/1376916.1376924

URL : http://www.cse.ucsc.edu/~karlsch/pubs/pbrj-TR.pdf

R. R. Sokal and C. D. Michener, A Statistical Method for Evaluating Systematic Relationships. The University of Kansas science bulletin, pp.1409-1438, 1958.

I. Tatarinov, An ecient LFU-like policy for web caches, 1998.

S. Tatikonda, B. B. Cambazoglu, and F. P. Junqueira, Posting list intersection on multicore architectures, Proceedings of the 34th international ACM SIGIR conference on Research and development in Information, SIGIR '11, pp.963-972, 2011.
DOI : 10.1145/2009916.2010045

H. C. Carlos, A. J. Teixeira, M. Fonseca, G. Serafini, M. J. Siganos et al., Arabesque: A system for distributed graph mining, Proceedings of the Symposium on Operating Systems Principles (SOSP), pp.425-440, 2015.

T. Uno, T. Asai, Y. Uchida, and H. Arimura, An ecient algorithm for enumerating closed patterns in transaction databases, Discovery Science, pp.16-31, 2004.
DOI : 10.1007/978-3-540-30214-8_2

URL : http://research.nii.ac.jp/~uno/papers/lcm_ds04.pdf

T. Uno, M. Kiyomi, and H. Arimura, Lcm ver. 2: Ecient mining algorithms for frequent/closed/maximal itemsets, Proceedings of the Workshop on Frequent Itemset Mining Implementations (FIMI), 2004.
DOI : 10.1145/1133905.1133916

S. Wang, D. Maier, and B. C. Ooi, Fast and adaptive indexing of multidimensional observational data, Proceedings of the International Conference on Very Large Data Bases (VLDB), pp.1683-1694, 2016.
DOI : 10.14778/3007328.3007334

X. Wang, M. Bendersky, D. Metzler, and M. Najork, Learning to Rank with Selection Bias in Personal Search, Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval, SIGIR '16, pp.115-124, 2016.
DOI : 10.1145/1718487.1718528

C. Wilson, B. Boe, A. Sala, K. P. Puttaswamy, and B. Y. Zhao, User interactions in social networks and their implications, Proceedings of the fourth ACM european conference on Computer systems, EuroSys '09, pp.205-218, 2009.
DOI : 10.1145/1519065.1519089

URL : http://cs.ucsb.edu/~ravenben/publications/pdf/interaction-eurosys09.pdf

M. Xie, V. S. Laks, P. T. Lakshmanan, and . Wood, Breaking out of the box of recommendations, Proceedings of the fourth ACM conference on Recommender systems, RecSys '10, 2010.
DOI : 10.1145/1864708.1864739

C. Zhai and J. La?erty, Model-based feedback in the language modeling approach to information retrieval, Proceedings of the tenth international conference on Information and knowledge management , CIKM'01, pp.403-410, 2001.
DOI : 10.1145/502585.502654

J. Zhang and T. Suel, Optimized Inverted List Assignment in Distributed Search Engine Architectures, 2007 IEEE International Parallel and Distributed Processing Symposium, pp.1-10, 2007.
DOI : 10.1109/IPDPS.2007.370231

URL : http://cis.poly.edu/~suel/papers/colloc.pdf

X. Zhang, L. Chen, and M. Wang, Ecient multi-way theta-join processing using mapreduce, Proceedings of the International Conference on Very Large Data Bases (VLDB), pp.1184-1195, 2012.
DOI : 10.14778/2350229.2350238

URL : http://vldb.org/pvldb/vol5/p1184_xiaofeizhang_vldb2012.pdf