T. Phan, L. Orazio, and P. Rigaux, A Theoretical and Experimental Comparison of Filter-Based Equijoins in MapReduce, TLDKS, vol.25, pp.33-70, 2016.
DOI : 10.1007/978-3-662-49534-6_2

URL : https://hal.archives-ouvertes.fr/hal-01408492

R. Vernica, M. J. Carey, and C. Li, Efficient parallel set-similarity joins using MapReduce, Proceedings of the 2010 international conference on Management of data, SIGMOD '10, pp.495-506, 2010.
DOI : 10.1145/1807167.1807222

A. Metwally, D. Agrawal, and A. Abbadi, Detectives, Proceedings of the 16th international conference on World Wide Web , WWW '07, pp.241-250, 2007.
DOI : 10.1145/1242572.1242606

E. Spertus, M. Sahami, and O. Buyukkokten, Evaluating similarity measures, Proceeding of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining , KDD '05, pp.678-684, 2005.
DOI : 10.1145/1081870.1081956

M. Henzinger, Finding near-duplicate web pages, Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval , SIGIR '06, pp.284-291, 2006.
DOI : 10.1145/1148170.1148222

A. Z. Broder, S. C. Glassman, M. S. Manasse, and G. Zweig, Syntactic clustering of the Web, WWW, pp.1157-1166, 1997.
DOI : 10.1016/S0169-7552(97)00031-7

M. Sahami and T. D. Heilman, A web-based kernel function for measuring the similarity of short text snippets, Proceedings of the 15th international conference on World Wide Web , WWW '06, pp.377-386, 2006.
DOI : 10.1145/1135777.1135834

A. Metwally and C. Faloutsos, V-SMART-join, Proceedings of the VLDB Endowment, vol.5, issue.8, p.6077, 1204.
DOI : 10.14778/2212351.2212353

F. N. Afrati, A. D. Sarma, D. Menestrina, A. Parameswaran, and J. D. Ullman, Fuzzy Joins Using MapReduce, 2012 IEEE 28th International Conference on Data Engineering, pp.498-509, 2012.
DOI : 10.1109/ICDE.2012.66

T. Phan, L. Orazio, and P. Rigaux, Toward Intersection Filterbased Optimization for Joins in MapReduce, pp.1-2, 2013.
DOI : 10.1145/2501928.2501932

B. H. Bloom, Space/time trade-offs in hash coding with allowable errors, Communications of the ACM, vol.13, issue.7, pp.422-426, 1970.
DOI : 10.1145/362686.362692

URL : http://www.ovmj.org/GNUnet/papers/p422-bloom.pdf

J. Dean and S. Ghemawat, MapReduce, Communications of the ACM, vol.51, issue.1, pp.107-113, 2008.
DOI : 10.14293/S2199-1006.1.SOR-UNCAT.AUNHT8.v1.RBZFIB

Y. N. Silva, J. M. Reed, and L. M. Tsosie, MapReduce-based similarity join for metric spaces, Proceedings of the 1st International Workshop on Cloud Intelligence, Cloud-I '12, pp.1-3, 2012.
DOI : 10.1145/2347673.2347676

Y. N. Silva and J. M. Reed, Exploiting MapReduce-based similarity joins, Proceedings of the 2012 international conference on Management of Data, SIGMOD '12, pp.693-696, 2012.
DOI : 10.1145/2213836.2213935

A. Okcan and M. Riedewald, Processing theta-joins using MapReduce, Proceedings of the 2011 international conference on Management of data, SIGMOD '11, pp.949-960, 2011.
DOI : 10.1145/1989323.1989423

C. Xiao, W. Wang, X. Lin, J. X. Yu, and G. Wang, Efficient similarity joins for near-duplicate detection, ACM Transactions on Database Systems, vol.36, issue.3, pp.1-1541, 2011.
DOI : 10.1145/2000824.2000825

URL : http://www.cse.unsw.edu.au/%7Eweiw/files/TODS-PPJoin-Final.pdf

Y. N. Silva, J. Reed, K. Brown, A. Wadsworth, and C. Rong, An Experimental Survey of MapReduce-Based Similarity Joins, Similarity Search and Applications, pp.181-195, 2016.
DOI : 10.1145/1367497.1367516

D. Guo, J. Wu, H. Chen, and X. Luo, Theory and Network Applications of Dynamic Bloom Filters, Proceedings IEEE INFOCOM 2006. 25TH IEEE International Conference on Computer Communications, pp.1-12, 2006.
DOI : 10.1109/INFOCOM.2006.325

URL : http://www.cse.ust.hk/~liu/guodeke/dynamicbloomfilters.pdf

B. Kimmett, V. Srinivasan, and A. Thomo, Fuzzy joins in MapReduce, Proceedings of the VLDB Endowment, vol.8, issue.12, pp.1514-1517, 2015.
DOI : 10.14778/2824032.2824049

, Apache SparkTM -Lightning-Fast Cluster Computing