A Theoretical and Experimental Comparison of Filter-Based Equijoins in MapReduce, TLDKS, vol.25, pp.33-70, 2016. ,
DOI : 10.1007/978-3-662-49534-6_2
URL : https://hal.archives-ouvertes.fr/hal-01408492
Efficient parallel set-similarity joins using MapReduce, Proceedings of the 2010 international conference on Management of data, SIGMOD '10, pp.495-506, 2010. ,
DOI : 10.1145/1807167.1807222
Detectives, Proceedings of the 16th international conference on World Wide Web , WWW '07, pp.241-250, 2007. ,
DOI : 10.1145/1242572.1242606
Evaluating similarity measures, Proceeding of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining , KDD '05, pp.678-684, 2005. ,
DOI : 10.1145/1081870.1081956
Finding near-duplicate web pages, Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval , SIGIR '06, pp.284-291, 2006. ,
DOI : 10.1145/1148170.1148222
Syntactic clustering of the Web, WWW, pp.1157-1166, 1997. ,
DOI : 10.1016/S0169-7552(97)00031-7
A web-based kernel function for measuring the similarity of short text snippets, Proceedings of the 15th international conference on World Wide Web , WWW '06, pp.377-386, 2006. ,
DOI : 10.1145/1135777.1135834
V-SMART-join, Proceedings of the VLDB Endowment, vol.5, issue.8, p.6077, 1204. ,
DOI : 10.14778/2212351.2212353
Fuzzy Joins Using MapReduce, 2012 IEEE 28th International Conference on Data Engineering, pp.498-509, 2012. ,
DOI : 10.1109/ICDE.2012.66
Toward Intersection Filterbased Optimization for Joins in MapReduce, pp.1-2, 2013. ,
DOI : 10.1145/2501928.2501932
Space/time trade-offs in hash coding with allowable errors, Communications of the ACM, vol.13, issue.7, pp.422-426, 1970. ,
DOI : 10.1145/362686.362692
URL : http://www.ovmj.org/GNUnet/papers/p422-bloom.pdf
MapReduce, Communications of the ACM, vol.51, issue.1, pp.107-113, 2008. ,
DOI : 10.14293/S2199-1006.1.SOR-UNCAT.AUNHT8.v1.RBZFIB
MapReduce-based similarity join for metric spaces, Proceedings of the 1st International Workshop on Cloud Intelligence, Cloud-I '12, pp.1-3, 2012. ,
DOI : 10.1145/2347673.2347676
Exploiting MapReduce-based similarity joins, Proceedings of the 2012 international conference on Management of Data, SIGMOD '12, pp.693-696, 2012. ,
DOI : 10.1145/2213836.2213935
Processing theta-joins using MapReduce, Proceedings of the 2011 international conference on Management of data, SIGMOD '11, pp.949-960, 2011. ,
DOI : 10.1145/1989323.1989423
Efficient similarity joins for near-duplicate detection, ACM Transactions on Database Systems, vol.36, issue.3, pp.1-1541, 2011. ,
DOI : 10.1145/2000824.2000825
URL : http://www.cse.unsw.edu.au/%7Eweiw/files/TODS-PPJoin-Final.pdf
An Experimental Survey of MapReduce-Based Similarity Joins, Similarity Search and Applications, pp.181-195, 2016. ,
DOI : 10.1145/1367497.1367516
Theory and Network Applications of Dynamic Bloom Filters, Proceedings IEEE INFOCOM 2006. 25TH IEEE International Conference on Computer Communications, pp.1-12, 2006. ,
DOI : 10.1109/INFOCOM.2006.325
URL : http://www.cse.ust.hk/~liu/guodeke/dynamicbloomfilters.pdf
Fuzzy joins in MapReduce, Proceedings of the VLDB Endowment, vol.8, issue.12, pp.1514-1517, 2015. ,
DOI : 10.14778/2824032.2824049
, Apache SparkTM -Lightning-Fast Cluster Computing