J. R. Bray and J. T. Curtis, An ordination of the upland forest communities of southern Wisconsin, Ecological Monographs, vol.27, issue.4, pp.325-349, 1957.

R. Cardon, N. Grabar, C. Grouin, and T. Hamon, Présentation de la campagne d'évaluation DEFT 2020 : similarité textuelle en domaine ouvert et extraction d'information précise dans des cas cliniques, Actes de DEFT 2020, pp.3-14, 2020.

V. Claveau, Vectorisation, Okapi et calcul de similarité pour le TAL : pour oublier enfin le TF-IDF, TALN - Traitement Automatique des Langues Naturelles, 2012.

S. Gabay, M. Riguet, and L. Barrault, A Workflow For On The Fly Normalisation Of 17th c. French, DH2019, 2019.

N. Grabar, C. Grouin, T. Hamon, and V. Claveau, Information Retrieval and Information Extraction from Clinical Cases. Presentation of the DEFT 2019 Challenge, DEFT 2019 - Défi fouille de texte, pp.1-10, 2019.

A. Huang, Similarity measures for text document clustering, Proceedings of the Sixth New Zealand Computer Science Research Student Conference (NZCSRSC2008), pp.49-56, 2008.

Y. Mehdad and J. Tetreault, Do characters abuse more than words?, Proceedings of the 17th Annual Meeting of the Special Interest Group on Discourse and Dialogue, pp.299-303, 2016.

J. G. Moreno and G. Dias, Easy Web Search Results Clustering: When Baselines Can Reach State-of-the-Art Algorithms, 14th Conference of the European Chapter of the Association for Computational Linguistics, 2014.
URL: https://hal.archives-ouvertes.fr/hal-01076535

K. Nebhi, K. Bontcheva, and G. Gorrell, Restoring capitalization in #tweets, Proceedings of the 24th International Conference on World Wide Web, WWW '15 Companion, pp.1111-1115, 2015.

S. Rendle, L. Zhang, and Y. Koren, On the difficulty of evaluating baselines: A study on recommender systems, 2019.

E. Strubell, A. Ganesh, and A. McCallum, Energy and policy considerations for deep learning in NLP, 2019.

K. Umemura and K. Church, Substring statistics, Proceedings of the 10th International Conference on Computational Linguistics and Intelligent Text Processing, CICLing '09, pp.53-71, 2009.