Automatic analysis of old documents: taking advantage of an incomplete, heterogeneous and noisy corpus. Recherche d'information, document et web sémantique, 2019. ,
URL : https://hal.archives-ouvertes.fr/hal-02467535
Using SMT for OCR error correction of historical texts, Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pp.962-966, 2016. ,
Bootstrapped OCR error detection for a less-resourced language variant, 13th Conference on Natural Language Processing (KONVENS 2016), pp.21-26, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-01371689
Representativeness in corpus design, Literary and Linguistic Computing, vol.8, issue.4, p.1, 1993. ,
Maximal repeats enhance substringbased authorship attribution, Proceedings of the International Conference Recent Advances in Natural Language Processing, pp.63-71, 2015. ,
Hybrid OCR combination approach complemented by a specialized ICR applied on ancient documents, Humanities, computers and cultural heritage: Proceedings of the XVIth International Conference of the Association for History and Computing (AHC 2005), pp.161-168, 2005. ,
URL : https://hal.archives-ouvertes.fr/inria-00000363
Méthodes pour l'archéologie linguistique : datation par combinaison d'indices temporels, DÉfi Fouille de Textes, 2011. ,
Présentation et résultats du défi fouille de texte DEFT2010 où et quand un article de presse a-t-ilétéécrit ?, Actes de DEFT, p.23, 2010. ,
Présentation et résultats du défi fouille de texte DEFT2011. quand un article de presse a-t-ilétéécrit ?à quel article scientifique correspond ce résumé ?, Actes de DEFT, 2011. ,
Diachronic word embeddings reveal statistical laws of semantic change, Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, vol.1, pp.1489-1501, 2016. ,
Estimating document focus time, CIKM, 2013. ,
Improving temporal language models for determining time of nontimestamped documents, vol.5173, p.9, 2008. ,
Impact of ocr quality on named entity linking, Digital Libraries at the Crossroads of Digital Information for the Future, pp.102-115, 2019. ,
URL : https://hal.archives-ouvertes.fr/hal-02557116
Estimating time models for news article excerpts, CIKM, 2016. ,
Temporal text ranking and automatic dating of texts, Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, vol.2, pp.17-21, 2014. ,
Behind the times: Detecting epoch changes using large corpora, 2013. ,
ICDAR 2019 Competition on Post-OCR Text Correction, 15th International Conference on Document Analysis and Recognition, 2019. ,
URL : https://hal.archives-ouvertes.fr/hal-02304334
Development of a morphological and syntactic lexicon of Old French, 26ème Conférence sur le Traitement Automatique des Langues Naturelles (TALN), 2019. ,
URL : https://hal.archives-ouvertes.fr/hal-02148701
Ixagroupehudiac: A multiple approach system towards the diachronic evaluation of texts, Proceedings of the 9th International Workshop on Semantic Evaluation, pp.840-845, 2015. ,
Preliminary recommendations on text typology, 1996. ,
Stylistic changes for temporal text classification, TSD, 2013. ,
, Impact analysis of ocr quality on research tasks in digital archives. SpringerLink, pp.252-263, 2015.