Journal article, International Journal on Document Analysis and Recognition, Year: 2021

Self-Supervised Deep Metric Learning for ancient papyrus fragments retrieval

Abstract

This work focuses on associating document fragments using Deep Metric Learning methods. More precisely, we are interested in ancient papyrus fragments that need to be reconstructed prior to their analysis by papyrologists. This task is challenging to automate with machine learning algorithms because labeled data is rare, often incomplete, imbalanced, and in inconsistent states of conservation. However, there is a real need for such software in the papyrology community, as reconstructing papyri by hand is extremely time-consuming and tedious. In this paper, we explore ways in which papyrologists can obtain useful matching suggestions on new data using Deep Convolutional Siamese Networks. We emphasize low-to-no human intervention for annotating images. We show that the from-scratch self-supervised approach we propose is more effective than knowledge transfer from a large dataset, the former achieving a top-1 accuracy of 0.73 on a retrieval task involving 800 fragments. The research leading to these results has received funding from the European Research Council (ERC) under the European Union's Horizon 2020 research and innovation program under grant agreement No 758907 and is part of the GESHAEM project, hosted by the Ausonius Institute. The source code (upon request) and the data used in this article are available at https://morphoboid.labri.fr/self-supervised-papyrus.html
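To make the reported metric concrete, the sketch below shows one common way a top-1 retrieval accuracy can be computed from fragment embeddings. It is an illustration only: the abstract does not specify the exact evaluation protocol, so the nearest-neighbor rule, the Euclidean distance, the function name `top1_retrieval_accuracy`, and the toy data are all assumptions and are not taken from the article.

```python
import math


def top1_retrieval_accuracy(embeddings, papyrus_ids):
    """Fraction of fragments whose nearest *other* fragment shares their papyrus id.

    Hypothetical helper: assumes each fragment is already mapped to an
    embedding vector (for instance by a Siamese network) and that a
    retrieval counts as correct when the closest fragment, by Euclidean
    distance, comes from the same papyrus.
    """
    n = len(embeddings)
    if n < 2:
        raise ValueError("need at least two fragments")
    hits = 0
    for q in range(n):
        best_idx, best_dist = None, math.inf
        for c in range(n):
            if c == q:  # a fragment must not retrieve itself
                continue
            d = math.dist(embeddings[q], embeddings[c])
            if d < best_dist:
                best_idx, best_dist = c, d
        hits += papyrus_ids[best_idx] == papyrus_ids[q]
    return hits / n


# Toy data (invented for illustration): five 2-D embeddings from two papyri.
toy_embeddings = [(0.0, 0.0), (0.1, 0.0), (1.0, 1.0), (1.1, 1.0), (0.2, 0.1)]
toy_ids = ["A", "A", "B", "B", "B"]
score = top1_retrieval_accuracy(toy_embeddings, toy_ids)
print(score)
```

In this toy set, the last fragment belongs to papyrus B but lies closest to a fragment of papyrus A, so it counts as a miss and the score falls below 1.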
Main file
IJDAR2021.pdf (8.04 MB)
Origin: Files produced by the author(s)

Dates and versions

hal-03260782, version 1 (15-06-2021)

Identifiers

  • HAL Id: hal-03260782, version 1

Cite

Antoine Pirrone, Marie Beurton-Aimar, Nicholas Journet. Self-Supervised Deep Metric Learning for ancient papyrus fragments retrieval. International Journal on Document Analysis and Recognition, 2021. ⟨hal-03260782⟩

Collections

CNRS
101 views
81 downloads
