Pattern spotting in historical documents using convolutional models - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2019

Pattern spotting in historical documents using convolutional models

Ignacio Ubeda
  • Fonction : Auteur
José M. Saavedra
  • Fonction : Auteur
Stéphane Nicolas
Caroline Petitjean
Laurent Heutte

Résumé

Pattern spotting consists of searching in a collection of historical document images for occurrences of a graphical object using an image query. Contrary to object detection, no prior information nor predefined class is given about the query so training a model of the object is not feasible. In this paper, a convolutional neural network approach is proposed to tackle this problem. We use RetinaNet as a feature extractor to obtain multiscale embeddings of the regions of the documents and also for the queries. Experiments conducted on the DocExplore dataset show that our proposal is better at locating patterns and requires less storage for indexing images than the state-of-the-art system, but fails at retrieving some pages containing multiple instances of the query.

Dates et versions

hal-02335779 , version 1 (28-10-2019)

Identifiants

Citer

Ignacio Ubeda, José M. Saavedra, Stéphane Nicolas, Caroline Petitjean, Laurent Heutte. Pattern spotting in historical documents using convolutional models. 5th International Workshop on Historical Document Imaging and Processing, HIP 2019, Sydney, Australia, Sept 2019, ICDAR2019, Sep 2019, Sydney, Australia. pp.60-65, ⟨10.1145/3352631⟩. ⟨hal-02335779⟩
22 Consultations
0 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More