Scaling up Automatic Structuring of Manuscript Sales Catalogues - HAL open archive
Conference paper, Year: 2019

Scaling up Automatic Structuring of Manuscript Sales Catalogues

Abstract

Manuscript Sales Catalogues (MSC) are highly important for authenticating documents and studying the reception of authors. Their regular publication throughout Europe since the beginning of the 19th century has raised interest in scaling up the means of automatically structuring their contents. Following successful first encoding tests with GROBID-Dictionaries [1,2] on a single MSC collection [3], this paper presents the results of more advanced tests of the system's capacity to handle a larger corpus of MSC from different dealers, and therefore with multiple layouts.
Main file
Grobid Catalogues TEI 2019.pdf (347.35 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-02272962, version 1 (28-08-2019)

License

Attribution

Identifiers

  • HAL Id: hal-02272962, version 1

Cite

Lucie Rondeau Du Noyer, Simon Gabay, Mohamed Khemakhem, Laurent Romary. Scaling up Automatic Structuring of Manuscript Sales Catalogues. TEI 2019: What is text, really? TEI and beyond, Sep 2019, Graz, Austria. ⟨hal-02272962⟩
324 views
160 downloads