Automatic annotation of bibliographical references in digital humanities books, articles and blogs - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2011

Automatic annotation of bibliographical references in digital humanities books, articles and blogs

Young-Min Kim
  • Fonction : Auteur
  • PersonId : 982285
Patrice Bellot
Elodie Faath
  • Fonction : Auteur
  • PersonId : 982286
Marin Dacos

Résumé

In this paper, we deal with the problem of extracting and processing useful information from bibliographic references in Digital Humanities (DH) data. A machine learning tech- nique for sequential data analysis, Conditional Random Field is applied to a corpus extracted from OpenEdition site, a web platform for journals and book collections in the hu- manities and social sciences. We present our ongoing project with this purpose that includes the construction of a proper corpus and a efficient CRF model on this as a preliminary. This project is supported by Google Grant for Digital Hu- manities. A number of experiments are conducted to find one of the best settings for a CRF model on the corpus, and we verify them both in an automatic and manual way of evaluation.
Fichier principal
Vignette du fichier
article.pdf (599.35 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01317638 , version 1 (21-01-2019)

Identifiants

Citer

Young-Min Kim, Patrice Bellot, Elodie Faath, Marin Dacos. Automatic annotation of bibliographical references in digital humanities books, articles and blogs. 4th ACM workshop on Online books, complementary social media and crowdsourcing - BooksOnline '11, 2011, Glasgow, United Kingdom. ⟨10.1145/2064058.2064068⟩. ⟨hal-01317638⟩
145 Consultations
231 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More