Using Elasticsearch for entity recognition in affiliation disambiguation - Archive ouverte HAL Accéder directement au contenu
Pré-Publication, Document De Travail Année : 2021

Using Elasticsearch for entity recognition in affiliation disambiguation

Résumé

Automatic recognition of affiliations in the metadata of scholarly publications is a key point for monitoring and analyzing trends in scientific production, especially in an open science context. We propose an automatic alignment method on registries, based on Elasticsearch. The proposed method is modular and leaves the choice of the alignment criteria to the user, allowing him to keep control over the precision and recall of the method. An implementation is proposed for an automatic alignment on three registries: countries, GRID.ac and RNSR (research laboratory directory in France) on the Github https://github.com/dataesr/matcher and the performances are analyzed in this paper.
Fichier principal
Vignette du fichier
matcher.pdf (119.74 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-03365806 , version 1 (07-10-2021)

Licence

Paternité

Identifiants

  • HAL Id : hal-03365806 , version 1

Citer

Anne L'Hôte, Eric Jeangirard. Using Elasticsearch for entity recognition in affiliation disambiguation. 2021. ⟨hal-03365806⟩
297 Consultations
275 Téléchargements

Partager

Gmail Facebook X LinkedIn More