Skip to Main content Skip to Navigation
New interface
Journal articles

Named Entity Recognition in Low-resource Languages using Cross-lingual distributional word representation

Abstract : Named Entity Recognition (NER) is a fundamental task in many NLP applications that seek to identify and classify expressions such as people, location, and organization names. Many NER systems have been developed, but the annotated data needed for good performances are not available for low-resource languages, such as Cameroonian languages. In this paper we exploit the low frequency of named entities in text to define a new suitable cross-lingual distributional representation for named entity recognition. We build the first Ewondo (a Bantu low-resource language of Cameroon) named entities recognizer by projecting named entity tags from English using our word representation. In terms of Recall, Precision and F-score, the obtained results show the effectiveness of the proposed distributional representation of words
Complete list of metadata

Cited literature [19 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-02557655
Contributor : Michael Mbouopda Connect in order to contact the contributor
Submitted on : Tuesday, September 29, 2020 - 1:07:06 PM
Last modification on : Wednesday, August 31, 2022 - 12:14:23 PM

File

ARIMA-Vol33-01-11 (1).pdf
Files produced by the author(s)

Identifiers

Collections

Citation

Michael Franklin F Mbouopda, Paulin Melatagia Yonta. Named Entity Recognition in Low-resource Languages using Cross-lingual distributional word representation. Revue Africaine de Recherche en Informatique et Mathématiques Appliquées, 2020, Special issue CRI 2019, Volume 33 - 2020 - Special issue CRI 2019, pp.1-11. ⟨10.46298/arima.6439⟩. ⟨hal-02557655v3⟩

Share

Metrics

Record views

146

Files downloads

862