Comparing Semantic Relatedness between Word Pairs in Portuguese Using Wikipedia

Abstract : The growth of available data in digital format has been facilitating the development of new models to automatically infer the semantic similarity between word pairs. However, there are still many natural languages without sufficient resources to evaluate measures of semantic relatedness. In this paper we translated word pairs from a well-known baseline for evaluating semantic relatedness measures into Portuguese and performed a manual evaluation of each pair. We compared the correlation with similar datasets in other languages and generated LSA models from Wikipedia articles in order to verify the pertinence of each dataset and how semantic similarity conveys across languages.
Document type :
Conference papers
Complete list of metadatas

https://hal.archives-ouvertes.fr/hal-02089290
Contributor : Open Archive Toulouse Archive Ouverte (oatao) <>
Submitted on : Wednesday, April 3, 2019 - 3:56:38 PM
Last modification on : Friday, June 14, 2019 - 6:31:23 PM

File

leitzkegranada_22675.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-02089290, version 1
  • OATAO : 22675

Collections

Citation

Roger Leitzke Granada, Cassia Trojahn, Renata Vieira. Comparing Semantic Relatedness between Word Pairs in Portuguese Using Wikipedia. International Conference on Computational Processing of the Portuguese Language (PROPOR 2014), Oct 2014, Sao Carlos, Brazil. pp.170-175. ⟨hal-02089290⟩

Share

Metrics

Record views

4

Files downloads

6