Predicting the relevance of distributional semantic similarity with contextual information

Abstract : Using distributional analysis methods to compute semantic proximity links between words has become commonplace in NLP. The resulting relations are often noisy or difficult to interpret in general. This paper focuses on the issues of evaluating a distributional resource and filtering the relations it contains, but instead of considering it in abstracto, we focus on pairs of words in context. In a discourse, we are interested in knowing if the semantic link between two items is a byproduct of textual coherence or is irrelevant. We first set up a human annotation of semantic links with or without contextual information to show the importance of the textual context in evaluating the relevance of semantic similarity, and to assess the prevalence of actual semantic relations between word tokens. We then built an experiment to automatically predict this relevance, evaluated on the reliable reference data set which was the outcome of the first annotation. We show that in-document information greatly improve the prediction made by the similarity level alone.
Complete list of metadatas

Cited literature [35 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-01056292
Contributor : Cécile Fabre <>
Submitted on : Monday, August 18, 2014 - 4:02:08 PM
Last modification on : Thursday, October 17, 2019 - 8:52:06 AM
Long-term archiving on : Thursday, November 27, 2014 - 5:30:16 AM

File

Muller_et_al_predicting_ACL201...
Publisher files allowed on an open archive

Identifiers

  • HAL Id : hal-01056292, version 1

Collections

Citation

Philippe Muller, Cécile Fabre, Clémentine Adam. Predicting the relevance of distributional semantic similarity with contextual information. 52nd Annual Meeting of the Association for Computational Linguistics - ACL 2014, Jun 2014, Baltimore, United States. pp.479-488. ⟨hal-01056292⟩

Share

Metrics

Record views

440

Files downloads

486