An overview of textual semantic similarity measures based on web intelligence

Abstract : Computing the semantic similarity between terms (or short text expressions) that have the same meaning but which are not lexicographically similar is a key challenge in many computer related fields. The problem is that traditional approaches to semantic similarity measurement are not suitable for all situations, for example, many of them often fail to deal with terms not covered by synonym dictionaries or are not able to cope with acronyms, abbreviations, buzzwords, brand names, proper nouns, and so on. In this paper, we present and evaluate a collection of emerging techniques developed to avoid this problem. These techniques use some kinds of web intelligence to determine the degree of similarity between text expressions. These techniques implement a variety of paradigms including the study of co-occurrence, text snippet comparison, frequent pattern finding, or search log analysis. The goal is to substitute the traditional techniques where necessary.
Keywords : dissemin
Document type :
Journal articles
Complete list of metadatas

Cited literature [16 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-01630890
Contributor : Projet Dissemin <>
Submitted on : Wednesday, November 8, 2017 - 12:31:33 PM
Last modification on : Wednesday, October 10, 2018 - 5:00:03 PM
Long-term archiving on : Friday, February 9, 2018 - 12:39:20 PM

File

article.pdf
Files produced by the author(s)

Identifiers

Citation

Jorge Martinez-Gil. An overview of textual semantic similarity measures based on web intelligence. Artificial Intelligence Review, Springer Verlag, 2012, 42 (4), pp.935-943. ⟨10.1007/s10462-012-9349-8⟩. ⟨hal-01630890⟩

Share

Metrics

Record views

4991

Files downloads

5800