Semantic similarity measurement using historical google search patterns - Archive ouverte HAL Accéder directement au contenu
Article Dans Une Revue Information Systems Frontiers Année : 2013

Semantic similarity measurement using historical google search patterns

José F. Aldana Montes
  • Fonction : Auteur

Résumé

Computing the semantic similarity between terms (or short text expressions) that have the same meaning but which are not lexicographically similar is an important challenge in the information integration field. The problem is that techniques for textual semantic similarity measurement often fail to deal with words not covered by synonym dictionaries. In this paper, we try to solve this problem by determining the semantic similarity for terms using the knowledge inherent in the search history logs from the Google search engine. To do this, we have designed and evaluated four algorithmic methods for measuring the semantic similarity between terms using their associated history search patterns. These algorithmic methods are: a) frequent co-occurrence of terms in search patterns, b) computation of the relationship between search patterns, c) outlier coincidence on search patterns, and d) forecasting comparisons. We have shown experimentally that some of these methods correlate well with respect to human judgment when evaluating general purpose benchmark datasets, and significantly outperform existing methods when evaluating datasets containing terms that do not usually appear in dictionaries.
Fichier principal
Vignette du fichier
article.pdf (324.46 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01628399 , version 1 (03-11-2017)

Identifiants

Citer

Jorge Martinez-Gil, José F. Aldana Montes. Semantic similarity measurement using historical google search patterns. Information Systems Frontiers, 2013, 15 (3), pp.399-410. ⟨10.1007/s10796-012-9404-7⟩. ⟨hal-01628399⟩
2142 Consultations
262 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More