Using Word Embeddings to Retrieve Semantically Similar Questions in Community Question Answering - Archive ouverte HAL Accéder directement au contenu
Article Dans Une Revue Journal of International Science and General Applications Année : 2018

Using Word Embeddings to Retrieve Semantically Similar Questions in Community Question Answering

Résumé

This paper focuses on question retrieval which is a crucial and tricky task in Community Question Answering (cQA). Question retrieval aims at finding historical questions that are semantically equivalent to the queried ones, assuming that the answers to the similar questions should also answer the new ones. The major challenges are the lexical gap problem as well as the verboseness in natural language. Most existing methods measure the similarity between questions based on the bag-of-words (BOWs) representation capturing no semantics between words. In this paper, we rely on word embeddings and TF-IDF for a meaningful vector representation of the questions. The similarity between questions is measured using cosine similarity based on their vector-based word representations. Experiments carried out on a real world data set from Yahoo! Answers show that our method is competetive.
Fichier principal
Vignette du fichier
NOUHA.pdf (596.44 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01873748 , version 1 (13-09-2018)

Identifiants

  • HAL Id : hal-01873748 , version 1

Citer

Nouha Othman, Rim Faiz, Kamel Smaïli. Using Word Embeddings to Retrieve Semantically Similar Questions in Community Question Answering. Journal of International Science and General Applications, 2018, 1 (1). ⟨hal-01873748⟩
111 Consultations
798 Téléchargements

Partager

Gmail Facebook X LinkedIn More