Improving the Community Question Retrieval Performance Using Attention-based Siamese LSTM - HAL open archive
Conference paper, Year: 2020

Improving the Community Question Retrieval Performance Using Attention-based Siamese LSTM

Abstract

In this paper, we focus on the problem of question retrieval in community Question Answering (cQA), which aims to retrieve from the community archives the previous questions that are semantically equivalent to a new query. The major challenges in this task are the shortness of the questions and the word mismatch problem, since users can formulate the same query using different wording. While numerous attempts have been made to address this problem, most existing methods rely on supervised models that depend heavily on large training data sets and manual feature engineering. Such methods are also limited in that they disregard word order and ignore syntactic and semantic relationships. In this work, we rely on Neural Networks (NNs), which can learn rich dense representations of text data and enable the prediction of the textual similarity between community questions. We propose a deep learning approach based on a Siamese architecture with LSTM networks, augmented with an attention mechanism. We test different similarity measures to predict the semantic similarity between the community questions. Experiments conducted on real cQA data sets in English and Arabic show that question retrieval performance improves compared with other competitive methods.
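To make the architecture described in the abstract more concrete, here is a minimal, dependency-free sketch of the general idea: one LSTM encoder with attention pooling, shared by both questions (the Siamese part), followed by a similarity measure on the two encodings. It is not the authors' implementation. The abstract does not give the layer sizes, the form of the attention, or which similarity measures were tested, so the dimensions, the dot-product attention, the Manhattan and cosine measures, the `toy_embedding` helper and all other names below are assumptions made for illustration. The weights are random and untrained, so the printed scores carry no semantic meaning.

```python
import math
import random

EMBED_DIM, HIDDEN_DIM = 8, 6  # illustrative sizes, not taken from the paper

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def matvec(matrix, vec):
    return [sum(m * x for m, x in zip(row, vec)) for row in matrix]

def toy_embedding(word, dim=EMBED_DIM):
    # Hypothetical stand-in for pretrained word embeddings: a fixed
    # pseudo-random vector per word, seeded by the word itself.
    rng = random.Random(word)
    return [rng.uniform(-1.0, 1.0) for _ in range(dim)]

class AttentiveLSTMEncoder:
    """A single encoder; using it for both questions makes the model Siamese."""

    def __init__(self, embed_dim=EMBED_DIM, hidden_dim=HIDDEN_DIM, seed=0):
        rng = random.Random(seed)
        self.hidden_dim = hidden_dim
        in_dim = embed_dim + hidden_dim
        # One matrix per gate (input, forget, output, candidate), applied to
        # the concatenation [x_t ; h_{t-1}]. Biases are omitted for brevity.
        self.W = {gate: [[rng.uniform(-0.5, 0.5) for _ in range(in_dim)]
                         for _ in range(hidden_dim)] for gate in "ifoc"}
        # Attention scoring vector (assumed dot-product attention).
        self.att = [rng.uniform(-0.5, 0.5) for _ in range(hidden_dim)]

    def hidden_states(self, embeddings):
        h = [0.0] * self.hidden_dim
        c = [0.0] * self.hidden_dim
        states = []
        for x in embeddings:
            z = x + h  # list concatenation of input and previous hidden state
            i = [sigmoid(v) for v in matvec(self.W["i"], z)]
            f = [sigmoid(v) for v in matvec(self.W["f"], z)]
            o = [sigmoid(v) for v in matvec(self.W["o"], z)]
            cand = [math.tanh(v) for v in matvec(self.W["c"], z)]
            c = [fk * ck + ik * gk for fk, ck, ik, gk in zip(f, c, i, cand)]
            h = [ok * math.tanh(ck) for ok, ck in zip(o, c)]
            states.append(h)
        return states

    def attention_weights(self, states):
        # Softmax over one scalar score per time step.
        scores = [sum(a * s for a, s in zip(self.att, h)) for h in states]
        top = max(scores)
        exps = [math.exp(s - top) for s in scores]
        total = sum(exps)
        return [e / total for e in exps]

    def encode(self, embeddings):
        # Attention-weighted sum of the hidden states (needs >= 1 token).
        states = self.hidden_states(embeddings)
        weights = self.attention_weights(states)
        return [sum(w * h[k] for w, h in zip(weights, states))
                for k in range(self.hidden_dim)]

def manhattan_similarity(u, v):
    # exp(-L1 distance), which lies in (0, 1] and equals 1 for identical vectors.
    return math.exp(-sum(abs(a - b) for a, b in zip(u, v)))

def cosine_similarity(u, v):
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0

def encode_question(encoder, question):
    tokens = question.lower().split()
    return encoder.encode([toy_embedding(t) for t in tokens])

encoder = AttentiveLSTMEncoder()  # the same weights encode both questions
new_q = encode_question(encoder, "how do I reset my password")
old_q = encode_question(encoder, "what is the way to change a forgotten password")
print("manhattan:", manhattan_similarity(new_q, old_q))
print("cosine:", cosine_similarity(new_q, old_q))
```

In a retrieval setting, a new query would be scored this way against each archived question and the archive ranked by the resulting similarity. A trained model would learn the encoder and attention weights from labelled question pairs, which this sketch does not attempt.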
Main file
NLDB2020_CQA.pdf (446.98 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-02867309, version 1 (14-06-2020)

Identifiers

  • HAL Id: hal-02867309, version 1

Cite

Nouha Othman, Rim Faiz, Kamel Smaïli. Improving the Community Question Retrieval Performance Using Attention-based Siamese LSTM. NLDB2020: 25th International Conference on Natural Language & Information Systems, Jun 2020, Saarbrücken, Germany. ⟨hal-02867309⟩
73 views
104 downloads
