Document Re-ranking Based on Topic-Comment Structure - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2016

Document Re-ranking Based on Topic-Comment Structure

Résumé

This paper introduces a novel approach for document re-ranking in information retrieval based on topic-comment structure of texts. While most information retrieval models make the assumption that relevant documents are about the query and that aboutness can be captured considering bags of words only, we rather consider a more sophisticated analysis of discourse to capture document relevance by distinguishing the topic of a text from what is said about the topic (comment) in the text. The topic-comment structure of texts is extracted automatically from the first retrieved documents which are then re-ranked so that the top documents are the ones that share their topics with the query. The evaluation on TREC collections shows that the method significantly improves the retrieval performance.
Fichier principal
Vignette du fichier
ermakova_16965.pdf (289.98 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01530400 , version 1 (31-05-2017)

Identifiants

  • HAL Id : hal-01530400 , version 1
  • OATAO : 16965

Citer

Liana Ermakova, Josiane Mothe. Document Re-ranking Based on Topic-Comment Structure. 10th IEEE International Conference on Research Challenge in Information Science (RCIS 2016), Jun 2016, Grenoble, France. pp. 1-10. ⟨hal-01530400⟩
65 Consultations
190 Téléchargements

Partager

Gmail Facebook X LinkedIn More