Combining Vector Space Model and Multi Word Term Extraction for Semantic Query Refinement.

Eric Sanjuan 1, * Fidelia Ibekwe-Sanjuan 2 Juan Manuel Torres-Moreno 1 Patricia Velazquez-Morales 1
* Corresponding author
2 ELICO Lyon 3
ELICO - Equipe de recherche de Lyon en sciences de l'information et de la communication
Abstract : Inthispaper,wetargetdocumentrankinginahighlytechni- cal field with the aim to approximate a ranking that is obtained through an existing ontology (knowledge structure). We test and combine sym- bolic and vector space models (VSM). Our symbolic approach relies on shallow NLP and on internal linguistic relations between Multi-Word Terms (MWTs). Documents are ranked based on different semantic rela- tions they share with the query terms, either directly or indirectly after clustering the MWTs using the identified lexico-semantic relations. The VSM approach consisted in ranking documents with different functions ranging from the classical tf.idf to more elaborate similarity functions. Results shows that the ranking obtained by the symbolic approach per- forms better on most queries than the vector space model. However, the ranking obtained by combining both approaches outperforms by a wide margin the results obtained by methods from each approach.
Document type :
Conference papers
Zoubida Kedad et al. 12th International Conference on Applications of Natural Language to Information systems (NLDB 2007)., Jun 2007, Paris, France. Springer, 4592/2007, pp.252-263, 2007, Lecture Notes in Computer Science. <10.1007/978-3-540-73351-5>


https://hal.archives-ouvertes.fr/hal-00636105
Contributor : Fidelia Ibekwe-Sanjuan <>
Submitted on : Wednesday, November 2, 2011 - 7:32:25 PM
Last modification on : Tuesday, February 3, 2015 - 3:36:47 PM

File

NLDB-07-last-version.pdf
fileSource_public_author

Identifiers

Collections

Citation

Eric Sanjuan, Fidelia Ibekwe-Sanjuan, Juan Manuel Torres-Moreno, Patricia Velazquez-Morales. Combining Vector Space Model and Multi Word Term Extraction for Semantic Query Refinement.. Zoubida Kedad et al. 12th International Conference on Applications of Natural Language to Information systems (NLDB 2007)., Jun 2007, Paris, France. Springer, 4592/2007, pp.252-263, 2007, Lecture Notes in Computer Science. <10.1007/978-3-540-73351-5>. <hal-00636105>

Export

Share

Metrics

Consultation de
la notice

232

Téléchargement du document

55