Combining Vector Space Model and Multi Word Term Extraction for Semantic Query Refinement.

Abstract : Inthispaper,wetargetdocumentrankinginahighlytechni- cal field with the aim to approximate a ranking that is obtained through an existing ontology (knowledge structure). We test and combine sym- bolic and vector space models (VSM). Our symbolic approach relies on shallow NLP and on internal linguistic relations between Multi-Word Terms (MWTs). Documents are ranked based on different semantic rela- tions they share with the query terms, either directly or indirectly after clustering the MWTs using the identified lexico-semantic relations. The VSM approach consisted in ranking documents with different functions ranging from the classical tf.idf to more elaborate similarity functions. Results shows that the ranking obtained by the symbolic approach per- forms better on most queries than the vector space model. However, the ranking obtained by combining both approaches outperforms by a wide margin the results obtained by methods from each approach.
Document type :
Conference papers
Zoubida Kedad et al. 12th International Conference on Applications of Natural Language to Information systems (NLDB 2007)., Jun 2007, Paris, France. Springer, 4592/2007, pp.252-263, 2007, Lecture Notes in Computer Science. <10.1007/978-3-540-73351-5>


https://hal.archives-ouvertes.fr/hal-00636105
Contributor : Fidelia Ibekwe-Sanjuan <>
Submitted on : Wednesday, November 2, 2011 - 7:32:25 PM
Last modification on : Wednesday, March 23, 2016 - 9:48:34 AM
Document(s) archivé(s) le : Friday, February 3, 2012 - 2:21:08 AM

File

NLDB-07-last-version.pdf
Files produced by the author(s)

Identifiers

Collections

Citation

Eric Sanjuan, Fidelia Ibekwe-Sanjuan, Torres-Moreno Juan-Manuel, Patricia Velazquez-Morales. Combining Vector Space Model and Multi Word Term Extraction for Semantic Query Refinement.. Zoubida Kedad et al. 12th International Conference on Applications of Natural Language to Information systems (NLDB 2007)., Jun 2007, Paris, France. Springer, 4592/2007, pp.252-263, 2007, Lecture Notes in Computer Science. <10.1007/978-3-540-73351-5>. <hal-00636105>

Export

Share

Metrics

Record views

268

Document downloads

117