IRISA Participation in JRS 2012 Data-Mining Challenge: Lazy-Learning with Vectorization

Vincent Claveau 1
1 TEXMEX - Multimedia content-based indexing
IRISA - Institut de Recherche en Informatique et Systèmes Aléatoires, Inria Rennes – Bretagne Atlantique
Abstract : In this article, we report on our participation in the JRS Data-Mining Challenge. The approach used by our system is a lazy- learning one, based on a simple k-nearest-neighbors technique. We more specifically addressed this challenge as an opportunity to test Informa- tion Retrieval (IR) inspired techniques in such a data-mining framework. In particular, we tested different similarity measures, including one called vectorization that we have proposed and tested in IR and Natural Lan- guage Processing frameworks. The resulting system is simple and efficient while offering good performance.
Complete list of metadatas

Cited literature [17 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-00760145
Contributor : Vincent Claveau <>
Submitted on : Monday, December 3, 2012 - 3:15:50 PM
Last modification on : Friday, November 16, 2018 - 1:25:10 AM
Long-term archiving on : Monday, March 4, 2013 - 3:50:26 AM

File

Claveau_JRS12_challenge.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-00760145, version 1

Citation

Vincent Claveau. IRISA Participation in JRS 2012 Data-Mining Challenge: Lazy-Learning with Vectorization. JRS - Data Mining Competition: Topical Classification of Biomedical Research Papers, special event of Joint Rough Sets Symposium, Sep 2012, Chengdu, China. ⟨hal-00760145⟩

Share

Metrics

Record views

525

Files downloads

198