Supervised classification of categorical data with uncertain labels for DNA barcoding

Abstract : In the supervised classification framework, the human supervision is required for labeling a set of learning data which are then used for building the classifier. However, in many applications, the human supervision is either imprecise, difficult or expensive and this gives rise to non robust classifiers. An interesting application where this situation occurs is DNA barcoding which aims to develop a standard tool to identify species with no or limited recourse to taxonomic expertise. In some cases, the morphological features describing the reference sample may be misleading and the taxonomists attribute labels incorrectly. This work presents a robust supervised classification method for categorical data based on a multivariate multinomial mixture model. The proposed method is applied to DNA barcoding and compared to classical methods on a real dataset.
Document type :
Conference papers
ESANN 2009 - 11th European Symposium on Artificial Neural Networks, Apr 2009, Bruges, Belgium. d-side publications, pp.29-34, 2009
Liste complète des métadonnées

https://hal.archives-ouvertes.fr/hal-00407834
Contributor : Charles Bouveyron <>
Submitted on : Monday, July 27, 2009 - 5:22:34 PM
Last modification on : Tuesday, May 6, 2014 - 12:42:26 PM

Identifiers

  • HAL Id : hal-00407834, version 1

Collections

Citation

Charles Bouveyron, Stephane Girard, Madalina Olteanu. Supervised classification of categorical data with uncertain labels for DNA barcoding. ESANN 2009 - 11th European Symposium on Artificial Neural Networks, Apr 2009, Bruges, Belgium. d-side publications, pp.29-34, 2009. <hal-00407834>

Share

Metrics

Consultations de la notice

304