Supervised classification of categorical data with uncertain labels for DNA barcoding - Archive ouverte HAL Access content directly
Conference Papers Year : 2009

Supervised classification of categorical data with uncertain labels for DNA barcoding

Abstract

In the supervised classification framework, the human supervision is required for labeling a set of learning data which are then used for building the classifier. However, in many applications, the human supervision is either imprecise, difficult or expensive and this gives rise to non robust classifiers. An interesting application where this situation occurs is DNA barcoding which aims to develop a standard tool to identify species with no or limited recourse to taxonomic expertise. In some cases, the morphological features describing the reference sample may be misleading and the taxonomists attribute labels incorrectly. This work presents a robust supervised classification method for categorical data based on a multivariate multinomial mixture model. The proposed method is applied to DNA barcoding and compared to classical methods on a real dataset.
No file

Dates and versions

hal-00407834 , version 1 (27-07-2009)

Identifiers

  • HAL Id : hal-00407834 , version 1

Cite

Charles Bouveyron, Stéphane Girard, Madalina Olteanu. Supervised classification of categorical data with uncertain labels for DNA barcoding. ESANN 2009 - 11th European Symposium on Artificial Neural Networks, Apr 2009, Bruges, Belgium. pp.29-34. ⟨hal-00407834⟩
189 View
0 Download

Share

Gmail Facebook X LinkedIn More