Discrete Box-Constrained Minimax Classifier for Uncertain and Imbalanced Class Proportions - HAL open archive
Preprint, Working Paper. Year: 2020

Discrete Box-Constrained Minimax Classifier for Uncertain and Imbalanced Class Proportions

Abstract

The goal of this paper is to build a supervised classifier that addresses the following difficulties, which commonly appear in safety-critical applications: imbalanced datasets, uncertain class proportions, dependencies between some features, the presence of both numeric and categorical features, and arbitrary loss functions provided by experts. Many works have shown that discretizing the numeric features is relevant for dealing with mixed attributes. Thus, we develop a novel minimax classifier algorithm, designed for processing discrete or discretized features, which addresses all the previously mentioned issues. The usual minimax criterion derives from the computation of the class proportions which maximize the empirical Bayes risk over the probability simplex. However, this criterion can be too pessimistic when the least favorable priors appear unrealistic or when the resulting risk of error becomes too high. In this case, under the assumption that the experts are able to provide independent bounds on some class proportions, our approach takes these constraints into account to decrease the risk of error. The resulting box-constrained minimax classifier appears as a trade-off between the discrete Bayes classifier and the usual minimax classifier.
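The idea in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm: it assumes only two classes, a small hand-written table of class-conditional probabilities over three discrete profiles, a 0-1 loss, and a plain grid search over the class-0 proportion in place of a proper optimizer. The names `bayes_risk`, `worst_case_prior`, `predict`, `p_hat` and `loss` are all hypothetical.

```python
import numpy as np

def bayes_risk(prior, p_hat, loss):
    """Minimum empirical Bayes risk for the given class proportions.

    p_hat[k, t]: estimated P(X = t | Y = k) over discrete profiles t.
    loss[k, l]:  cost of predicting class l when the true class is k.
    """
    # scores[l, t] = sum_k loss[k, l] * prior[k] * p_hat[k, t]
    scores = loss.T @ (prior[:, None] * p_hat)
    # The Bayes rule picks the cheapest decision for every profile.
    return scores.min(axis=0).sum()

def worst_case_prior(p_hat, loss, lo=0.0, hi=1.0, n_grid=1001):
    """Two-class grid search (an assumption of this sketch) for the
    class-0 proportion in [lo, hi] that maximizes the Bayes risk."""
    grid = np.linspace(lo, hi, n_grid)
    risks = np.array(
        [bayes_risk(np.array([p, 1.0 - p]), p_hat, loss) for p in grid]
    )
    best = int(risks.argmax())
    return np.array([grid[best], 1.0 - grid[best]]), float(risks[best])

def predict(prior, p_hat, loss):
    """Decision of the discrete Bayes rule for each profile, at `prior`."""
    scores = loss.T @ (prior[:, None] * p_hat)
    return scores.argmin(axis=0)

# Toy data (made up for illustration): 2 classes, 3 discrete profiles.
p_hat = np.array([[0.7, 0.2, 0.1],
                  [0.1, 0.3, 0.6]])
loss = np.array([[0.0, 1.0],
                 [1.0, 0.0]])  # 0-1 loss

# Usual minimax: least favorable prior over the whole simplex.
prior_full, risk_full = worst_case_prior(p_hat, loss)

# Box-constrained: an expert bounds the class-0 proportion to [0.1, 0.3].
prior_box, risk_box = worst_case_prior(p_hat, loss, lo=0.1, hi=0.3)

print(prior_full, risk_full)  # roughly [0.6 0.4] and 0.22
print(prior_box, risk_box)    # roughly [0.3 0.7] and 0.16
print(predict(prior_box, p_hat, loss))
```

On this toy table the worst-case risk drops once the search is restricted to the box, which is the trade-off the abstract describes: the box-constrained rule is less pessimistic than the usual minimax rule, while still guarding against every class proportion the experts consider plausible.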
Main file
Discrete Box-Constrained Minimax Classifier for Uncertain and Imbalanced Class Proportions.pdf (4.34 MB)
Origin: Files produced by the author(s)

Dates and versions

hal-02296592, version 1 (25-09-2019)
hal-02296592, version 2 (01-04-2020)
hal-02296592, version 3 (02-03-2021)

Identifiers

  • HAL Id: hal-02296592, version 2

Cite

Cyprien Gilet, Susana Barbosa, Lionel Fillatre. Discrete Box-Constrained Minimax Classifier for Uncertain and Imbalanced Class Proportions. 2020. ⟨hal-02296592v2⟩
540 views
215 downloads