Tailored Aggregation for Classification

Tristan Mary-Huard; Stephane Robin

doi:10.1109/TPAMI.2009.55

Article Dans Une Revue IEEE Transactions on Pattern Analysis and Machine Intelligence Année : 2009

Tailored Aggregation for Classification

(1) , (1)

Tristan Mary-Huard

Fonction : Auteur
PersonId : 748716
IdHAL : tristanmary-huard
ORCID : 0000-0002-3839-9067
IdRef : 22754093X

Mathématiques et Informatique Appliquées

Stephane Robin

Fonction : Auteur
PersonId : 15469
IdHAL : scjrobin
ORCID : 0000-0003-1045-069X
IdRef : 052503720

Mathématiques et Informatique Appliquées

Résumé

Compression and variable selection are two classical strategies to deal with large-dimension data sets in classification. We propose an alternative strategy, called aggregation, which consists of a clustering step of redundant variables and a compression step within each group. We develop a statistical framework to define tailored aggregation methods that can be combined with selection methods to build reliable classifiers that benefit from the information contained in redundant variables. Two algorithms are proposed for ordered and nonordered variables, respectively. Applications to the kNNand CART algorithms are presented.

Mots clés

probabilistic approach statistical analysis large dimension redundancy aggregation very large databases classification approche probabiliste grande dimension redondance base donnée très grande

Domaines

Intelligence artificielle [cs.AI]

Archive Ouverte ProdInra : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01197577

Soumis le : vendredi 11 septembre 2015-20:06:56

Dernière modification le : mardi 12 mars 2024-10:44:19

Dates et versions

hal-01197577 , version 1 (11-09-2015)

Identifiants

HAL Id : hal-01197577 , version 1
DOI : 10.1109/TPAMI.2009.55
PRODINRA : 51694
WOS : 000269767600015

Citer

Tristan Mary-Huard, Stephane Robin. Tailored Aggregation for Classification. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2009, 31 (11), pp.2098-2105. ⟨10.1109/TPAMI.2009.55⟩. ⟨hal-01197577⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

AGROPARISTECH INRA MIA-PARIS INRAE MATHNUM

41 Consultations

0 Téléchargements

Tailored Aggregation for Classification

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager