Credal Clustering for Imbalanced Data

Zuowei Zhang; Zhunga Liu; Kuang Zhou; Arnaud Martin; Yiru Zhang

doi:10.1007/978-3-030-88601-1_2

Communication Dans Un Congrès Année : 2021

Credal Clustering for Imbalanced Data

(1, 2) , (1) , (1) , (2) , (3)

1
2
3

Zuowei Zhang

Fonction : Auteur
PersonId : 1115453

Northwestern Polytechnical University [Xi'an]

Declarative & Reliable management of Uncertain, user-generated Interlinked Data

Zhunga Liu

Fonction : Auteur
PersonId : 1115454

Northwestern Polytechnical University [Xi'an]

Kuang Zhou

Fonction : Auteur
PersonId : 772938
IdRef : 196429951

Northwestern Polytechnical University [Xi'an]

Arnaud Martin

Fonction : Auteur
PersonId : 1795
IdHAL : arnaud-martin
ORCID : 0000-0003-0882-0153
IdRef : 167578405

Declarative & Reliable management of Uncertain, user-generated Interlinked Data

Yiru Zhang

Fonction : Auteur

St. Francis Xavier University

Résumé

Traditional evidential clustering tends to build clusters where the number of data for each cluster fairly close to each other. However, it may not be suitable for imbalanced data. This paper proposes a new method, called credal clustering (CClu), to deal with imbalanced data based on the theory of belief functions. Consider a dataset with C wanted classes, the credal c-means (CCM) clustering method is employed at first to divide the dataset into some (i.e., S (S > C)) clusters. Then these clusters are gradually merged following a given principle based on the density of meta-clusters and the associated singleton clusters. The merging is finished when C singleton wanted classes are obtained. During this merging procedure, the objects in each singleton cluster will be assigned to one new singleton class. Moreover, a weighted mean vector rule is developed to classify the objects in the unmerged meta-cluster to the associated new classes using the K-Nearest neighbor technique. Two experiments show that CClu can handle imbalanced datasets with high accuracy, and the errors are reduced by properly modeling imprecision.

Mots clés

Evidential clustering belief functions imbalanced data credal c-means K-NN

Domaines

Informatique [cs]

Zuowei Zhang : Connectez-vous pour contacter le contributeur

https://hal.science/hal-03394857

Soumis le : vendredi 22 octobre 2021-11:28:08

Dernière modification le : mardi 12 décembre 2023-09:55:36

Dates et versions

hal-03394857 , version 1 (22-10-2021)

Identifiants

HAL Id : hal-03394857 , version 1
DOI : 10.1007/978-3-030-88601-1_2

Citer

Zuowei Zhang, Zhunga Liu, Kuang Zhou, Arnaud Martin, Yiru Zhang. Credal Clustering for Imbalanced Data. 6th International Conference on Belief Functions, Oct 2021, Shanghai, China. pp.13-21, ⟨10.1007/978-3-030-88601-1_2⟩. ⟨hal-03394857⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-RENNES1 CNRS INRIA INSA-RENNES IRISA CENTRALESUPELEC UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES UR1-MATH-NUM CYBERSCHOOL

31 Consultations

0 Téléchargements

Credal Clustering for Imbalanced Data

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager