Semi-supervised fuzzy c-means variants: a study on noisy label supervision

Abstract : Semi-supervised clustering algorithms aim at discovering the hidden structure of data sets with the help of expert knowledge, generally expressed as constraints on the data such as class labels or pairwise relations. Most of the time, the expert is considered as an oracle that only provides correct constraints. This paper focuses on the case where some label constraints are erroneous and proposes to investigate into more detail three semi-supervised fuzzy c-means clustering approaches as they have been tailored to naturally handle uncertainty in the expert labeling. In order to run a fair comparison between existing algorithms, formal improvements have been proposed to guarantee and fasten their convergence. Experiments conducted on real and artificial data sets under uncertain labels and noise in the constraints show the effectiveness of using fuzzy clustering algorithm for noisy semi-supervised clustering.
Document type :
Conference papers
Complete list of metadatas

Cited literature [18 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-02017485
Contributor : Violaine Antoine <>
Submitted on : Wednesday, February 13, 2019 - 11:06:11 AM
Last modification on : Monday, January 20, 2020 - 12:14:06 PM
Long-term archiving on: Tuesday, May 14, 2019 - 2:23:28 PM

File

ipmu18.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-02017485, version 1

Citation

Antoine Violaine, Nicolas Labroche. Semi-supervised fuzzy c-means variants: a study on noisy label supervision. IPMU, Jun 2018, Cadiz, Spain. pp.51-62. ⟨hal-02017485⟩

Share

Metrics

Record views

57

Files downloads

50