Semi-automatic data annotation based on feature-space projection and local quality metrics: An application to cerebral emboli characterization - Centre de Recherche en Acquisition et Traitement de l'Image pour la Santé Accéder directement au contenu
Article Dans Une Revue Medical Image Analysis Année : 2022

Semi-automatic data annotation based on feature-space projection and local quality metrics: An application to cerebral emboli characterization

Résumé

We propose a semi-supervised learning approach to annotate a dataset with reduced requirements for manual annotation and with controlled annotation error. The method is based on feature-space projection and label propagation using local quality metrics. First, an auto-encoder extracts the features of the samples in an unsupervised manner. Then, the extracted features are projected by a t-distributed stochastic neighbor embedding algorithm into a two-dimensional (2D) space. A selection of the best 2D projection is introduced based on the silhouette score. The expert annotator uses the obtained 2D representation to manually label samples. Finally, the labels of the labeled samples are propagated to the unlabeled samples using a K-nearest neighbor strategy and local quality metrics. We compare our method against semi-supervised optimum-path forest and K-nearest neighbor label propagation (without considering local quality metrics). Our method achieves state-of-the-art results on three different datasets by labeling more than 96% of the samples with an annotation error from 7% to 17%. Additionally, our method allows to control the trade-off between annotation error and number of labeled samples. Moreover, we combine our method with robust loss functions to compensate for the label noise introduced by automatic label propagation. Our method allows to achieve similar, and even better, classification performances compared to those obtained using a fully manually labeled dataset, with up to 6% in terms of classification accuracy.
Fichier principal
Vignette du fichier
Vindas-22_Semi-automatic data annotation based on feature-space projection - Semi_supervised_Data_Annotation_Hal.pdf (30.21 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-03872997 , version 2 (26-11-2022)
hal-03872997 , version 1 (19-07-2023)

Identifiants

Citer

Yamil Vindas, Blaise Kévin Guépié, Marilys Almar, Emmanuel Roux, Philippe Delachartre. Semi-automatic data annotation based on feature-space projection and local quality metrics: An application to cerebral emboli characterization. Medical Image Analysis, 2022, 79, pp.102437. ⟨10.1016/j.media.2022.102437⟩. ⟨hal-03872997v1⟩
99 Consultations
45 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More