Natural vs Balanced Distribution in Deep Learning on Whole Slide Images for Cancer Detection - Real Expression Artificial Life Accéder directement au contenu
Communication Dans Un Congrès Année : 2021

Natural vs Balanced Distribution in Deep Learning on Whole Slide Images for Cancer Detection

Résumé

The class distribution of data is one of the factors that regulates the performance of machine learning models. However, investigations on the impact of different distributions available in the literature are very few, sometimes absent for domain-specific tasks. In this paper, we analyze the impact of natural and balanced distributions of the training set in deep learning (DL) models applied on histological images, also known as whole slide images (WSIs). WSIs are considered as the gold standard for cancer diagnosis. In recent years, researchers have turned their attention to DL models to automate and accelerate the diagnosis process. In the training of such DL models, filtering out the non-regions-of-interest from the WSIs and adopting an artificial distribution-usually a balanced distribution-is a common trend. In our analysis, we show that keeping the WSIs data in their usual distribution-which we call natural distribution-for DL training is better than the artificially obtained balanced distribution. We conduct an empirical comparative study with 10 random folds for each distribution, comparing the resulting average performance levels in terms of five different evaluation metrics. Experimental results show the effectiveness of the natural distribution over the balanced one across all the evaluation metrics. CCS CONCEPTS • Computing methodologies → Supervised learning; Image processing; Image segmentation; • Applied computing → Health informatics.
Fichier principal
Vignette du fichier
For_HAL_SAC_2021_Natural_vs_Balanced_Distribution_in_Deep_Learning_on_Whole_Slide_Images_for_Cancer_Detection (1).pdf (3.6 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-03057209 , version 1 (11-12-2020)

Identifiants

Citer

Ismat Ara Reshma, Sylvain Cussat-Blanc, Radu Tudor Ionescu, Hervé Luga, Josiane Mothe. Natural vs Balanced Distribution in Deep Learning on Whole Slide Images for Cancer Detection. 36th ACM/SIGAPP Symposium on Applied Computing (SAC 2021), Association for Computing Machinery (ACM) - Special Interest Group on Applied Computing (SIGAPP), Mar 2021, Virtual Event, South Korea. ⟨hal-03057209⟩
201 Consultations
49 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More