Why rankings of biomedical image analysis competitions should be interpreted with care - HAL open archive
Journal article in Nature Communications, Year: 2018

Why rankings of biomedical image analysis competitions should be interpreted with care

Abstract

International challenges have become the standard for validation of biomedical image analysis methods. Given their scientific impact, it is surprising that a critical analysis of common practices related to the organization of challenges has not yet been performed. In this paper, we present a comprehensive analysis of biomedical image analysis challenges conducted up to now. We demonstrate the importance of challenges and show that the lack of quality control has critical consequences. First, reproducibility and interpretation of the results are often hampered as only a fraction of relevant information is typically provided. Second, the rank of an algorithm is generally not robust to a number of variables such as the test data used for validation, the ranking scheme applied and the observers that make the reference annotations. To overcome these problems, we recommend best practice guidelines and define open research questions to be addressed in the future.
Main file
s41467-018-07619-7.pdf (1.09 MB)
CORRECTION-Maier-Hein-why ranking.pdf (566.58 KB)
Origin: Publisher files allowed on an open archive

Dates and versions

hal-01958848, version 1 (12-07-2019)

License

Attribution

Cite

Lena Maier-Hein, Matthias Eisenmann, Annika Reinke, Sinan Onogur, Marko Stankovic, et al. Why rankings of biomedical image analysis competitions should be interpreted with care. Nature Communications, 2018, 9 (1), pp.5217. ⟨10.1038/s41467-018-07619-7⟩. ⟨hal-01958848⟩
309 views
192 downloads