Eisenmann, M, Reinke, A, Weru, V, Tizabi, MD, Isensee, F, Adler, TJ, Ali, S, Andrearczyk, V, Aubreville, M, Baid, U, Bakas, S, Balu, N, Bano, S, Bernal, J, Bodenstedt, S, Casella, A, Cheplygina, V, Daum, M, De Bruijne, M, Depeursinge, A, Dorent, R, Egger, J, Ellis, DG, Engelhardt, S, Ganz, M, Ghatwary, N, Girard, G, Godau, P, Gupta, A, Hansen, L, Harada, K, Heinrich, M, Heller, N, Hering, A, Huaulmé, A, Jannin, P, Kavur, AE, Kodym, O, Kozubek, M, Li, J, Li, H, Ma, J, Martín-Isla, C, Menze, B, Noble, A, Oreiller, V, Padoy, N, Pati, S, Payette, K, Rädsch, T, Rafael-Patiño, J, Bawa, VS, Speidel, S, Sudre, CH, Van Wijnen, K, Wagner, M, Wei, D, Yamlahi, A, Yap, MH ORCID: https://orcid.org/0000-0001-7681-4287, Yuan, C, Zenk, M, Zia, A, Zimmerer, D, Aydogan, D, Bhattarai, B, Bloch, L, Brüngel, R, Cho, J, Choi, C, Dou, Q, Ezhov, I, Friedrich, CM, Fuller, C, Gaire, RR, Galdran, A, García Faura, A, Grammatikopoulou, M, Hong, S, Jahanifar, M, Jang, I, Kadkhodamohammadi, A, Kang, I, Kofler, F, Kondo, S, Kuijf, H, Li, M, Luu, M, Martinčič, T, Morais, P, Naser, MA, Oliveira, B, Owen, D, Pang, S, Park, J, Park, S, Płotka, S, Puybareau, E, Rajpoot, N, Ryu, K and Saeed, N (2023) Why is the winner the best? In: 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 17 June 2023 - 24 June 2023, Vancouver, Canada.
|
Accepted Version
Available under License In Copyright. Download (5MB) | Preview |
Abstract
International benchmarking competitions have become fundamental for the comparative performance assessment of image analysis methods. However, little attention has been given to investigating what can be learnt from these competitions. Do they really generate scientific progress? What are common and successful participation strategies? What makes a solution superior to a competing method? To address this gap in the literature, we performed a multicenter study with all 80 competitions that were conducted in the scope of IEEE ISBI 2021 and MICCAI 2021. Statistical analyses performed based on comprehensive descriptions of the submitted algorithms linked to their rank as well as the underlying participation strategies revealed common characteristics of winning solutions. These typically include the use of multi-task learning (63%) and/or multi-stage pipelines (61%), and a focus on augmentation (100%), image preprocessing (97%), data curation (79%), and post-processing (66%). The 'typical' lead of a winning team is a computer scientist with a doctoral degree, five years of experience in biomedical image analysis, and four years of experience in deep learning. Two core general development strategies stood out for highly-ranked teams: the reflection of the metrics in the method design and the focus on analyzing and handling failure cases. According to the organizers, 43% of the winning algorithms exceeded the state of the art but only 11% completely solved the respective domain problem. The insights of our study could help researchers (1) improve algorithm development strategies when approaching new problems, and (2) focus on open research questions revealed by this work.
Impact and Reach
Statistics
Additional statistics for this dataset are available via IRStats2.