Over-optimistic evaluation and reporting of novel cluster algorithms: an illustrative study

www.lmu.de | UB | Blättern | Hilfe

Zur erweiterten Suche

English

Zur erweiterten Suche

Ullmann, Theresa ORCID: https://orcid.org/0000-0003-1215-8561; Beer, Anna ORCID: https://orcid.org/0000-0002-6890-997X; Hünemörder, Maximilian ORCID: https://orcid.org/0000-0001-9848-3714; Seidl, Thomas und Boulesteix, Anne-Laure ORCID: https://orcid.org/0000-0002-2729-0947 (2022): Over-optimistic evaluation and reporting of novel cluster algorithms: an illustrative study. In: Advances in Data Analysis and Classification, Bd. 17, Nr. 1: S. 211-238 [PDF, 856kB]

Vorschau

Creative Commons: Namensnennung 4.0 (CC-BY)

DOI: 10.1007/s11634-022-00496-5

Abstract

When researchers publish new cluster algorithms, they usually demonstrate the strengths of their novel approaches by comparing the algorithms' performance with existing competitors. However, such studies are likely to be optimistically biased towards the new algorithms, as the authors have a vested interest in presenting their method as favorably as possible in order to increase their chances of getting published. Therefore, the superior performance of newly introduced cluster algorithms is over-optimistic and might not be confirmed in independent benchmark studies performed by neutral and unbiased authors. This problem is known among many researchers, but so far, the different mechanisms leading to over-optimism in cluster algorithm evaluation have never been systematically studied and discussed. Researchers are thus often not aware of the full extent of the problem. We present an illustrative study to illuminate the mechanisms by which authors-consciously or unconsciously-paint their cluster algorithm's performance in an over-optimistic light. Using the recently published cluster algorithm Rock as an example, we demonstrate how optimization of the used datasets or data characteristics, of the algorithm's parameters and of the choice of the competing cluster algorithms leads to Rock's performance appearing better than it actually is. Our study is thus a cautionary tale that illustrates how easy it can be for researchers to claim apparent superiority of a new cluster algorithm. This illuminates the vital importance of strategies for avoiding the problems of over-optimism (such as, e.g., neutral benchmark studies), which we also discuss in the article.

Dokumententyp:	Zeitschriftenartikel
Fakultät:	Medizin
Themengebiete:	600 Technik, Medizin, angewandte Wissenschaften > 610 Medizin und Gesundheit
URN:	urn:nbn:de:bvb:19-epub-106440-7
ISSN:	1862-5347
Sprache:	Englisch
Dokumenten ID:	106440
Datum der Veröffentlichung auf Open Access LMU:	11. Sep. 2023 13:38
Letzte Änderungen:	20. Sep. 2023 10:03
DFG:	Gefördert durch die Deutsche Forschungsgemeinschaft (DFG) - 491502892

Dokument bearbeiten