In: PLOS Computational Biology
20(3), e1011936
[PDF, 652kB]
Abstract
Throughout their education and when reading the scientific literature, students may get the impression that there is a unique and correct analysis strategy for every data analysis task and that this analysis strategy will always yield a significant and noteworthy result. This expectation conflicts with a growing realization that there is a multiplicity of possible analysis strategies in empirical research, which will lead to overoptimism and nonreplicable research findings if it is combined with result-dependent selective reporting. Here, we argue that students are often ill-equipped for real-world data analysis tasks and unprepared for the dangers of selectively reporting the most promising results. We present a seminar course intended for advanced undergraduates and beginning graduate students of data analysis fields such as statistics, data science, or bioinformatics that aims to increase the awareness of uncertain choices in the analysis of empirical data and present ways to deal with these choices through theoretical modules and practical hands-on sessions.
Dokumententyp: | Zeitschriftenartikel |
---|---|
Fakultät: | Mathematik, Informatik und Statistik > Statistik
Medizin > Institut für Medizinische Informationsverarbeitung, Biometrie und Epidemiologie |
Themengebiete: | 600 Technik, Medizin, angewandte Wissenschaften > 610 Medizin und Gesundheit |
URN: | urn:nbn:de:bvb:19-epub-115671-0 |
ISSN: | 1553-7358 |
Sprache: | Englisch |
Dokumenten ID: | 115671 |
Datum der Veröffentlichung auf Open Access LMU: | 17. Apr. 2024, 13:01 |
Letzte Änderungen: | 17. Apr. 2024, 13:01 |