In: PLOS ONE
17(5), e0251194
[PDF, 614kB]
Abstract
Computational reproducibility is a corner stone for sound and credible research. Especially in complex statistical analyses—such as the analysis of longitudinal data—reproducing results is far from simple, especially if no source code is available. In this work we aimed to reproduce analyses of longitudinal data of 11 articles published in PLOS ONE. Inclusion criteria were the availability of data and author consent. We investigated the types of methods and software used and whether we were able to reproduce the data analysis using open source software. Most articles provided overview tables and simple visualisations. Generalised Estimating Equations (GEEs) were the most popular statistical models among the selected articles. Only one article used open source software and only one published part of the analysis code. Replication was difficult in most cases and required reverse engineering of results or contacting the authors. For three articles we were not able to reproduce the results, for another two only parts of them. For all but two articles we had to contact the authors to be able to reproduce the results. Our main learning is that reproducing papers is difficult if no code is supplied and leads to a high burden for those conducting the reproductions. Open data policies in journals are good, but to truly boost reproducibility we suggest adding open code policies.
Item Type: | Journal article |
---|---|
Faculties: | Mathematics, Computer Science and Statistics > Statistics Medicine > Institute for Medical Information Processing, Biometry and Epidemiology |
Subjects: | 300 Social sciences > 310 Statistics |
URN: | urn:nbn:de:bvb:19-epub-93100-3 |
ISSN: | 1932-6203 |
Annotation: | Correction: https://doi.org/10.1371/journal.pone.0269047 |
Language: | English |
Item ID: | 93100 |
Date Deposited: | 26. Aug 2022, 05:29 |
Last Modified: | 05. Jan 2024, 14:34 |