Logo Logo
Switch Language to German
Bin, Riccardo de; Boulesteix, Anne-Laure; Benner, Axel; Becker, Natalia; Sauerbrei, Willi (2019): Combining clinical and molecular data in regression prediction models: insights from a simulation study. In: Briefings in Bioinformatics
Full text not available from 'Open Access LMU'.


Data integration, i.e. the use of different sources of information for data analysis, is becoming one of the most important topics in modern statistics. Especially in, but not limited to, biomedical applications, a relevant issue is the combination of low-dimensional (e.g. clinical data) and high-dimensional (e.g. molecular data such as gene expressions) data sources in a prediction model. Not only the different characteristics of the data, but also the complex correlation structure within and between the two data sources, pose challenging issues. In this paper, we investigate these issues via simulations, providing some useful insight into strategies to combine low- and high-dimensional data in a regression prediction model. In particular, we focus on the effect of the correlation structure on the results, while accounting for the influence of our specific choices in the design of the simulation study.