Steffen, Andreas; Apostolakis, Joannis:
On the ease of predicting the thermodynamic properties of beta-cyclodextrin inclusion complexes.
In: Chemistry Central Journal
Background: In this study we investigated the predictability of three thermodynamic quantities related to complex formation. As a model system we chose the host-guest complexes of beta-cyclodextrin (beta-CD) with different guest molecules. A training dataset comprised of 176 beta-CD guest molecules with experimentally determined thermodynamic quantities was taken from the literature. We compared the performance of three different statistical regression methods principal component regression (PCR), partial least squares regression (PLSR), and support vector machine regression combined with forward feature selection (SVMR/FSS)-with respect to their ability to generate predictive quantitative structure property relationship (QSPR) models for Delta G degrees, Delta H degrees and Delta S degrees on the basis of computed molecular descriptors. Results: We found that SVMR/FFS marginally outperforms PLSR and PCR in the prediction of Delta G degrees, with PLSR performing slightly better than PCR. PLSR and PCR proved to be more stable in a nested cross-validation protocol. Whereas Delta G degrees can be predicted in good agreement with experimental values, none of the methods led to comparably good predictive models for Delta H degrees. In using the methods outlined in this study, we found that Delta S degrees appears almost unpredictable. In order to understand the differences in the ease of predicting the quantities, we performed a detailed analysis. As a result we can show that free energies are less sensitive (than enthalpy or entropy) to the small structural variations of guest molecules. This property, as well as the lower sensitivity of Delta G degrees to experimental conditions, are possible explanations for its greater predictability. Conclusion: This study shows that the ease of predicting Delta G degrees cannot be explained by the predictability of either Delta H degrees or Delta S degrees Our analysis suggests that the poor predictability of T Delta S degrees and, to a lesser extent, Delta H degrees has to do with a stronger dependence of these quantities on the structural details of the complex and only to a lesser extent on experimental error.