Predicting melting points of organic molecules : applications to aqueous solubility prediction using the General Solubility Equation
Abstract
In this work we make predictions of several important molecular properties of academic and industrial importance to seek answers to two questions: 1) Can we apply efficient machine learning techniques, using inexpensive descriptors, to predict melting points to a reasonable level of accuracy? 2) Can values of this level of accuracy be usefully applied to predicting aqueous solubility? We present predictions of melting points made by several novel machine learning models, previously applied to solubility prediction. Additionally, we make predictions of solubility via the General Solubility Equation (GSE) and monitor the impact of varying the logP prediction model (AlogP and XlogP) on the GSE. We note that the machine learning models presented, using a modest number of 2D descriptors, can make melting point predictions in line with the current state of the art prediction methods (RMSE ≥ 40 oC). We also find that predicted melting points, with an RMSE of tens of degrees Celsius, can be usefully applied to the GSE to yield accurate solubility predictions (log10S RMSE < 1) over a small dataset of druglike molecules.
Citation
McDonagh , J , van Mourik , T & Mitchell , J B O 2015 , ' Predicting melting points of organic molecules : applications to aqueous solubility prediction using the General Solubility Equation ' , Molecular Informatics , vol. 34 , no. 11-12 , pp. 715-724 . https://doi.org/10.1002/minf.201500052
Publication
Molecular Informatics
Status
Peer reviewed
ISSN
1868-1743Type
Journal article
Rights
© 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim. This work is made available online in accordance with the publisher’s policies. This is the author created, accepted version manuscript following peer review and may differ slightly from the final published version. The final published version of this work is available at https://dx.doi.org/10.1002/minf.201500052. This article may be used for non-commercial purposes in accordance with Wiley-VCH Terms and Conditions for self-archiving. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim. This work is made available online in accordance with the publisher’s policies. This is the author created, accepted version manuscript following peer review and may differ slightly from the final published version. The final published version of this work is available at https://dx.doi.org/10.1002/minf.201500052. This article may be used for non-commercial purposes in accordance with Wiley-VCH Terms and Conditions for self-archiving.
Collections
Items in the St Andrews Research Repository are protected by copyright, with all rights reserved, unless otherwise indicated.