Show simple item record

Files in this item

Thumbnail

Item metadata

dc.contributor.authorBoobier, Samuel
dc.contributor.authorOsbourn, Anne
dc.contributor.authorMitchell, John B. O.
dc.date.accessioned2017-12-18T12:30:24Z
dc.date.available2017-12-18T12:30:24Z
dc.date.issued2017-12-13
dc.identifier.citationBoobier , S , Osbourn , A & Mitchell , J B O 2017 , ' Can human experts predict solubility better than computers? ' , Journal of Cheminformatics , vol. 9 , no. 63 . https://doi.org/10.1186/s13321-017-0250-yen
dc.identifier.issn1758-2946
dc.identifier.otherPURE: 251536334
dc.identifier.otherPURE UUID: b4aeaf81-3601-40ca-8607-8a725eeb2b5f
dc.identifier.otherScopus: 85037991253
dc.identifier.otherORCID: /0000-0002-0379-6097/work/56638641
dc.identifier.otherWOS: 000417957400001
dc.identifier.urihttps://hdl.handle.net/10023/12351
dc.descriptionThis work took place as SB’s MChem undergraduate research project at the University of St Andrews. The authors thank University of St Andrews Library for funding the Open Access publication of this work. AO’s laboratory is supported by the UK Biotechnological and Biological Sciences Research Council (BBSRC) Institute Strategic Programme Grant ‘Molecules from Nature’ (BB/P012523/1) and the John Innes Foundation.en
dc.description.abstractIn this study, we design and carry out a survey, asking human experts to predict the aqueous solubility of druglike organic compounds. We investigate whether these experts, drawn largely from the pharmaceutical industry and academia, can match or exceed the predictive power of algorithms. Alongside this, we implement 10 typical machine learning algorithms on the same dataset. The best algorithm, a variety of neural network known as a multi-layer perceptron, gave an RMSE of 0.985 log S units and an R2 of 0.706. We would not have predicted the relative success of this particular algorithm in advance. We found that the best individual human predictor generated an almost identical prediction quality with an RMSE of 0.942 log S units and an R2 of 0.723. The collection of algorithms contained a higher proportion of reasonably good predictors, nine out of ten compared with around half of the humans. We found that, for either humans or algorithms, combining individual predictions into a consensus predictor by taking their median generated excellent predictivity. While our consensus human predictor achieved very slightly better headline figures on various statistical measures, the difference between it and the consensus machine learning predictor was both small and statistically insignificant. We conclude that human experts can predict the aqueous solubility of druglike molecules essentially equally well as machine learning algorithms. We find that, for either humans or algorithms, combining individual predictions into a consensus predictor by taking their median is a powerful way of benefitting from the wisdom of crowds.
dc.language.isoeng
dc.relation.ispartofJournal of Cheminformaticsen
dc.rights© The Author(s) 2017. This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.en
dc.subjectQD Chemistryen
dc.subjectQA75 Electronic computers. Computer scienceen
dc.subjectRM Therapeutics. Pharmacologyen
dc.subjectDASen
dc.subject.lccQDen
dc.subject.lccQA75en
dc.subject.lccRMen
dc.titleCan human experts predict solubility better than computers?en
dc.typeJournal articleen
dc.contributor.sponsorBBSRCen
dc.description.versionPublisher PDFen
dc.contributor.institutionUniversity of St Andrews. School of Chemistryen
dc.contributor.institutionUniversity of St Andrews. Biomedical Sciences Research Complexen
dc.contributor.institutionUniversity of St Andrews. EaSTCHEMen
dc.identifier.doihttps://doi.org/10.1186/s13321-017-0250-y
dc.description.statusPeer revieweden
dc.identifier.grantnumberBB/I00596X/1en


This item appears in the following Collection(s)

Show simple item record