Predicting the protein targets for athletic performance-enhancing substances

Mavridis, Lazaros; Mitchell, John B. O.

Show simple item record

Files in this item

Name:: JofCheminformatics.pdf
Size:: 2.599Mb
Format:: PDF

View/Open

Item metadata

dc.contributor.author	Mavridis, Lazaros
dc.contributor.author	Mitchell, John B. O.
dc.date.accessioned	2013-07-09T16:01:02Z
dc.date.available	2013-07-09T16:01:02Z
dc.date.issued	2013-06-25
dc.identifier	57886905
dc.identifier	92cb23d2-6cc3-439c-8aa3-ab54ad390b09
dc.identifier	84880054720
dc.identifier.citation	Mavridis , L & Mitchell , J B O 2013 , ' Predicting the protein targets for athletic performance-enhancing substances ' , Journal of Cheminformatics , vol. 5 , no. 31 , 31 . https://doi.org/10.1186/1758-2946-5-31	en
dc.identifier.issn	1758-2946
dc.identifier.other	ORCID: /0000-0002-0379-6097/work/34033397
dc.identifier.uri	https://hdl.handle.net/10023/3838
dc.description.abstract	Background: The World Anti-Doping Agency (WADA) publishes the Prohibited List, a manually compiled international standard of substances and methods prohibited in-competition, out-of-competition and in particular sports. It would be ideal to be able to identify all substances that have one or more performance-enhancing pharmacological actions in an automated, fast and cost effective way. Here, we use experimental data derived from the ChEMBL database (~7,000,000 activity records for 1,300,000 compounds) to build a database model that takes into account both structure and experimental information, and use this database to predict both on-target and off-target interactions between these molecules and targets relevant to doping in sport. Results: The ChEMBL database was screened and eight well populated categories of activities (Ki, Kd, EC50, ED50, activity, potency, inhibition and IC50) were used for a rule-based filtering process to define the labels “active” or “inactive”. The “active” compounds for each of the ChEMBL families were thereby defined and these populated our bioactivity-based filtered families. A structure-based clustering step was subsequently performed in order to split families with more than one distinct chemical scaffold. This produced refined families, whose members share both a common chemical scaffold and bioactivity against a common target in ChEMBL. Conclusions: We have used the Parzen-Rosenblatt machine learning approach to test whether compounds in ChEMBL can be correctly predicted to belong to their appropriate refined families. Validation tests using the refined families gave a significant increase in predictivity compared with the filtered or with the original families. Out of 61,660 queries in our Monte Carlo cross-validation, belonging to 19,639 refined families, 41,300 (66.98%) had the parent family as the top prediction and 53,797 (87.25%) had the parent family in the top four hits. Having thus validated our approach, we used it to identify the protein targets associated with the WADA prohibited classes. For compounds where we do not have experimental data, we use their computed patterns of interaction with protein targets to make predictions of bioactivity. We hope that other groups will test these predictions experimentally in the future.
dc.format.extent	13
dc.format.extent	2725472
dc.language.iso	eng
dc.relation.ispartof	Journal of Cheminformatics	en
dc.subject	Protein target prediction	en
dc.subject	Polypharmacology	en
dc.subject	Machine learning	en
dc.subject	Side effects	en
dc.subject	Multi-label prediction	en
dc.subject	Drugs in sport	en
dc.subject	Drug repurposing	en
dc.subject	QD Chemistry	en
dc.subject.lcc	QD	en
dc.title	Predicting the protein targets for athletic performance-enhancing substances	en
dc.type	Journal article	en
dc.contributor.institution	University of St Andrews. School of Chemistry	en
dc.contributor.institution	University of St Andrews. Biomedical Sciences Research Complex	en
dc.contributor.institution	University of St Andrews. EaSTCHEM	en
dc.identifier.doi	https://doi.org/10.1186/1758-2946-5-31
dc.description.status	Peer reviewed	en
dc.identifier.url	http://www.jcheminf.com/content/5/1/31	en

This item appears in the following Collection(s)

University of St Andrews Research

Show simple item record