In silico metabolism prediction is a cheminformatic task of autonomously predicting the set of
metabolic byproducts produced from a specified molecule and a set of enzymes or reactions.
Here we describe a novel machine-learned in silico cytochrome P450 (CYP450) metabolism
prediction suite, called CyProduct, that accurately predicts metabolic byproducts for a specified
molecule and a human CYP450 isoform. It includes three modules: (1) CypReact, a tool that
predicts if the query compound reacts with a given CYP450 enzyme; (2) CypBoM, a tool that
accurately predicts the “bond site” of the reaction (i.e., which specific bonds within the query
molecule react with the CYP isoform); and (3) MetaboGen, a tool that generates the metabolic
byproducts based on CypBoM’s bond-site prediction. CyProduct predicts metabolic
biotransformation products for each of the nine most important human CYP450 enzymes.
CypBoM uses an important new concept called “Bond of Metabolism” (BoM), which extends the
traditional “Site of Metabolism" (SoM) by specifying the information about the set of chemical
bonds that is modified or formed in a metabolic reaction (rather than the specific atom). We
created a BoM database for 3487 CYP450-mediated Phase I reactions, then used this to train the
CypBoM Predictor to predict the reactive bond locations on substrate molecules. CypBoM
Predictor’s cross-validated Jaccard score for reactive bond prediction ranged from 0.380 to 0.452
over the nine CYP450 enzymes. Over variants of a test set of 72 known CYP450 substrates and
30 non-reactants, CyProduct outperformed the other packages -- including ADMET Predictor,
BioTransformer and GLORY -- by an average of 200% (wrt Jaccard score) in terms of predicting
metabolites. The CyProduct suite and the datasets are freely available at
https://bitbucket.org/wishartlab/cyproduct/src/master/.
Published: September 10, 2021
Citation
Tian S., X. Cao, R. Greiner, C. Li, A. Guo, and D. Wishart. 2021.CyProduct: A software tool for accurately predicting the byproducts of human cytochrome P450 metabolism.Journal of Chemical Information and Modeling 61, no. 6:3128-3140.PNNL-SA-160965.doi:10.1021/acs.jcim.1c00144