Vortex script to predict sites of metabolism

It is really useful to have two sites of metabolism tools available that use contrasting methodologies, FAME 2 using curated dataset of experimentally determined metabolism data to build a machine learning model using simple descriptors. In contrast SMARTCyp uses precomputed activation energies from density functional theory (DFT) calculations of model compounds. These are used to predict the reactivity of similar fragments within the target molecule the final score is modified to reflect the accessibility to the active site of the different CYP450 iso forms and improvements for N-oxidations of tertiary amines are included, specifically an empirical corrections to unlikely oxidations of tertiary alkylamines.

I previously wrote a script displaying the results of a SMARTCyp calculation in a webview, The first part of the script imports the smartcyp.jar, however with each update I was finding issues so I thought it might be better to simply treat SMARTCyp as a command line application and use subprocess to access it.

SMARTCyp uses a large number of quantum chemical calculations of fragment activation energies have previously been performed. The results have been compiled into rules consisting of SMARTS patterns and associated energies. SMARTCyp only uses the 2D structure of a compound, and the energy required for oxidation at each atom is computed by fragment matching towards the SMARTS patterns. If multiple patterns match the same atom the pattern with the lowest energy is used. The accessibility is approximated as the relative topological distance of an atom from the center of the molecule, and the final score is computed as Score = energy – 8*accessibility.

The results are displayed as a web page using ChemDoodle Web components.

The Vortex scripts are shown below and can be downloaded from the link at the bottom of the page. REMEMBER you will need to edit some of the paths for your environment.

The SMARTCyp Vortex Script

The FAME2 Vortex Script

FAME 2 uses an extremely randomised trees approach which is a computationally efficient classification algorithm. FAME 2 uses circular descriptions of atoms and their environments. 

The scripts are activated by right-clicking on a row in the workspace allowing the user to choose the script from the dropdown menu, the calculation may take a few seconds depending on the size of the structure, and the results are shown below.

he scripts can be downloaded here 

and need to be put in the context folder that is in the Vortex Add ons folder.

Last updated 21 September 2017

Related Posts