Vortex script for Getting UniChem data from ChEMBL

UniChem is a web resource provided by the EBI. It is a ‘Unified Chemical Identifier’ system, designed to assist in the rapid cross-referencing of chemical structures, and their identifiers, between multiple databases. UniChem currently contains data from 27 different data sources and provides links to 108,941,995 structures.

Chambers, J., Davies, M., Gaulton, A., Hersey, A., Velankar, S., Petryszak, R., Hastings, J., Bellis, L., McGlinchey, S. and Overington, J.P. UniChem: A Unified Chemical Structure Cross-Referencing and Identifier Tracking System. Journal of Cheminformatics 2013, 5:3 (January 2013). DOI: http://dx.doi.org/10.1186/1758-2946-5-3

Whilst I’ve written about a script to search using InChIKeys, it is also possible to search using compound identifiers.

ChEMBL also provides a RESTful web service for retrieving data from the UniChem database programmatically.

All RESTful queries are constructed using the following base URL

Specific query URLs are then constructed by adding a method name to this base URL, followed by the input data.

Input data may consist of three types

Since the different datasources will have different molecule identifiers for the same molecule, it is important to have both the ID and the corresponding datasource.

Since we have the ChEMBLID our URL will have the form

By default the data is returned in JSON format, with the key-value pairs being the datasource and the compound ID.
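As a rough illustration of handling such a response, the sketch below parses a JSON list of datasource/compound ID pairs into a dictionary. The field names `src_id` and `src_compound_id` are assumptions about the response layout, the identifier values are invented, and `parse_cross_references` is a hypothetical helper, so check everything against a real response before relying on it.

```python
import json

# Illustrative payload only: the field names "src_id" and "src_compound_id"
# are assumptions about the response layout, and the values are made up.
sample_response = '''
[
  {"src_id": "2", "src_compound_id": "EXAMPLE-0001"},
  {"src_id": "7", "src_compound_id": "EXAMPLE-0002"}
]
'''

def parse_cross_references(json_text):
    """Return a dict mapping datasource ID -> list of compound IDs."""
    xrefs = {}
    for record in json.loads(json_text):
        # A datasource may list more than one identifier for a structure,
        # so collect the IDs in a list rather than overwriting.
        xrefs.setdefault(record["src_id"], []).append(record["src_compound_id"])
    return xrefs

print(parse_cross_references(sample_response))
```

Keeping a list per datasource avoids silently dropping an identifier when a datasource returns several for the same structure.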

The first part of the script asks the user to select the column that contains the ChEMBLID, and then creates the new columns. It then loops through the workspace, calling the web service for each ID, parsing the returned JSON and populating the workspace.
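The loop described above can be sketched in plain Python, outside Vortex. This is not the downloadable script: the table is modelled as a list of dictionaries rather than a Vortex workspace, the web call is passed in as a `fetch` function so the example runs offline, and the base URL, method name, source ID and JSON field names are all placeholders or assumptions.

```python
import json

def build_query_url(base_url, method, compound_id, source_id):
    """Join base URL, method name and input data into one query URL."""
    return "/".join([base_url.rstrip("/"), method, compound_id, str(source_id)])

def annotate_rows(rows, id_column, fetch, base_url, method, source_id):
    """Add one column per datasource to each row that has an ID."""
    for row in rows:
        compound_id = row.get(id_column)
        if not compound_id:
            continue  # skip rows with no identifier
        url = build_query_url(base_url, method, compound_id, source_id)
        try:
            records = json.loads(fetch(url))
        except ValueError:
            continue  # response was not valid JSON; leave the row as it is
        for record in records:
            # Assumed field names; adjust to match the real response.
            column = "src_" + str(record["src_id"])
            value = record["src_compound_id"]
            existing = row.get(column)
            row[column] = value if not existing else existing + ";" + value
    return rows

def fake_fetch(url):
    # Stand-in for the real HTTP request; returns made-up data.
    return '[{"src_id": "2", "src_compound_id": "EXAMPLE-0001"}]'

# Placeholder base URL, method name and source ID, for illustration only.
table = [{"ChEMBLID": "CHEMBL0000"}, {"ChEMBLID": ""}]
annotate_rows(table, "ChEMBLID", fake_fetch, "https://example.org/rest", "method_name", 1)
print(table)
```

Passing the fetch function in as an argument keeps the parsing logic testable without network access. In a real run it would be replaced by a function that requests the URL and returns the response body as text.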

It should be straightforward to modify the script to search any of the datasources with the appropriate list of molecule identifiers.

The Vortex Script

The script can be downloaded from here 

Page Updated 15 February 2016
