Clustering is an invaluable cheminformatics technique for subdividing a typically large compound collection into small groups of similar compounds. One of the advantages is that
Category: Jupyter Notebook
tmap is a very fast visualisation library for large, high-dimensional data sets. It was published in 2020 DOI and the code is available on GitHub
I’m sometimes asked for a tool to compare the similarity of a list of molecules with every other molecule in the list. I suspect there
Project Jupyter is the winner of the White House OSTP “Technical Advancement to Enable Open Science” category. Open science relies on technical advancements and infrastructures
Another tutorial from Greg Landrum, focussing on how to take advantage of modern multithreaded CPUs. https://greglandrum.github.io/rdkit-blog/posts/2024-02-11-more-multithreading.html Looking at Generating fingerprints, Molecular standardization, Conformer generation, RMSD
A previous post described Getting Ligand ID for multiple PDB files, using the PDBe API (https://www.ebi.ac.uk/pdbe/pdbe-services). The result is a csv file containing the ligand
The Protein Data Bank (https://www.rcsb.org) is an invaluable repository of 3D biomolecular structures. As of writing the database contains 214,791 structures (X-ray, Cryo-EM and NMR)
The Protein Data Bank (https://www.rcsb.org) is an invaluable repository of 3D biomolecular structures. As of writing the database contains 214,791 structures (X-ray, Cryo-EM and NMR)
Scikit-Mol originated during an RDKit hackathon and has gradually evolved into a tool that I looks extremely useful. Scikit-Mol is related to Scikit-learn and molecules,
Cambridge Open Engage is the collaborative platform to upload, share and advance early and open research and a recent post caught my eye. Molecular set