PubChem is an invaluable source of information about 99 million molecules accessible via a website or programmatically. PubChem is an open chemistry database at the National Institutes
Category: Data Analysis Tools
A listing of data analysis tools
TabPFN is a foundation model trained on around 130,000,000 synthetically generated datasets that mimic “real world” tabular data. These datasets sampled dataset size and number
A recent paper published in Nature caught my eye, Accurate predictions on small data with a tabular foundation model by Hollmann et al., Here we present

Whilst Apple’s emulation later Rosetta2 that allows X86_64 applications to run on Apple Silicon this is not ideal especially for computationally intensive tasks. The latest
This looks very useful for anyone having to process multiple molecules, I particularly like the error processing! The open-source package scikit-learn provides various machine learning
tmap is a very fast visualisation library for large, high-dimensional data sets. It was published in 2020 DOI and the code is available on GitHub

I’m sometimes asked for a tool to compare the similarity of a list of molecules with every other molecule in the list. I suspect there
A new Apple preprint has appeared on Arxiv. https://arxiv.org/pdf/2403.20329.pdf Reference resolution is an important problem, one that is essential to understand and success- fully handle
Project Jupyter is the winner of the White House OSTP “Technical Advancement to Enable Open Science” category. Open science relies on technical advancements and infrastructures
Great video by Alex Ziskind on installing a local large language models on Apple Silicon.