Nutmeg and SPICE: Models and data for biomolecular machine learning

Peter Eastman, Benjamin P. Pritchard, John D. Chodera, Thomas E. Markland
Journal of Chemical Theory and Computation 20:8583, 2024.
[DOI] [preprint]

We present a significant expansion of the SPICE dataset, a large-scale quantum chemical dataset for training machine learning potentials, and show how it can be used to build extremely accurate machine learning potentials.

NNP/MM: Fast molecular dynamics simulations with machine learning potentials and molecular mechanics

Galvelis R, Varela-Rial A, Doerr S, Fino R, Eastman P, Markland TE, Chodera JD, and de Fabritiis G
Journal of Chemical Information and Modeling 63:5701, 2023 [DOI] [arXiv]

We demonstrate that a new generation of quantum machine learning (QML) potentials based on neural networks, which can achieve quantum chemical accuracy at a fraction of the cost, can be implemented efficiently in the OpenMM molecular dynamics simulation engine as part of hybrid machine learning / molecular mechanics (ML/MM) potentials that promise to deliver superior accuracy for modeling protein-ligand interactions.

SPICE, A Dataset of Drug-like Molecules and Peptides for Training Machine Learning Potentials

Eastman P, Behara PK, Dotson DL, Galvelis R, Herr JE, Horton JT, Mao Y, Chodera JD, Pritchard BP, Wang Y, De Fabritiis G, and Markland TE
Scientific Data 10:11, 2023 [DOI]

To remedy the lack of large, open quantum chemical datasets for training accurate general machine learning potentials and molecular mechanics force fields for drug-like small molecules and biomolecules, we produce the open SPICE dataset and show how it can be used to build extremely accurate machine learning potentials.

OpenMM 7: Rapid Development of High Performance Algorithms for Molecular Dynamics

Peter Eastman, Jason Swails, John D. Chodera, Robert T. McGibbon, Yutong Zhao, Kyle A. Beauchamp, Lee-Ping Wang, Andrew C. Simmonett, Matthew P. Harrigan, Chaya D. Stern, Rafal P. Wiewiora, Bernard R. Brooks, Vijay S. Pande
PLoS Computational Biology 13:e1005659, 2017. [DOI] [bioRxiv] [website] [GitHub]

We describe the latest version of OpenMM, a GPU-accelerated framework for high performance molecular simulation applications.