We describe the design and data collection (and associated challenges) for the SAMPL6 part II logP octanol-water blind prediction challenge, where the goal was to benchmark the accuracy of force fields for druglike molecules (here, molecules resembling kinase inhibitors).
Steven K. Albanese*, Daniel L. Parton*, Mehtap Isik**, Lucelenie Rodríguez-Laureano**, Sonya M. Hanson, Julie M. Behr, Scott Gradia, Chris Jeans, Nicholas M. Levinson, Markus A. Seeliger, and John D. Chodera.
* co-first author; ** co-second author
Biochemistry 57:4675, 2018. [DOI] [PDF] [bioRxiv] [GitHub]
Interactive data browser: [github.io]
Plasmids available via AddGene
Human kinase catalytic domains---the therapeutic target of selective kinase inhibitors used in the treatment of cancer and other diseases---are notoriously difficult and expensive to express in insect or human cells. Here, we utilize the phosphatase co-expression technology developed by Markus Seeliger (now at Stony Brook) to develop a library of human kinase catalytic domains for facile and inexpensive expression in bacteria.
Ariën S. Rustenburg, Justin Dancer, Baiwei Lin, Jianweng A. Feng, Daniel F. Ortwine, David L. Mobley, and John D. Chodera.
Journal of Computer-Aided Molecular Design 30:945, 2016. [DOI] [bioRxiv] [PDF] // data: [GitHub]
Solicited manuscript for special issue of the Journal of Computer Aided Molecular Design on the SAMPL5 Challenge.
The SAMPL Challenges have driven predictive physical modeling for ligand:protein binding forward by focusing the community on a series of blind challenges that evaluate performance on blind datasets, focus attention on current challenges for physical modeling techniques, and provide high-quality experimental datasets to the community after the challenge is over. For many years, challenges focused around hydration free energies have proven to be extremely useful, with theory now able to determine when experiment is wrong. To replace these challenges, since no more hydration free energy data is being measured, we proposed to use the partition or distribution coefficients of small druglike molecules between aqueous and apolar phases. We report the collection of cyclohexane-water partition data for a set of compounds used to drive the SAMPL5 distribution coefficient challenge, providing the experimental data, methodology, and insight for future iterations of this challenge.