TY - JOUR
T1 - A Comprehensive Discovery Platform for Organophosphorus Ligands for Catalysis
AU - Gensch, Tobias
AU - Dos Passos Gomes, Gabriel
AU - Friederich, Pascal
AU - Peters, Ellyn
AU - Gaudin, Théophile
AU - Pollice, Robert
AU - Jorner, Kjell
AU - Nigam, Akshatkumar
AU - Lindner-D'Addario, Michael
AU - Sigman, Matthew S.
AU - Aspuru-Guzik, Alán
N1 - Funding Information:
T.Ge. thanks the Leopoldina Fellowship Programme of the German National Academy of Sciences Leopoldina (LPDS 2017-18). T.Ge. is funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under Germany’s Excellence Strategy (EXC 2008/1-390540038) and by a Liebig Fellowship of the Fonds der Chemischen Industrie. G.P.G. gratefully acknowledges the Natural Sciences and Engineering Research Council of Canada (NSERC) for the Banting Postdoctoral Fellowship. R.P. acknowledges funding through a Postdoc.Mobility fellowship by the Swiss National Science Foundation (SNSF, Project No. 191127). K.J. was a fellow of the AstraZeneca Postdoc Programme (2018–2020). M.L.D. gratefully acknowledges the Fonds de Recherche Quebec Nature et Technologies (FRQNT) for the B1X Master’s Scholarship and support from the Queen Elizabeth II Graduate Scholarship in Science and Technology (QEII-GSST). The support and resources from the Center for High Performance Computing at the University of Utah are gratefully acknowledged. We acknowledge the Defense Advanced Research Projects Agency (DARPA) under the Accelerated Molecular Discovery Program under Cooperative Agreement No. HR00111920027 dated August 1, 2019. The content of the information presented in this work does not necessarily reflect the position or the policy of the Government. A.A.-G. thanks Anders G. Frøseth for his generous support. A.A.-G. also acknowledges the generous support of Natural Resources Canada and the Canada 150 Research Chairs program. We thank Compute Canada for computational resources. DFT and xtb calculations were performed on the Niagara supercomputer at the SciNet HPC Consortium. SciNet is funded by the Canada Foundation for Innovation; the Government of Ontario; Ontario Research Fund - Research Excellence; and the University of Toronto. Machine learning models were developed and trained on the supercomputer Beluga from École de technologie supérieure, managed by Calcul Québec and Compute Canada. The operation of this supercomputer is funded by the Canada Foundation for Innovation (CFI), the ministère de l’Économie, de la science et de l’innovation du Québec (MESI), and the Fonds de recherche du Québec – Nature et technologies (FRQ-NT). We are grateful to UofT Matter Lab system administrators Dr. Claire Yu and Chris Crebolder for helping with the deployment of the web app. M.S.S. and E.P. thank the NSF under the CCI Center for Computer Assisted Synthesis (CHE-1925607) for support.
Publisher Copyright:
© 2022 American Chemical Society.
PY - 2022/1/26
Y1 - 2022/1/26
N2 - The design of molecular catalysts typically involves reconciling multiple conflicting property requirements, largely relying on human intuition and local structural searches. However, the vast number of potential catalysts requires pruning of the candidate space by efficient property prediction with quantitative structure-property relationships. Data-driven workflows embedded in a library of potential catalysts can be used to build predictive models for catalyst performance and serve as a blueprint for novel catalyst designs. Herein we introduce kraken, a discovery platform covering monodentate organophosphorus(III) ligands providing comprehensive physicochemical descriptors based on representative conformer ensembles. Using quantum-mechanical methods, we calculated descriptors for 1558 ligands, including commercially available examples, and trained machine learning models to predict properties of over 300000 new ligands. We demonstrate the application of kraken to systematically explore the property space of organophosphorus ligands and how existing data sets in catalysis can be used to accelerate ligand selection during reaction optimization.
AB - The design of molecular catalysts typically involves reconciling multiple conflicting property requirements, largely relying on human intuition and local structural searches. However, the vast number of potential catalysts requires pruning of the candidate space by efficient property prediction with quantitative structure-property relationships. Data-driven workflows embedded in a library of potential catalysts can be used to build predictive models for catalyst performance and serve as a blueprint for novel catalyst designs. Herein we introduce kraken, a discovery platform covering monodentate organophosphorus(III) ligands providing comprehensive physicochemical descriptors based on representative conformer ensembles. Using quantum-mechanical methods, we calculated descriptors for 1558 ligands, including commercially available examples, and trained machine learning models to predict properties of over 300000 new ligands. We demonstrate the application of kraken to systematically explore the property space of organophosphorus ligands and how existing data sets in catalysis can be used to accelerate ligand selection during reaction optimization.
UR - http://www.scopus.com/inward/record.url?scp=85123859191&partnerID=8YFLogxK
U2 - 10.1021/jacs.1c09718
DO - 10.1021/jacs.1c09718
M3 - Article
C2 - 35020383
AN - SCOPUS:85123859191
SN - 0002-7863
VL - 144
SP - 1205
EP - 1217
JO - Journal of the American Chemical Society
JF - Journal of the American Chemical Society
IS - 3
ER -