Hello! I am Wenhao Gao (高文昊), an incoming Assistant Professor in the Department of Chemical and Biomolecular Engineering at the University of Pennsylvania, and a member of Innovation in Data Engineering and Science (IDEAS). I will be starting in January 2027 and will begin recruiting Ph.D. students from the December 2025 application cycle. Before joining Penn, I will be a postdoctoral researcher at Stanford University with Professor Grant Rotskoff and Professor Stefano Ermon.
I am interested in the intersection of chemistry and AI, particularly in how AI can transform molecular discovery. My research focuses on building systematic methodologies that enable scalable, effective, and efficient molecular discovery for applications such as drug design and sustainable materials. The main approach is to integrate chemical and physical priors with modern computational techniques to enhance the modeling and design of molecules and materials with targeted functionalities, with a recent emphasis on AI and machine learning. I am also interested in applying these methods to real-world problems, including therapeutic discovery and sustainable chemistry.
I recently received my PhD in Chemical Engineering from MIT, where I was advised by Professor Connor W. Coley. My PhD research was supported by the Google PhD Fellowship and the Takeda Fellowship. I am honored to be recognized among the CAS Future Leaders 2025, a D. E. Shaw Research Fellow, and a Forbes 30 Under 30 Asia honoree in Healthcare and Science. You can find more information about my research, publications, and projects, as well as my professional and academic background, below.
I'll be recruiting PhD and Master students in the December 2025 application cycle. If you're interested in joining my group, please apply to the CBE, CIS, or Chemistry PhD programs at Penn and list my name in your application. I'll also be recruiting postdocs. If you're interested in working with me, please email your CV along with a brief statement of your research interests and background. Please also include your professional goals and what you hope to learn and accomplish specifically while working in the group. If you have independent funding, kindly mention it in your email. If you're currently a PhD student at Penn and are interested in my research, feel free to reach out! Please note that due to the high volume of emails, I may not be able to respond to every message I receive.
Most recent publications on Google Scholar.
‡ indicates equal contribution.
Generative Artificial Intelligence for Navigating Synthesizable Chemical Space
Wenhao Gao‡, Shitong Luo‡ and Connor W. Coley
PNAS
Projecting Molecules into Synthesizable Chemical Spaces
Shitong Luo‡, Wenhao Gao‡, Zuofan Wu‡, Jian Peng, Connor W. Coley and Jianzhu Ma
ICML 2024
Substrate Scope Contrastive Learning: Repurposing Human Bias to Learn Atomic Representations
Wenhao Gao, Priyanka Raghavan, Ron Shprints and Connor W. Coley
JACS
Closing the Execution Gap in Generative AI for Chemicals and Materials: Freeways or Safeguards
Akshay Subramanian, Wenhao Gao, ... Rafael Gomez-Bombarelli
An MIT Exploration of Generative AI
Scientific Discovery in the Age of Artificial Intelligence
Hanchen Wang‡, Tianfan Fu‡, Yuanqi Du‡, Wenhao Gao, ... Marinka Zitnik
Nature
Sample Efficiency Matters: A Benchmark for Practical Molecular Optimization
Wenhao Gao‡, Tianfan Fu‡, Jimeng Sun and Connor W. Coley
NeurIPS 2022
Autonomous Platforms for Data-driven Organic Synthesis
Wenhao Gao, Priyanka Raghavan, and Connor W. Coley
Nature Communications
Amortized Tree Generation for Bottom-up Synthesis Planning and Synthesizable Molecular Design
Wenhao Gao, Rocio Mercado, and Connor W. Coley
ICLR 2022 (Spotlight)
Differentiable Scaffolding Tree for Molecular Optimization
Tianfan Fu‡, Wenhao Gao‡, Cao Xiao, Jacob Yasonik, Connor W. Coley and Jimeng Sun
ICLR 2022
Synthesizability of Molecules Proposed by Generative Models
Wenhao Gao and Connor W. Coley
Journal of Chemical Informatics and Modeling
Generative Artificial Intelligence for Navigating Synthesizable Chemical Space
Wenhao Gao‡, Shitong Luo‡ and Connor W. Coley
PNAS
Projecting Molecules into Synthesizable Chemical Spaces
Shitong Luo‡, Wenhao Gao‡, Zuofan Wu‡, Jian Peng, Connor W. Coley and Jianzhu Ma
ICML 2024
Double-Ended Synthesis Planning with Goal-Constrained Bidirectional Search
Kevin Yu, Jihye Roh, Ziang Li, Wenhao Gao, Runzhong Wang and Connor W Coley
NeurIPS 2024 (Spotlight)
Efficient Evolutionary Search over Chemical Space with Large Language Models
Haorui Wang, Marta Skreta, Yuanqi Du, Wenhao Gao, Lingkai Kong, Cher Tian Ser, Felix Strieth-Kalthoff, Chenru Duan, Yuchen Zhuang, Yue Yu, Yanqiao Zhu, Alan Aspuru-Guzik, Kirill Neklyudov and Chao Zhang
ICLR 2025
Syntax-Guided Procedural Synthesis of Molecules
Michael Sun, Alston Lo, Wenhao Gao, Minghao Guo, Veronika Thost, Jie Chen, Connor W. Coley and Wojciech Matusik
ArXiv
TDC-2: Multimodal Foundation for Therapeutic Science
Alejandro Velez-Arce, Kexin Huang, Michelle Li, Xiang Lin, Wenhao Gao, Tianfan Fu, Manolis Kellis, Bradley L Pentelute and Marinka Zitnik
ArXiv
Substrate Scope Contrastive Learning: Repurposing Human Bias to Learn Atomic Representations
Wenhao Gao, Priyanka Raghavan, Ron Shprints and Connor W. Coley
JACS
AUTODIFF: Autoregressive Diffusion Modeling for Structure-based Drug Design
Xinze Li, Penglei Wang, Tianfan Fu, Wenhao Gao, Chengtao Li, Leilei Shi and Junhong Liu
ArXiv
Closing the Execution Gap in Generative AI for Chemicals and Materials: Freeways or Safeguards
Akshay Subramanian, Wenhao Gao, ... Rafael Gomez-Bombarelli
An MIT Exploration of Generative AI
Scientific Discovery in the Age of Artificial Intelligence
Hanchen Wang‡, Tianfan Fu‡, Yuanqi Du‡, Wenhao Gao, ... Marinka Zitnik
Nature
Artificial Intelligence Foundation for Therapeutic Science
Kexin Huang‡, Tianfan Fu‡, Wenhao Gao‡, Yue Zhao, Yusuf Roohani, Jure Leskovec, Connor W. Coley, Cao Xiao, Jimeng Sun and Marinka Zitnik
Nature Chemical Biology
Sample Efficiency Matters: A Benchmark for Practical Molecular Optimization
Wenhao Gao‡, Tianfan Fu‡, Jimeng Sun and Connor W. Coley
NeurIPS 2022
Reinforced Genetic Algorithm for Structure-based Drug Design
Tianfan Fu‡, Wenhao Gao‡, Connor W. Coley and Jimeng Sun
NeurIPS 2022
Autonomous Platforms for Data-driven Organic Synthesis
Wenhao Gao, Priyanka Raghavan, and Connor W. Coley
Nature Communications
Amortized Tree Generation for Bottom-up Synthesis Planning and Synthesizable Molecular Design
Wenhao Gao, Rocio Mercado, and Connor W. Coley
ICLR 2022 (Spotlight)
Differentiable Scaffolding Tree for Molecular Optimization
Tianfan Fu‡, Wenhao Gao‡, Cao Xiao, Jacob Yasonik, Connor W. Coley and Jimeng Sun
ICLR 2022
Pairwise Difference Regression: A Machine Learning Meta-algorithm for Improved Prediction and Uncertainty Quantification in Chemical Search
Michael Tynes, Wenhao Gao, Daniel J. Burrill, Enrique R. Batista, Danny Perez, Ping Yang and Nicholas Lubbers
Journal of Chemical Informatics and Modeling
Therapeutics Data Commons: Machine Learning Datasets and Tasks for Therapeutics
Kexin Huang‡, Tianfan Fu‡, Wenhao Gao‡, Yue Zhao, Yusuf Roohani, Jure Leskovec, Connor W. Coley, Cao Xiao, Jimeng Sun and Marinka Zitnik
NeurIPS 2021
Deep Learning in Protein Structural Modeling and Design
Wenhao Gao, Sai P. Mahajan, Jeremias Sulam and Jeffrey J.Gray
Patterns
Synthesizability of Molecules Proposed by Generative Models
Wenhao Gao and Connor W. Coley
Journal of Chemical Informatics and Modeling