Saved in:
Bibliographic Details
Main Authors: Sun, Michael, Guo, Minghao, Yuan, Weize, Thost, Veronika, Owens, Crystal Elaine, Grosz, Aristotle Franklin, Selvan, Sharvaa, Zhou, Katelyn, Mohiuddin, Hassan, Pedretti, Benjamin J, Smith, Zachary P, Chen, Jie, Matusik, Wojciech
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2403.08147
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866914878836441088
author Sun, Michael
Guo, Minghao
Yuan, Weize
Thost, Veronika
Owens, Crystal Elaine
Grosz, Aristotle Franklin
Selvan, Sharvaa
Zhou, Katelyn
Mohiuddin, Hassan
Pedretti, Benjamin J
Smith, Zachary P
Chen, Jie
Matusik, Wojciech
author_facet Sun, Michael
Guo, Minghao
Yuan, Weize
Thost, Veronika
Owens, Crystal Elaine
Grosz, Aristotle Franklin
Selvan, Sharvaa
Zhou, Katelyn
Mohiuddin, Hassan
Pedretti, Benjamin J
Smith, Zachary P
Chen, Jie
Matusik, Wojciech
contents Recent research in molecular discovery has primarily been devoted to small, drug-like molecules, leaving many similarly important applications in material design without adequate technology. These applications often rely on more complex molecular structures with fewer examples that are carefully designed using known substructures. We propose a data-efficient and interpretable model for representing and reasoning over such molecules in terms of graph grammars that explicitly describe the hierarchical design space featuring motifs to be the design basis. We present a novel representation in the form of random walks over the design space, which facilitates both molecule generation and property prediction. We demonstrate clear advantages over existing methods in terms of performance, efficiency, and synthesizability of predicted molecules, and we provide detailed insights into the method's chemical interpretability.
format Preprint
id arxiv_https___arxiv_org_abs_2403_08147
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Representing Molecules as Random Walks Over Interpretable Grammars
Sun, Michael
Guo, Minghao
Yuan, Weize
Thost, Veronika
Owens, Crystal Elaine
Grosz, Aristotle Franklin
Selvan, Sharvaa
Zhou, Katelyn
Mohiuddin, Hassan
Pedretti, Benjamin J
Smith, Zachary P
Chen, Jie
Matusik, Wojciech
Machine Learning
Biomolecules
Recent research in molecular discovery has primarily been devoted to small, drug-like molecules, leaving many similarly important applications in material design without adequate technology. These applications often rely on more complex molecular structures with fewer examples that are carefully designed using known substructures. We propose a data-efficient and interpretable model for representing and reasoning over such molecules in terms of graph grammars that explicitly describe the hierarchical design space featuring motifs to be the design basis. We present a novel representation in the form of random walks over the design space, which facilitates both molecule generation and property prediction. We demonstrate clear advantages over existing methods in terms of performance, efficiency, and synthesizability of predicted molecules, and we provide detailed insights into the method's chemical interpretability.
title Representing Molecules as Random Walks Over Interpretable Grammars
topic Machine Learning
Biomolecules
url https://arxiv.org/abs/2403.08147