Salvato in:
Dettagli Bibliografici
Autori principali: Ni, Yuyan, Feng, Shikun, Hong, Xin, Sun, Yuancheng, Ma, Wei-Ying, Ma, Zhi-Ming, Ye, Qiwei, Lan, Yanyan
Natura: Preprint
Pubblicazione: 2024
Soggetti:
Accesso online:https://arxiv.org/abs/2407.11086
Tags: Aggiungi Tag
Nessun Tag, puoi essere il primo ad aggiungerne!!
_version_ 1866916325654265856
author Ni, Yuyan
Feng, Shikun
Hong, Xin
Sun, Yuancheng
Ma, Wei-Ying
Ma, Zhi-Ming
Ye, Qiwei
Lan, Yanyan
author_facet Ni, Yuyan
Feng, Shikun
Hong, Xin
Sun, Yuancheng
Ma, Wei-Ying
Ma, Zhi-Ming
Ye, Qiwei
Lan, Yanyan
contents Deep learning methods have been considered promising for accelerating molecular screening in drug discovery and material design. Due to the limited availability of labelled data, various self-supervised molecular pre-training methods have been presented. While many existing methods utilize common pre-training tasks in computer vision (CV) and natural language processing (NLP), they often overlook the fundamental physical principles governing molecules. In contrast, applying denoising in pre-training can be interpreted as an equivalent force learning, but the limited noise distribution introduces bias into the molecular distribution. To address this issue, we introduce a molecular pre-training framework called fractional denoising (Frad), which decouples noise design from the constraints imposed by force learning equivalence. In this way, the noise becomes customizable, allowing for incorporating chemical priors to significantly improve molecular distribution modeling. Experiments demonstrate that our framework consistently outperforms existing methods, establishing state-of-the-art results across force prediction, quantum chemical properties, and binding affinity tasks. The refined noise design enhances force accuracy and sampling coverage, which contribute to the creation of physically consistent molecular representations, ultimately leading to superior predictive performance.
format Preprint
id arxiv_https___arxiv_org_abs_2407_11086
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Pre-training with Fractional Denoising to Enhance Molecular Property Prediction
Ni, Yuyan
Feng, Shikun
Hong, Xin
Sun, Yuancheng
Ma, Wei-Ying
Ma, Zhi-Ming
Ye, Qiwei
Lan, Yanyan
Machine Learning
Artificial Intelligence
Chemical Physics
Deep learning methods have been considered promising for accelerating molecular screening in drug discovery and material design. Due to the limited availability of labelled data, various self-supervised molecular pre-training methods have been presented. While many existing methods utilize common pre-training tasks in computer vision (CV) and natural language processing (NLP), they often overlook the fundamental physical principles governing molecules. In contrast, applying denoising in pre-training can be interpreted as an equivalent force learning, but the limited noise distribution introduces bias into the molecular distribution. To address this issue, we introduce a molecular pre-training framework called fractional denoising (Frad), which decouples noise design from the constraints imposed by force learning equivalence. In this way, the noise becomes customizable, allowing for incorporating chemical priors to significantly improve molecular distribution modeling. Experiments demonstrate that our framework consistently outperforms existing methods, establishing state-of-the-art results across force prediction, quantum chemical properties, and binding affinity tasks. The refined noise design enhances force accuracy and sampling coverage, which contribute to the creation of physically consistent molecular representations, ultimately leading to superior predictive performance.
title Pre-training with Fractional Denoising to Enhance Molecular Property Prediction
topic Machine Learning
Artificial Intelligence
Chemical Physics
url https://arxiv.org/abs/2407.11086