Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Chen, Haochen, Huang, Qi, Wu, Anan, Zhang, Wenhao, Ye, Jianliang, Wu, Jianming, Tan, Kai, Lu, Xin, Xu, Xin
Format: Preprint
Veröffentlicht: 2025
Schlagworte:
Online-Zugang:https://arxiv.org/abs/2510.26231
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
_version_ 1866908620001640448
author Chen, Haochen
Huang, Qi
Wu, Anan
Zhang, Wenhao
Ye, Jianliang
Wu, Jianming
Tan, Kai
Lu, Xin
Xu, Xin
author_facet Chen, Haochen
Huang, Qi
Wu, Anan
Zhang, Wenhao
Ye, Jianliang
Wu, Jianming
Tan, Kai
Lu, Xin
Xu, Xin
contents Automatic structure elucidation is essential for self-driving laboratories as it enables the system to achieve truly autonomous. This capability closes the experimental feedback loop, ensuring that machine learning models receive reliable structure information for real-time decision-making and optimization. Herein, we present DiSE, an end-to-end diffusion-based generative model that integrates multiple spectroscopic modalities, including MS, 13C and 1H chemical shifts, HSQC, and COSY, to achieve automated yet accurate structure elucidation of organic compounds. By learning inherent correlations among spectra through data-driven approaches, DiSE achieves superior accuracy, strong generalization across chemically diverse datasets, and robustness to experimental data despite being trained on calculated spectra. DiSE thus represents a significant advance toward fully automated structure elucidation, with broad potential in natural product research, drug discovery, and self-driving laboratories.
format Preprint
id arxiv_https___arxiv_org_abs_2510_26231
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle DiSE: A diffusion probabilistic model for automatic structure elucidation of organic compounds
Chen, Haochen
Huang, Qi
Wu, Anan
Zhang, Wenhao
Ye, Jianliang
Wu, Jianming
Tan, Kai
Lu, Xin
Xu, Xin
Information Retrieval
Automatic structure elucidation is essential for self-driving laboratories as it enables the system to achieve truly autonomous. This capability closes the experimental feedback loop, ensuring that machine learning models receive reliable structure information for real-time decision-making and optimization. Herein, we present DiSE, an end-to-end diffusion-based generative model that integrates multiple spectroscopic modalities, including MS, 13C and 1H chemical shifts, HSQC, and COSY, to achieve automated yet accurate structure elucidation of organic compounds. By learning inherent correlations among spectra through data-driven approaches, DiSE achieves superior accuracy, strong generalization across chemically diverse datasets, and robustness to experimental data despite being trained on calculated spectra. DiSE thus represents a significant advance toward fully automated structure elucidation, with broad potential in natural product research, drug discovery, and self-driving laboratories.
title DiSE: A diffusion probabilistic model for automatic structure elucidation of organic compounds
topic Information Retrieval
url https://arxiv.org/abs/2510.26231