Saved in:
Bibliographic Details
Main Authors: Guo, Kehan, Shen, Yili, Gonzalez-Montiel, Gisela Abigail, Huang, Yue, Zhou, Yujun, Surve, Mihir, Guo, Zhichun, Das, Prayel, Chawla, Nitesh V, Wiest, Olaf, Zhang, Xiangliang
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2502.09897
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866913690765230080
author Guo, Kehan
Shen, Yili
Gonzalez-Montiel, Gisela Abigail
Huang, Yue
Zhou, Yujun
Surve, Mihir
Guo, Zhichun
Das, Prayel
Chawla, Nitesh V
Wiest, Olaf
Zhang, Xiangliang
author_facet Guo, Kehan
Shen, Yili
Gonzalez-Montiel, Gisela Abigail
Huang, Yue
Zhou, Yujun
Surve, Mihir
Guo, Zhichun
Das, Prayel
Chawla, Nitesh V
Wiest, Olaf
Zhang, Xiangliang
contents The rapid advent of machine learning (ML) and artificial intelligence (AI) has catalyzed major transformations in chemistry, yet the application of these methods to spectroscopic and spectrometric data, referred to as Spectroscopy Machine Learning (SpectraML), remains relatively underexplored. Modern spectroscopic techniques (MS, NMR, IR, Raman, UV-Vis) generate an ever-growing volume of high-dimensional data, creating a pressing need for automated and intelligent analysis beyond traditional expert-based workflows. In this survey, we provide a unified review of SpectraML, systematically examining state-of-the-art approaches for both forward tasks (molecule-to-spectrum prediction) and inverse tasks (spectrum-to-molecule inference). We trace the historical evolution of ML in spectroscopy, from early pattern recognition to the latest foundation models capable of advanced reasoning, and offer a taxonomy of representative neural architectures, including graph-based and transformer-based methods. Addressing key challenges such as data quality, multimodal integration, and computational scalability, we highlight emerging directions such as synthetic data generation, large-scale pretraining, and few- or zero-shot learning. To foster reproducible research, we also release an open-source repository containing recent papers and their corresponding curated datasets (https://github.com/MINE-Lab-ND/SpectrumML_Survey_Papers). Our survey serves as a roadmap for researchers, guiding progress at the intersection of spectroscopy and AI.
format Preprint
id arxiv_https___arxiv_org_abs_2502_09897
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Artificial Intelligence in Spectroscopy: Advancing Chemistry from Prediction to Generation and Beyond
Guo, Kehan
Shen, Yili
Gonzalez-Montiel, Gisela Abigail
Huang, Yue
Zhou, Yujun
Surve, Mihir
Guo, Zhichun
Das, Prayel
Chawla, Nitesh V
Wiest, Olaf
Zhang, Xiangliang
Artificial Intelligence
Machine Learning
The rapid advent of machine learning (ML) and artificial intelligence (AI) has catalyzed major transformations in chemistry, yet the application of these methods to spectroscopic and spectrometric data, referred to as Spectroscopy Machine Learning (SpectraML), remains relatively underexplored. Modern spectroscopic techniques (MS, NMR, IR, Raman, UV-Vis) generate an ever-growing volume of high-dimensional data, creating a pressing need for automated and intelligent analysis beyond traditional expert-based workflows. In this survey, we provide a unified review of SpectraML, systematically examining state-of-the-art approaches for both forward tasks (molecule-to-spectrum prediction) and inverse tasks (spectrum-to-molecule inference). We trace the historical evolution of ML in spectroscopy, from early pattern recognition to the latest foundation models capable of advanced reasoning, and offer a taxonomy of representative neural architectures, including graph-based and transformer-based methods. Addressing key challenges such as data quality, multimodal integration, and computational scalability, we highlight emerging directions such as synthetic data generation, large-scale pretraining, and few- or zero-shot learning. To foster reproducible research, we also release an open-source repository containing recent papers and their corresponding curated datasets (https://github.com/MINE-Lab-ND/SpectrumML_Survey_Papers). Our survey serves as a roadmap for researchers, guiding progress at the intersection of spectroscopy and AI.
title Artificial Intelligence in Spectroscopy: Advancing Chemistry from Prediction to Generation and Beyond
topic Artificial Intelligence
Machine Learning
url https://arxiv.org/abs/2502.09897