Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Cohen, Jaron, Hasson, Alexander G., Tanovic, Sara
Format:	Preprint
Published:	2025
Subjects:	Machine Learning Chemical Physics
Online Access:	https://arxiv.org/abs/2512.08077
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866909951662751744
author	Cohen, Jaron Hasson, Alexander G. Tanovic, Sara
author_facet	Cohen, Jaron Hasson, Alexander G. Tanovic, Sara
contents	Since the advent of machine learning, interpretability has remained a persistent challenge, becoming increasingly urgent as generative models support high-stakes applications in drug and material discovery. Recent advances in large language model (LLM) architectures have yielded chemistry language models (CLMs) with impressive capabilities in molecular property prediction and molecular generation. However, how these models internally represent chemical knowledge remains poorly understood. In this work, we extend sparse autoencoder techniques to uncover and examine interpretable features within CLMs. Applying our methodology to the Foundation Models for Materials (FM4M) SMI-TED chemistry foundation model, we extract semantically meaningful latent features and analyse their activation patterns across diverse molecular datasets. Our findings reveal that these models encode a rich landscape of chemical concepts. We identify correlations between specific latent features and distinct domains of chemical knowledge, including structural motifs, physicochemical properties, and pharmacological drug classes. Our approach provides a generalisable framework for uncovering latent knowledge in chemistry-focused AI systems. This work has implications for both foundational understanding and practical deployment; with the potential to accelerate computational chemistry research.
format	Preprint
id	arxiv_https___arxiv_org_abs_2512_08077
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Unveiling Latent Knowledge in Chemistry Language Models through Sparse Autoencoders Cohen, Jaron Hasson, Alexander G. Tanovic, Sara Machine Learning Chemical Physics Since the advent of machine learning, interpretability has remained a persistent challenge, becoming increasingly urgent as generative models support high-stakes applications in drug and material discovery. Recent advances in large language model (LLM) architectures have yielded chemistry language models (CLMs) with impressive capabilities in molecular property prediction and molecular generation. However, how these models internally represent chemical knowledge remains poorly understood. In this work, we extend sparse autoencoder techniques to uncover and examine interpretable features within CLMs. Applying our methodology to the Foundation Models for Materials (FM4M) SMI-TED chemistry foundation model, we extract semantically meaningful latent features and analyse their activation patterns across diverse molecular datasets. Our findings reveal that these models encode a rich landscape of chemical concepts. We identify correlations between specific latent features and distinct domains of chemical knowledge, including structural motifs, physicochemical properties, and pharmacological drug classes. Our approach provides a generalisable framework for uncovering latent knowledge in chemistry-focused AI systems. This work has implications for both foundational understanding and practical deployment; with the potential to accelerate computational chemistry research.
title	Unveiling Latent Knowledge in Chemistry Language Models through Sparse Autoencoders
topic	Machine Learning Chemical Physics
url	https://arxiv.org/abs/2512.08077

Similar Items