Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Oschinski, Hedda, Ach, Maximilian L., Jakob, Konstantin S., Carbogno, Christian, Reuter, Karsten
Format:	Preprint
Published:	2026
Subjects:	Materials Science
Online Access:	https://arxiv.org/abs/2605.31495
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866910272513376256
author	Oschinski, Hedda Ach, Maximilian L. Jakob, Konstantin S. Carbogno, Christian Reuter, Karsten
author_facet	Oschinski, Hedda Ach, Maximilian L. Jakob, Konstantin S. Carbogno, Christian Reuter, Karsten
contents	The targeted discovery of inorganic materials remains challenging due to the vastness of compositional design spaces and the high cost of exhaustive screening. Task-specific generative artificial intelligence represents a particularly efficient alternative to screening, yet demands tedious collection of training data before providing real benefit. General-purpose large language models (LLMs) have recently shown tremendous potential for the targeted generation of single, optimal materials compositions without the need for task-specific fine-tuning. However, it is unclear whether LLMs generally pose an advantage compared to specialized generative models, in particular in large design spaces. Here, we demonstrate that such models are capable of covering entire regions of the targeted property space effectively and systematically. Using Elpasolite materials as an established benchmark for generative tasks in large chemical spaces, we find that an iterative prompt-and-response framework is able to recover on average 96% of all low-energy Elpasolites in the target region. This performance, driven mainly by iterative in-context learning, surpasses the generative abilities of previous, task-specific models. Our results establish general-purpose LLMs as flexible and accessible components for inverse materials design workflows.
format	Preprint
id	arxiv_https___arxiv_org_abs_2605_31495
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	General-purpose LLMs as Constrained Crystal Composition Generators Oschinski, Hedda Ach, Maximilian L. Jakob, Konstantin S. Carbogno, Christian Reuter, Karsten Materials Science The targeted discovery of inorganic materials remains challenging due to the vastness of compositional design spaces and the high cost of exhaustive screening. Task-specific generative artificial intelligence represents a particularly efficient alternative to screening, yet demands tedious collection of training data before providing real benefit. General-purpose large language models (LLMs) have recently shown tremendous potential for the targeted generation of single, optimal materials compositions without the need for task-specific fine-tuning. However, it is unclear whether LLMs generally pose an advantage compared to specialized generative models, in particular in large design spaces. Here, we demonstrate that such models are capable of covering entire regions of the targeted property space effectively and systematically. Using Elpasolite materials as an established benchmark for generative tasks in large chemical spaces, we find that an iterative prompt-and-response framework is able to recover on average 96% of all low-energy Elpasolites in the target region. This performance, driven mainly by iterative in-context learning, surpasses the generative abilities of previous, task-specific models. Our results establish general-purpose LLMs as flexible and accessible components for inverse materials design workflows.
title	General-purpose LLMs as Constrained Crystal Composition Generators
topic	Materials Science
url	https://arxiv.org/abs/2605.31495

Similar Items