Saved in:
Bibliographic Details
Main Authors: Liu, Cong, Gong, Chengyue, Liu, Zhenyu, Zhao, Jiale, Zhang, Yuxuan
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2603.03946
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866917312909541376
author Liu, Cong
Gong, Chengyue
Liu, Zhenyu
Zhao, Jiale
Zhang, Yuxuan
author_facet Liu, Cong
Gong, Chengyue
Liu, Zhenyu
Zhao, Jiale
Zhang, Yuxuan
contents Generative models hold great promise for accelerating material discovery but are often limited by their inflexible single-stage generative process in designing valid and diverse materials. To address this, we propose a two-stage generative framework, Lang2Str, that combines the strengths of large language models (LLMs) and flow-based models for flexible and precise material generation. Our method frames the generative process as a conditional generative task, where an LLM provides high-level conditions by generating descriptions of material unit cells' geometric layouts and properties. These descriptions, informed by the LLM's extensive background knowledge, ensure reasonable structure designs. A conditioned flow model then decodes these textual conditions into precise continuous coordinates and unit cell parameters. This staged approach combines the structured reasoning of LLMs and the distribution modeling capabilities of flow models. Experimental results show that our method achieves competitive performance on \textit{ab initio} material generation and crystal structure prediction tasks, with generated structures exhibiting closer alignment to ground truth in both geometry and energy levels, surpassing state-of-the-art models. The flexibility and modularity of our framework further enable fine-grained control over the generation process, potentially leading to more efficient and customizable material design.
format Preprint
id arxiv_https___arxiv_org_abs_2603_03946
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Lang2Str: Two-Stage Crystal Structure Generation with LLMs and Continuous Flow Models
Liu, Cong
Gong, Chengyue
Liu, Zhenyu
Zhao, Jiale
Zhang, Yuxuan
Machine Learning
Generative models hold great promise for accelerating material discovery but are often limited by their inflexible single-stage generative process in designing valid and diverse materials. To address this, we propose a two-stage generative framework, Lang2Str, that combines the strengths of large language models (LLMs) and flow-based models for flexible and precise material generation. Our method frames the generative process as a conditional generative task, where an LLM provides high-level conditions by generating descriptions of material unit cells' geometric layouts and properties. These descriptions, informed by the LLM's extensive background knowledge, ensure reasonable structure designs. A conditioned flow model then decodes these textual conditions into precise continuous coordinates and unit cell parameters. This staged approach combines the structured reasoning of LLMs and the distribution modeling capabilities of flow models. Experimental results show that our method achieves competitive performance on \textit{ab initio} material generation and crystal structure prediction tasks, with generated structures exhibiting closer alignment to ground truth in both geometry and energy levels, surpassing state-of-the-art models. The flexibility and modularity of our framework further enable fine-grained control over the generation process, potentially leading to more efficient and customizable material design.
title Lang2Str: Two-Stage Crystal Structure Generation with LLMs and Continuous Flow Models
topic Machine Learning
url https://arxiv.org/abs/2603.03946