Saved in:
Bibliographic Details
Main Authors: Ha, Minh-Quyet, Le, Dinh-Khiet, Dao, Duc-Anh, Vu, Tien-Sinh, Nguyen, Duong-Nguyen, Nguyen, Viet-Cuong, Kino, Hiori, Huynh, Van-Nam, Dam, Hieu-Chi
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2502.14631
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866929722598883328
author Ha, Minh-Quyet
Le, Dinh-Khiet
Dao, Duc-Anh
Vu, Tien-Sinh
Nguyen, Duong-Nguyen
Nguyen, Viet-Cuong
Kino, Hiori
Huynh, Van-Nam
Dam, Hieu-Chi
author_facet Ha, Minh-Quyet
Le, Dinh-Khiet
Dao, Duc-Anh
Vu, Tien-Sinh
Nguyen, Duong-Nguyen
Nguyen, Viet-Cuong
Kino, Hiori
Huynh, Van-Nam
Dam, Hieu-Chi
contents Discovering novel high-entropy alloys (HEAs) with desirable properties is challenging due to the vast compositional space and complex phase formation mechanisms. Efficient exploration of this space requires a strategic approach that integrates heterogeneous knowledge sources. Here, we propose a framework that systematically combines knowledge extracted from computational material datasets with domain knowledge distilled from scientific literature using large language models (LLMs). A central feature of this approach is the explicit consideration of element substitutability, identifying chemically similar elements that can be interchanged to potentially stabilize desired HEAs. Dempster-Shafer theory, a mathematical framework for reasoning under uncertainty, is employed to model and combine substitutabilities based on aggregated evidence from multiple sources. The framework predicts the phase stability of candidate HEA compositions and is systematically evaluated on both quaternary alloy systems, demonstrating superior performance compared to baseline machine learning models and methods reliant on single-source evidence in cross-validation experiments. By leveraging multi-source knowledge, the framework retains robust predictive power even when key elements are absent from the training data, underscoring its potential for knowledge transfer and extrapolation. Furthermore, the enhanced interpretability of the methodology offers insights into the fundamental factors governing HEA formation. Overall, this work provides a promising strategy for accelerating HEA discovery by integrating computational and textual knowledge sources, enabling efficient exploration of vast compositional spaces with improved generalization and interpretability.
format Preprint
id arxiv_https___arxiv_org_abs_2502_14631
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Synergistic Fusion of Multi-Source Knowledge via Evidence Theory for High-Entropy Alloy Discovery
Ha, Minh-Quyet
Le, Dinh-Khiet
Dao, Duc-Anh
Vu, Tien-Sinh
Nguyen, Duong-Nguyen
Nguyen, Viet-Cuong
Kino, Hiori
Huynh, Van-Nam
Dam, Hieu-Chi
Machine Learning
Discovering novel high-entropy alloys (HEAs) with desirable properties is challenging due to the vast compositional space and complex phase formation mechanisms. Efficient exploration of this space requires a strategic approach that integrates heterogeneous knowledge sources. Here, we propose a framework that systematically combines knowledge extracted from computational material datasets with domain knowledge distilled from scientific literature using large language models (LLMs). A central feature of this approach is the explicit consideration of element substitutability, identifying chemically similar elements that can be interchanged to potentially stabilize desired HEAs. Dempster-Shafer theory, a mathematical framework for reasoning under uncertainty, is employed to model and combine substitutabilities based on aggregated evidence from multiple sources. The framework predicts the phase stability of candidate HEA compositions and is systematically evaluated on both quaternary alloy systems, demonstrating superior performance compared to baseline machine learning models and methods reliant on single-source evidence in cross-validation experiments. By leveraging multi-source knowledge, the framework retains robust predictive power even when key elements are absent from the training data, underscoring its potential for knowledge transfer and extrapolation. Furthermore, the enhanced interpretability of the methodology offers insights into the fundamental factors governing HEA formation. Overall, this work provides a promising strategy for accelerating HEA discovery by integrating computational and textual knowledge sources, enabling efficient exploration of vast compositional spaces with improved generalization and interpretability.
title Synergistic Fusion of Multi-Source Knowledge via Evidence Theory for High-Entropy Alloy Discovery
topic Machine Learning
url https://arxiv.org/abs/2502.14631