Guardado en:
Detalles Bibliográficos
Autores principales: Wang, Xiao, Yang, Qingquan, Wang, Fuling, Chen, Qiang, Wu, Wentao, Jin, Yu, Jiang, Jingtao, Jin, Liye, Jiang, Bo, Sun, Dengdi, Lv, Wanli, Chen, Meiwen, Chen, Zehua, Xu, Guosheng, Tang, Jin
Formato: Preprint
Publicado: 2025
Materias:
Acceso en línea:https://arxiv.org/abs/2502.05615
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
_version_ 1866910819926671360
author Wang, Xiao
Yang, Qingquan
Wang, Fuling
Chen, Qiang
Wu, Wentao
Jin, Yu
Jiang, Jingtao
Jin, Liye
Jiang, Bo
Sun, Dengdi
Lv, Wanli
Chen, Meiwen
Chen, Zehua
Xu, Guosheng
Tang, Jin
author_facet Wang, Xiao
Yang, Qingquan
Wang, Fuling
Chen, Qiang
Wu, Wentao
Jin, Yu
Jiang, Jingtao
Jin, Liye
Jiang, Bo
Sun, Dengdi
Lv, Wanli
Chen, Meiwen
Chen, Zehua
Xu, Guosheng
Tang, Jin
contents Nuclear fusion is one of the most promising ways for humans to obtain infinite energy. Currently, with the rapid development of artificial intelligence, the mission of nuclear fusion has also entered a critical period of its development. How to let more people to understand nuclear fusion and join in its research is one of the effective means to accelerate the implementation of fusion. This paper proposes the first large model in the field of nuclear fusion, XiHeFusion, which is obtained through supervised fine-tuning based on the open-source large model Qwen2.5-14B. We have collected multi-source knowledge about nuclear fusion tasks to support the training of this model, including the common crawl, eBooks, arXiv, dissertation, etc. After the model has mastered the knowledge of the nuclear fusion field, we further used the chain of thought to enhance its logical reasoning ability, making XiHeFusion able to provide more accurate and logical answers. In addition, we propose a test questionnaire containing 180+ questions to assess the conversational ability of this science popularization large model. Extensive experimental results show that our nuclear fusion dialogue model, XiHeFusion, can perform well in answering science popularization knowledge. The pre-trained XiHeFusion model is released on https://github.com/Event-AHU/XiHeFusion.
format Preprint
id arxiv_https___arxiv_org_abs_2502_05615
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle XiHeFusion: Harnessing Large Language Models for Science Communication in Nuclear Fusion
Wang, Xiao
Yang, Qingquan
Wang, Fuling
Chen, Qiang
Wu, Wentao
Jin, Yu
Jiang, Jingtao
Jin, Liye
Jiang, Bo
Sun, Dengdi
Lv, Wanli
Chen, Meiwen
Chen, Zehua
Xu, Guosheng
Tang, Jin
Computer Vision and Pattern Recognition
Artificial Intelligence
Nuclear fusion is one of the most promising ways for humans to obtain infinite energy. Currently, with the rapid development of artificial intelligence, the mission of nuclear fusion has also entered a critical period of its development. How to let more people to understand nuclear fusion and join in its research is one of the effective means to accelerate the implementation of fusion. This paper proposes the first large model in the field of nuclear fusion, XiHeFusion, which is obtained through supervised fine-tuning based on the open-source large model Qwen2.5-14B. We have collected multi-source knowledge about nuclear fusion tasks to support the training of this model, including the common crawl, eBooks, arXiv, dissertation, etc. After the model has mastered the knowledge of the nuclear fusion field, we further used the chain of thought to enhance its logical reasoning ability, making XiHeFusion able to provide more accurate and logical answers. In addition, we propose a test questionnaire containing 180+ questions to assess the conversational ability of this science popularization large model. Extensive experimental results show that our nuclear fusion dialogue model, XiHeFusion, can perform well in answering science popularization knowledge. The pre-trained XiHeFusion model is released on https://github.com/Event-AHU/XiHeFusion.
title XiHeFusion: Harnessing Large Language Models for Science Communication in Nuclear Fusion
topic Computer Vision and Pattern Recognition
Artificial Intelligence
url https://arxiv.org/abs/2502.05615