Saved in:
Bibliographic Details
Main Authors: Zhang, Shansi, Li, Min
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2511.10651
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866914157694025728
author Zhang, Shansi
Li, Min
author_facet Zhang, Shansi
Li, Min
contents Data analysis and performance evaluation of simulation deduction plays a pivotal role in modern warfare, which enables military personnel to gain invaluable insights into the potential effectiveness of different strategies, tactics, and operational plans. Traditional manual analysis approach is time-consuming and limited by human errors. To enhance efficiency and accuracy, large language models (LLMs) with strong analytical and inferencing capabilities can be employed. However, high-quality analysis reports with well-structured formatting cannot be obtained through a single instruction input to the LLM. To tackle this issue, we propose a method that first decomposes the complex task into several sub-tasks and designs effective system prompts and user prompts for each sub-task. Multi-round interactions with the LLM incorporating self-check and reflection are then conducted to enable structured data extraction as well as multi-step analysis and evaluation. Furthermore, custom tools are defined and invoked to generate figures and compute metrics. We also design multiple report templates, each tailored to a specific application and input data type, ensuring their adaptability across a variety of scenarios. Extensive evaluation results demonstrate that the reports generated by our method exhibit higher quality, therefore obtaining higher scores than the baseline method.
format Preprint
id arxiv_https___arxiv_org_abs_2511_10651
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Data Analysis and Performance Evaluation of Simulation Deduction Based on LLMs
Zhang, Shansi
Li, Min
Computation and Language
Artificial Intelligence
Data analysis and performance evaluation of simulation deduction plays a pivotal role in modern warfare, which enables military personnel to gain invaluable insights into the potential effectiveness of different strategies, tactics, and operational plans. Traditional manual analysis approach is time-consuming and limited by human errors. To enhance efficiency and accuracy, large language models (LLMs) with strong analytical and inferencing capabilities can be employed. However, high-quality analysis reports with well-structured formatting cannot be obtained through a single instruction input to the LLM. To tackle this issue, we propose a method that first decomposes the complex task into several sub-tasks and designs effective system prompts and user prompts for each sub-task. Multi-round interactions with the LLM incorporating self-check and reflection are then conducted to enable structured data extraction as well as multi-step analysis and evaluation. Furthermore, custom tools are defined and invoked to generate figures and compute metrics. We also design multiple report templates, each tailored to a specific application and input data type, ensuring their adaptability across a variety of scenarios. Extensive evaluation results demonstrate that the reports generated by our method exhibit higher quality, therefore obtaining higher scores than the baseline method.
title Data Analysis and Performance Evaluation of Simulation Deduction Based on LLMs
topic Computation and Language
Artificial Intelligence
url https://arxiv.org/abs/2511.10651