Saved in:
Bibliographic Details
Main Authors: Yang, Jingxuan, Xu, Weichao, Shi, Yuchen, Zhang, Yi, Feng, Shuo, Pei, Huaxin
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2512.09372
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866914191505358848
author Yang, Jingxuan
Xu, Weichao
Shi, Yuchen
Zhang, Yi
Feng, Shuo
Pei, Huaxin
author_facet Yang, Jingxuan
Xu, Weichao
Shi, Yuchen
Zhang, Yi
Feng, Shuo
Pei, Huaxin
contents Testing and evaluating decision-making agents remains challenging due to unknown system architectures, limited access to internal states, and the vastness of high-dimensional scenario spaces. Existing testing approaches often rely on surrogate models of decision-making agents to generate large-scale scenario libraries; however, discrepancies between surrogate models and real decision-making agents significantly limit their generalizability and practical applicability. To address this challenge, this paper proposes intelligent resilience testing (IRTest), a unified online adaptive testing framework designed to rapidly adjust to diverse decision-making agents. IRTest initializes with an offline-trained surrogate prediction model and progressively reduces surrogate-to-real gap during testing through two complementary adaptation mechanisms: (i) online neural fine-tuning in data-rich regimes, and (ii) lightweight importance-sampling-based weighting correction in data-limited regimes. A Bayesian optimization strategy, equipped with bias-corrected acquisition functions, guides scenario generation to balance exploration and exploitation in complex testing spaces. Extensive experiments across varying levels of task complexity and system heterogeneity demonstrate that IRTest consistently improves failure-discovery efficiency, testing robustness, and cross-system generalizability. These results highlight the potential of IRTest as a practical solution for scalable, adaptive, and resilient testing of decision-making agents.
format Preprint
id arxiv_https___arxiv_org_abs_2512_09372
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Intelligent Resilience Testing for Decision-Making Agents with Dual-Mode Surrogate Adaptation
Yang, Jingxuan
Xu, Weichao
Shi, Yuchen
Zhang, Yi
Feng, Shuo
Pei, Huaxin
Systems and Control
Testing and evaluating decision-making agents remains challenging due to unknown system architectures, limited access to internal states, and the vastness of high-dimensional scenario spaces. Existing testing approaches often rely on surrogate models of decision-making agents to generate large-scale scenario libraries; however, discrepancies between surrogate models and real decision-making agents significantly limit their generalizability and practical applicability. To address this challenge, this paper proposes intelligent resilience testing (IRTest), a unified online adaptive testing framework designed to rapidly adjust to diverse decision-making agents. IRTest initializes with an offline-trained surrogate prediction model and progressively reduces surrogate-to-real gap during testing through two complementary adaptation mechanisms: (i) online neural fine-tuning in data-rich regimes, and (ii) lightweight importance-sampling-based weighting correction in data-limited regimes. A Bayesian optimization strategy, equipped with bias-corrected acquisition functions, guides scenario generation to balance exploration and exploitation in complex testing spaces. Extensive experiments across varying levels of task complexity and system heterogeneity demonstrate that IRTest consistently improves failure-discovery efficiency, testing robustness, and cross-system generalizability. These results highlight the potential of IRTest as a practical solution for scalable, adaptive, and resilient testing of decision-making agents.
title Intelligent Resilience Testing for Decision-Making Agents with Dual-Mode Surrogate Adaptation
topic Systems and Control
url https://arxiv.org/abs/2512.09372