Saved in:
| Main Authors: | Li, Yu, Tang, Sizhe, Chen, Rongqian, Yu, Fei Xu, Jiang, Guangyu, Imani, Mahdi, Bastian, Nathaniel D., Lan, Tian |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2601.02196 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
NonZero: Interaction-Guided Exploration for Multi-Agent Monte Carlo Tree Search
by: Tang, Sizhe, et al.
Published: (2026)
by: Tang, Sizhe, et al.
Published: (2026)
Interactive Critique-Revision Training for Reliable Structured LLM Generation
by: Yu, Fei Xu, et al.
Published: (2026)
by: Yu, Fei Xu, et al.
Published: (2026)
Metric-Gradient Projection for Stable Multi-Agent Policy Learning
by: Zhang, Zuyuan, et al.
Published: (2026)
by: Zhang, Zuyuan, et al.
Published: (2026)
Optimizing Prompt Sequences using Monte Carlo Tree Search for LLM-Based Optimization
by: Yu, Fei Xu, et al.
Published: (2025)
by: Yu, Fei Xu, et al.
Published: (2025)
Reason in Chains, Learn in Trees: Self-Rectification and Grafting for Multi-turn Agent Policy Optimization
by: Li, Yu, et al.
Published: (2026)
by: Li, Yu, et al.
Published: (2026)
Agent Alpha: Tree Search Unifying Generation, Exploration and Evaluation for Computer-Use Agents
by: Tang, Sizhe, et al.
Published: (2026)
by: Tang, Sizhe, et al.
Published: (2026)
Geometry of Drifting MDPs with Path-Integral Stability Certificates
by: Zhang, Zuyuan, et al.
Published: (2026)
by: Zhang, Zuyuan, et al.
Published: (2026)
Bayesian Optimization through Gaussian Cox Process Models for Spatio-temporal Data
by: Mei, Yongsheng, et al.
Published: (2024)
by: Mei, Yongsheng, et al.
Published: (2024)
Manifold-Constrained Energy-Based Transition Models for Offline Reinforcement Learning
by: Fang, Zeyu, et al.
Published: (2026)
by: Fang, Zeyu, et al.
Published: (2026)
Cochain Perspectives on Temporal-Difference Signals for Learning Beyond Markov Dynamics
by: Zhang, Zuyuan, et al.
Published: (2026)
by: Zhang, Zuyuan, et al.
Published: (2026)
A Few Large Shifts: Layer-Inconsistency Based Minimal Overhead Adversarial Example Detection
by: Yun, Sanggeon, et al.
Published: (2025)
by: Yun, Sanggeon, et al.
Published: (2025)
MissionHD: Hyperdimensional Refinement of Distribution-Deficient Reasoning Graphs for Video Anomaly Detection
by: Yun, Sanggeon, et al.
Published: (2025)
by: Yun, Sanggeon, et al.
Published: (2025)
Automated Cyber Defense with Generalizable Graph-based Reinforcement Learning Agents
by: King, Isaiah J., et al.
Published: (2025)
by: King, Isaiah J., et al.
Published: (2025)
FedQHD: Closed-Form Function-Space Federated Reinforcement Learning
by: Hou, Yuchen, et al.
Published: (2026)
by: Hou, Yuchen, et al.
Published: (2026)
Global Optimization on Graph-Structured Data via Gaussian Processes with Spectral Representations
by: Hong, Shu, et al.
Published: (2025)
by: Hong, Shu, et al.
Published: (2025)
IntentScore: Intent-Conditioned Action Evaluation for Computer-Use Agents
by: Chen, Rongqian, et al.
Published: (2026)
by: Chen, Rongqian, et al.
Published: (2026)
$n$-Musketeers: Reinforcement Learning Shapes Collaboration Among Language Models
by: Masukawa, Ryozo, et al.
Published: (2026)
by: Masukawa, Ryozo, et al.
Published: (2026)
LogHD: Robust Compression of Hyperdimensional Classifiers via Logarithmic Class-Axis Reduction
by: Yun, Sanggeon, et al.
Published: (2025)
by: Yun, Sanggeon, et al.
Published: (2025)
Continuous GNN-based Anomaly Detection on Edge using Efficient Adaptive Knowledge Graph Learning
by: Yun, Sanggeon, et al.
Published: (2024)
by: Yun, Sanggeon, et al.
Published: (2024)
Generalized Holographic Reduced Representations
by: Yeung, Calvin, et al.
Published: (2024)
by: Yeung, Calvin, et al.
Published: (2024)
Operator-Guided Invariance Learning for Continuous Reinforcement Learning
by: Zhang, Zuyuan, et al.
Published: (2026)
by: Zhang, Zuyuan, et al.
Published: (2026)
Towards Type Agnostic Cyber Defense Agents
by: Galinkin, Erick, et al.
Published: (2024)
by: Galinkin, Erick, et al.
Published: (2024)
MALinZero: Efficient Low-Dimensional Search for Mastering Complex Multi-Agent Planning
by: Tang, Sizhe, et al.
Published: (2025)
by: Tang, Sizhe, et al.
Published: (2025)
RGMDT: Return-Gap-Minimizing Decision Tree Extraction in Non-Euclidean Metric Space
by: Chen, Jingdi, et al.
Published: (2024)
by: Chen, Jingdi, et al.
Published: (2024)
DrugAgent: Automating AI-aided Drug Discovery Programming through LLM Multi-Agent Collaboration
by: Liu, Sizhe, et al.
Published: (2024)
by: Liu, Sizhe, et al.
Published: (2024)
PacketCLIP: Multi-Modal Embedding of Network Traffic and Language for Cybersecurity Reasoning
by: Masukawa, Ryozo, et al.
Published: (2025)
by: Masukawa, Ryozo, et al.
Published: (2025)
Towards Self-Improvement of LLMs via MCTS: Leveraging Stepwise Knowledge with Curriculum Preference Learning
by: Wang, Xiyao, et al.
Published: (2024)
by: Wang, Xiyao, et al.
Published: (2024)
Designing Robust Cyber-Defense Agents with Evolving Behavior Trees
by: Potteiger, Nicholas, et al.
Published: (2024)
by: Potteiger, Nicholas, et al.
Published: (2024)
Perception Graph for Cognitive Attack Reasoning in Augmented Reality
by: Chen, Rongqian, et al.
Published: (2025)
by: Chen, Rongqian, et al.
Published: (2025)
Explainable Autonomous Cyber Defense using Adversarial Multi-Agent Reinforcement Learning
by: Zhang, Yiyao, et al.
Published: (2026)
by: Zhang, Yiyao, et al.
Published: (2026)
GraphMaster: Automated Graph Synthesis via LLM Agents in Data-Limited Environments
by: Du, Enjun, et al.
Published: (2025)
by: Du, Enjun, et al.
Published: (2025)
CENSOR: Defense Against Gradient Inversion via Orthogonal Subspace Bayesian Sampling
by: Zhang, Kaiyuan, et al.
Published: (2025)
by: Zhang, Kaiyuan, et al.
Published: (2025)
ANALYSE -- Learning to Attack Cyber-Physical Energy Systems With Intelligent Agents
by: Wolgast, Thomas, et al.
Published: (2023)
by: Wolgast, Thomas, et al.
Published: (2023)
Reasoning Knowledge-Gap in Drone Planning via LLM-based Active Elicitation
by: Fang, Zeyu, et al.
Published: (2026)
by: Fang, Zeyu, et al.
Published: (2026)
PoolFlip: A Multi-Agent Reinforcement Learning Security Environment for Cyber Defense
by: Cadet, Xavier, et al.
Published: (2025)
by: Cadet, Xavier, et al.
Published: (2025)
Beyond the Heatmap: A Rigorous Evaluation of Component Impact in MCTS-Based TSP Solvers
by: Pan, Xuanhao, et al.
Published: (2024)
by: Pan, Xuanhao, et al.
Published: (2024)
PIME: Prototype-based Interpretable MCTS-Enhanced Brain Network Analysis for Disorder Diagnosis
by: Zhang, Kunyu, et al.
Published: (2026)
by: Zhang, Kunyu, et al.
Published: (2026)
Quantitative Resilience Modeling for Autonomous Cyber Defense
by: Cadet, Xavier, et al.
Published: (2025)
by: Cadet, Xavier, et al.
Published: (2025)
Event-Driven Temporal Graph Networks for Asynchronous Multi-Agent Cyber Defense in NetForge_RL
by: Jankowski, Igor
Published: (2026)
by: Jankowski, Igor
Published: (2026)
A State-Space Approach to Nonstationary Discriminant Analysis
by: Xie, Shuilian, et al.
Published: (2025)
by: Xie, Shuilian, et al.
Published: (2025)
Similar Items
-
NonZero: Interaction-Guided Exploration for Multi-Agent Monte Carlo Tree Search
by: Tang, Sizhe, et al.
Published: (2026) -
Interactive Critique-Revision Training for Reliable Structured LLM Generation
by: Yu, Fei Xu, et al.
Published: (2026) -
Metric-Gradient Projection for Stable Multi-Agent Policy Learning
by: Zhang, Zuyuan, et al.
Published: (2026) -
Optimizing Prompt Sequences using Monte Carlo Tree Search for LLM-Based Optimization
by: Yu, Fei Xu, et al.
Published: (2025) -
Reason in Chains, Learn in Trees: Self-Rectification and Grafting for Multi-turn Agent Policy Optimization
by: Li, Yu, et al.
Published: (2026)