Saved in:
| Main Authors: | Zhang, Zuoyu, Zhu, Yancheng |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.03242 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
DeceptionBench: A Comprehensive Benchmark for AI Deception Behaviors in Real-world Scenarios
by: Huang, Yao, et al.
Published: (2025)
by: Huang, Yao, et al.
Published: (2025)
Enhancing Tool Calling in LLMs with the International Tool Calling Dataset
by: Zhang, Zuoyu, et al.
Published: (2026)
by: Zhang, Zuoyu, et al.
Published: (2026)
Deceptive Risk Minimization: Out-of-Distribution Generalization by Deceiving Distribution Shift Detectors
by: Majumdar, Anirudha
Published: (2025)
by: Majumdar, Anirudha
Published: (2025)
LegalReasoner: Step-wised Verification-Correction for Legal Judgment Reasoning
by: Shi, Weijie, et al.
Published: (2025)
by: Shi, Weijie, et al.
Published: (2025)
Intentional Deception as Controllable Capability in LLM Agents
by: Starace, Jason, et al.
Published: (2026)
by: Starace, Jason, et al.
Published: (2026)
Elicit and Enhance: Advancing Multimodal Reasoning in Medical Scenarios
by: Huang, Zhongzhen, et al.
Published: (2025)
by: Huang, Zhongzhen, et al.
Published: (2025)
Towards Benchmarking and Assessing the Safety and Robustness of Autonomous Driving on Safety-critical Scenarios
by: Li, Jingzheng, et al.
Published: (2025)
by: Li, Jingzheng, et al.
Published: (2025)
Safety2Drive: Safety-Critical Scenario Benchmark for the Evaluation of Autonomous Driving
by: Li, Jingzheng, et al.
Published: (2025)
by: Li, Jingzheng, et al.
Published: (2025)
Chinese SafetyQA: A Safety Short-form Factuality Benchmark for Large Language Models
by: Tan, Yingshui, et al.
Published: (2024)
by: Tan, Yingshui, et al.
Published: (2024)
AMS-IO-Bench and AMS-IO-Agent: Benchmarking and Structured Reasoning for Analog and Mixed-Signal Integrated Circuit Input/Output Design
by: Zhang, Zhishuai, et al.
Published: (2025)
by: Zhang, Zhishuai, et al.
Published: (2025)
AI Deception: Risks, Dynamics, and Controls
by: Chen, Boyuan, et al.
Published: (2025)
by: Chen, Boyuan, et al.
Published: (2025)
AgentDrive: An Open Benchmark Dataset for Agentic AI Reasoning with LLM-Generated Scenarios in Autonomous Systems
by: Ferrag, Mohamed Amine, et al.
Published: (2026)
by: Ferrag, Mohamed Amine, et al.
Published: (2026)
CausalFlip: A Benchmark for LLM Causal Judgment Beyond Semantic Matching
by: Wang, Yuzhe, et al.
Published: (2026)
by: Wang, Yuzhe, et al.
Published: (2026)
CARV: A Diagnostic Benchmark for Compositional Analogical Reasoning in Multimodal LLMs
by: Du, Yongkang, et al.
Published: (2026)
by: Du, Yongkang, et al.
Published: (2026)
LPS-Bench: Benchmarking Safety Awareness of Computer-Use Agents in Long-Horizon Planning under Benign and Adversarial Scenarios
by: Chen, Tianyu, et al.
Published: (2026)
by: Chen, Tianyu, et al.
Published: (2026)
Out-of-Distribution Detection for Safety Assurance of AI and Autonomous Systems
by: Hodge, Victoria J., et al.
Published: (2025)
by: Hodge, Victoria J., et al.
Published: (2025)
MobilityBench: A Benchmark for Evaluating Route-Planning Agents in Real-World Mobility Scenarios
by: Song, Zhiheng, et al.
Published: (2026)
by: Song, Zhiheng, et al.
Published: (2026)
Exploring the Necessity of Reasoning in LLM-based Agent Scenarios
by: Zhou, Xueyang, et al.
Published: (2025)
by: Zhou, Xueyang, et al.
Published: (2025)
FREA: Feasibility-Guided Generation of Safety-Critical Scenarios with Reasonable Adversariality
by: Chen, Keyu, et al.
Published: (2024)
by: Chen, Keyu, et al.
Published: (2024)
Science Out of Its Ivory Tower: Improving Accessibility with Reinforcement Learning
by: Wang, Haining, et al.
Published: (2024)
by: Wang, Haining, et al.
Published: (2024)
MPR-GUI: Benchmarking and Enhancing Multilingual Perception and Reasoning in GUI Agents
by: Chen, Ruihan, et al.
Published: (2025)
by: Chen, Ruihan, et al.
Published: (2025)
Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning
by: Wang, Yiming, et al.
Published: (2024)
by: Wang, Yiming, et al.
Published: (2024)
CR4T: Rewrite-Based Guardrails for Adolescent LLM Safety
by: An, Heajun, et al.
Published: (2026)
by: An, Heajun, et al.
Published: (2026)
Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training
by: Hubinger, Evan, et al.
Published: (2024)
by: Hubinger, Evan, et al.
Published: (2024)
CrashAgent: Crash Scenario Generation via Multi-modal Reasoning
by: Li, Miao, et al.
Published: (2025)
by: Li, Miao, et al.
Published: (2025)
Benchmarking MLLM-based Web Understanding: Reasoning, Robustness and Safety
by: Liu, Junliang, et al.
Published: (2025)
by: Liu, Junliang, et al.
Published: (2025)
AnalogAgent: Self-Improving Analog Circuit Design Automation with LLM Agents
by: Bao, Zhixuan, et al.
Published: (2026)
by: Bao, Zhixuan, et al.
Published: (2026)
Reasoning Court: Combining Reasoning, Action, and Judgment for Multi-Hop Reasoning
by: Wu, Jingtian, et al.
Published: (2025)
by: Wu, Jingtian, et al.
Published: (2025)
DecepChain: Inducing Deceptive Reasoning in Large Language Models
by: Shen, Wei, et al.
Published: (2025)
by: Shen, Wei, et al.
Published: (2025)
The Agent's First Day: Benchmarking Learning, Exploration, and Scheduling in the Workplace Scenarios
by: Fu, Daocheng, et al.
Published: (2026)
by: Fu, Daocheng, et al.
Published: (2026)
Retrieval- and Argumentation-Enhanced Multi-Agent LLMs for Judgmental Forecasting (Extended Version with Supplementary Material)
by: Gorur, Deniz, et al.
Published: (2025)
by: Gorur, Deniz, et al.
Published: (2025)
UltraHorizon: Benchmarking Agent Capabilities in Ultra Long-Horizon Scenarios
by: Luo, Haotian, et al.
Published: (2025)
by: Luo, Haotian, et al.
Published: (2025)
A Distribution Semantics for Probabilistic Term Rewriting
by: Vidal, Germán
Published: (2024)
by: Vidal, Germán
Published: (2024)
AgentEscapeBench: Evaluating Out-of-Domain Tool-Grounded Reasoning in LLM Agents
by: Guo, Zhengkang, et al.
Published: (2026)
by: Guo, Zhengkang, et al.
Published: (2026)
OODBench: Out-of-Distribution Benchmark for Large Vision-Language Models
by: Lin, Ling, et al.
Published: (2026)
by: Lin, Ling, et al.
Published: (2026)
OpenDeception: Learning Deception and Trust in Human-AI Interaction via Multi-Agent Simulation
by: Wu, Yichen, et al.
Published: (2025)
by: Wu, Yichen, et al.
Published: (2025)
Don't Click That: Teaching Web Agents to Resist Deceptive Interfaces
by: Zhang, Yilin, et al.
Published: (2026)
by: Zhang, Yilin, et al.
Published: (2026)
I-RAVEN-X: Benchmarking Generalization and Robustness of Analogical and Mathematical Reasoning in Large Language and Reasoning Models
by: Camposampiero, Giacomo, et al.
Published: (2025)
by: Camposampiero, Giacomo, et al.
Published: (2025)
Re4: Scientific Computing Agent with Rewriting, Resolution, Review and Revision
by: Cheng, Ao, et al.
Published: (2025)
by: Cheng, Ao, et al.
Published: (2025)
Enhancing Analogical Reasoning in the Abstraction and Reasoning Corpus via Model-Based RL
by: Lee, Jihwan, et al.
Published: (2024)
by: Lee, Jihwan, et al.
Published: (2024)
Similar Items
-
DeceptionBench: A Comprehensive Benchmark for AI Deception Behaviors in Real-world Scenarios
by: Huang, Yao, et al.
Published: (2025) -
Enhancing Tool Calling in LLMs with the International Tool Calling Dataset
by: Zhang, Zuoyu, et al.
Published: (2026) -
Deceptive Risk Minimization: Out-of-Distribution Generalization by Deceiving Distribution Shift Detectors
by: Majumdar, Anirudha
Published: (2025) -
LegalReasoner: Step-wised Verification-Correction for Legal Judgment Reasoning
by: Shi, Weijie, et al.
Published: (2025) -
Intentional Deception as Controllable Capability in LLM Agents
by: Starace, Jason, et al.
Published: (2026)