:: Library Catalog

Bibliographic Details
Main Authors:	Zhang, Zuoyu; Zhu, Yancheng
Format:	Preprint
Published:	2026
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2605.03242

Similar Items

DeceptionBench: A Comprehensive Benchmark for AI Deception Behaviors in Real-world Scenarios
by: Huang, Yao, et al.
Published: (2025)

Enhancing Tool Calling in LLMs with the International Tool Calling Dataset
by: Zhang, Zuoyu, et al.
Published: (2026)

Deceptive Risk Minimization: Out-of-Distribution Generalization by Deceiving Distribution Shift Detectors
by: Majumdar, Anirudha
Published: (2025)

LegalReasoner: Step-wised Verification-Correction for Legal Judgment Reasoning
by: Shi, Weijie, et al.
Published: (2025)

Intentional Deception as Controllable Capability in LLM Agents
by: Starace, Jason, et al.
Published: (2026)

Elicit and Enhance: Advancing Multimodal Reasoning in Medical Scenarios
by: Huang, Zhongzhen, et al.
Published: (2025)

Towards Benchmarking and Assessing the Safety and Robustness of Autonomous Driving on Safety-critical Scenarios
by: Li, Jingzheng, et al.
Published: (2025)

Safety2Drive: Safety-Critical Scenario Benchmark for the Evaluation of Autonomous Driving
by: Li, Jingzheng, et al.
Published: (2025)

Chinese SafetyQA: A Safety Short-form Factuality Benchmark for Large Language Models
by: Tan, Yingshui, et al.
Published: (2024)

AMS-IO-Bench and AMS-IO-Agent: Benchmarking and Structured Reasoning for Analog and Mixed-Signal Integrated Circuit Input/Output Design
by: Zhang, Zhishuai, et al.
Published: (2025)

AI Deception: Risks, Dynamics, and Controls
by: Chen, Boyuan, et al.
Published: (2025)

AgentDrive: An Open Benchmark Dataset for Agentic AI Reasoning with LLM-Generated Scenarios in Autonomous Systems
by: Ferrag, Mohamed Amine, et al.
Published: (2026)

CausalFlip: A Benchmark for LLM Causal Judgment Beyond Semantic Matching
by: Wang, Yuzhe, et al.
Published: (2026)

CARV: A Diagnostic Benchmark for Compositional Analogical Reasoning in Multimodal LLMs
by: Du, Yongkang, et al.
Published: (2026)

LPS-Bench: Benchmarking Safety Awareness of Computer-Use Agents in Long-Horizon Planning under Benign and Adversarial Scenarios
by: Chen, Tianyu, et al.
Published: (2026)

Out-of-Distribution Detection for Safety Assurance of AI and Autonomous Systems
by: Hodge, Victoria J., et al.
Published: (2025)

MobilityBench: A Benchmark for Evaluating Route-Planning Agents in Real-World Mobility Scenarios
by: Song, Zhiheng, et al.
Published: (2026)

Exploring the Necessity of Reasoning in LLM-based Agent Scenarios
by: Zhou, Xueyang, et al.
Published: (2025)

FREA: Feasibility-Guided Generation of Safety-Critical Scenarios with Reasonable Adversariality
by: Chen, Keyu, et al.
Published: (2024)

Science Out of Its Ivory Tower: Improving Accessibility with Reinforcement Learning
by: Wang, Haining, et al.
Published: (2024)

MPR-GUI: Benchmarking and Enhancing Multilingual Perception and Reasoning in GUI Agents
by: Chen, Ruihan, et al.
Published: (2025)

Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning
by: Wang, Yiming, et al.
Published: (2024)

CR4T: Rewrite-Based Guardrails for Adolescent LLM Safety
by: An, Heajun, et al.
Published: (2026)

Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training
by: Hubinger, Evan, et al.
Published: (2024)

CrashAgent: Crash Scenario Generation via Multi-modal Reasoning
by: Li, Miao, et al.
Published: (2025)

Benchmarking MLLM-based Web Understanding: Reasoning, Robustness and Safety
by: Liu, Junliang, et al.
Published: (2025)

AnalogAgent: Self-Improving Analog Circuit Design Automation with LLM Agents
by: Bao, Zhixuan, et al.
Published: (2026)

Reasoning Court: Combining Reasoning, Action, and Judgment for Multi-Hop Reasoning
by: Wu, Jingtian, et al.
Published: (2025)

DecepChain: Inducing Deceptive Reasoning in Large Language Models
by: Shen, Wei, et al.
Published: (2025)

The Agent's First Day: Benchmarking Learning, Exploration, and Scheduling in the Workplace Scenarios
by: Fu, Daocheng, et al.
Published: (2026)

Retrieval- and Argumentation-Enhanced Multi-Agent LLMs for Judgmental Forecasting (Extended Version with Supplementary Material)
by: Gorur, Deniz, et al.
Published: (2025)

UltraHorizon: Benchmarking Agent Capabilities in Ultra Long-Horizon Scenarios
by: Luo, Haotian, et al.
Published: (2025)

A Distribution Semantics for Probabilistic Term Rewriting
by: Vidal, Germán
Published: (2024)

AgentEscapeBench: Evaluating Out-of-Domain Tool-Grounded Reasoning in LLM Agents
by: Guo, Zhengkang, et al.
Published: (2026)

OODBench: Out-of-Distribution Benchmark for Large Vision-Language Models
by: Lin, Ling, et al.
Published: (2026)

OpenDeception: Learning Deception and Trust in Human-AI Interaction via Multi-Agent Simulation
by: Wu, Yichen, et al.
Published: (2025)

Don't Click That: Teaching Web Agents to Resist Deceptive Interfaces
by: Zhang, Yilin, et al.
Published: (2026)

I-RAVEN-X: Benchmarking Generalization and Robustness of Analogical and Mathematical Reasoning in Large Language and Reasoning Models
by: Camposampiero, Giacomo, et al.
Published: (2025)

Re4: Scientific Computing Agent with Rewriting, Resolution, Review and Revision
by: Cheng, Ao, et al.
Published: (2025)

Enhancing Analogical Reasoning in the Abstraction and Reasoning Corpus via Model-Based RL
by: Lee, Jihwan, et al.
Published: (2024)