:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Cui, Christopher Z., Killian, Taylor W., Ammanabrolu, Prithviraj
Format:	Preprint
Published:	2026
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2605.07021
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Beyond Needle(s) in the Embodied Haystack: Environment, Architecture, and Training Considerations for Long Context Reasoning
by: Kim, Bosung, et al.
Published: (2025)

How Reasoning Evolves from Post-Training Data: An Empirical Study Using Chess
by: Dionisopoulos, Lucas, et al.
Published: (2026)

A Practitioner's Guide to Multi-turn Agentic Reinforcement Learning
by: Wang, Ruiyi, et al.
Published: (2025)

TALES: Text Adventure Learning Environment Suite
by: Cui, Christopher Zhang, et al.
Published: (2025)

Preference-Based Learning in Audio Applications: A Systematic Analysis
by: Broukhim, Aaron, et al.
Published: (2025)

Simultaneous Multi-objective Alignment Across Verifiable and Non-verifiable Rewards
by: Shen, Yiran, et al.
Published: (2025)

How to Instruct Your Robot: Dense Language Annotations Power Robot Policy Learning
by: Kim, Bosung, et al.
Published: (2026)

Long Grounded Thoughts: Synthesizing Visual Problems and Reasoning Chains at Scale
by: Acuna, David, et al.
Published: (2025)

Introspective X Training: Feedback Conditioning Improves Scaling Across all LLM Training Stages
by: Cui, Brandon, et al.
Published: (2026)

Concise Reasoning in the Lens of Lagrangian Optimization
by: Gao, Chengqian, et al.
Published: (2025)

STAR-S: Improving Safety Alignment through Self-Taught Reasoning on Safety Rules
by: Wu, Di, et al.
Published: (2026)

Safety Through Reasoning: An Empirical Study of Reasoning Guardrail Models
by: Sreedhar, Makesh Narsimhan, et al.
Published: (2025)

Efficient Agentic Reasoning Through Self-Regulated Simulative Planning
by: Deng, Mingkai, et al.
Published: (2026)

Governance-Constrained Agentic AI: Blockchain-Enforced Human Oversight for Safety-Critical Wildfire Monitoring
by: Akarma, Ali, et al.
Published: (2026)

FindTheFlaws: Annotated Errors for Detecting Flawed Reasoning and Scalable Oversight Research
by: Recchia, Gabriel, et al.
Published: (2025)

SafeCoT: Improving VLM Safety with Minimal Reasoning
by: Ma, Jiachen, et al.
Published: (2025)

When Reasoning Traces Become Performative: Step-Level Evidence that Chain-of-Thought Is an Imperfect Oversight Channel
by: Li, Wenkai, et al.
Published: (2026)

Safety Compliance: Rethinking LLM Safety Reasoning through the Lens of Compliance
by: Hu, Wenbin, et al.
Published: (2025)

What's in the Box? Reasoning about Unseen Objects from Multimodal Cues
by: Ying, Lance, et al.
Published: (2025)

Optimizing Reasoning Efficiency through Prompt Difficulty Prediction
by: Zhao, Bo, et al.
Published: (2025)

Reasoning Structure Matters for Safety Alignment of Reasoning Models
by: In, Yeonjun, et al.
Published: (2026)

Efficiency Will Not Lead to Sustainable Reasoning AI
by: Wiesner, Philipp, et al.
Published: (2025)

Tiered Agentic Oversight: A Hierarchical Multi-Agent System for Healthcare Safety
by: Kim, Yubin, et al.
Published: (2025)

Noise Injection Systemically Degrades Large Language Model Safety Guardrails
by: Shahani, Prithviraj Singh, et al.
Published: (2025)

Modeling Human Beliefs about AI Behavior for Scalable Oversight
by: Lang, Leon, et al.
Published: (2025)

Transformer Guided Coevolution: Improved Team Selection in Multiagent Adversarial Team Games
by: Rajbhandari, Pranav, et al.
Published: (2024)

SAINT: Attention-Based Policies for Discrete Combinatorial Action Spaces
by: Landers, Matthew, et al.
Published: (2025)

The Oversight Game: Learning to Cooperatively Balance an AI Agent's Safety and Autonomy
by: Overman, William, et al.
Published: (2025)

Beyond Content Safety: Real-Time Monitoring for Reasoning Vulnerabilities in Large Language Models
by: Wang, Xunguang, et al.
Published: (2026)

Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text
by: Lu, Ximing, et al.
Published: (2026)

ReasoningShield: Safety Detection over Reasoning Traces of Large Reasoning Models
by: Li, Changyi, et al.
Published: (2025)

Multi-Agent Reasoning Improves Compute Efficiency: Pareto-Optimal Test-Time Scaling
by: Wunderlich, Florian Valentin, et al.
Published: (2026)

What Really Improves Mathematical Reasoning: Structured Reasoning Signals Beyond Pure Code
by: Zhao, Yuze, et al.
Published: (2026)

Evaluating the Safety and Skill Reasoning of Large Reasoning Models Under Compute Constraints
by: Balaji, Adarsha, et al.
Published: (2025)

Reasoning as an Adaptive Defense for Safety
by: Kim, Taeyoun, et al.
Published: (2025)

Safety Reasoning with Guidelines
by: Wang, Haoyu, et al.
Published: (2025)

Topology of Reasoning: Understanding Large Reasoning Models through Reasoning Graph Properties
by: Minegishi, Gouki, et al.
Published: (2025)

GThinker: Towards General Multimodal Reasoning via Cue-Guided Rethinking
by: Zhan, Yufei, et al.
Published: (2025)

Calibrating Conservatism for Scalable Oversight
by: Overman, William, et al.
Published: (2026)

CLORE: Content-Level Optimization for Reasoning Efficiency
by: Wu, Yuyang, et al.
Published: (2026)