Saved in:
| Main Author: | Gürsun, Gonca |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2512.11421 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Towards Trustworthy Legal AI through LLM Agents and Formal Reasoning
by: Chen, Linze, et al.
Published: (2025)
by: Chen, Linze, et al.
Published: (2025)
Evaluating Memory in LLM Agents via Incremental Multi-Turn Interactions
by: Hu, Yuanzhe, et al.
Published: (2025)
by: Hu, Yuanzhe, et al.
Published: (2025)
TrustAgent: Towards Safe and Trustworthy LLM-based Agents
by: Hua, Wenyue, et al.
Published: (2024)
by: Hua, Wenyue, et al.
Published: (2024)
EvoEmo: Towards Evolved Emotional Policies for Adversarial LLM Agents in Multi-Turn Price Negotiation
by: Long, Yunbo, et al.
Published: (2025)
by: Long, Yunbo, et al.
Published: (2025)
General Modular Harness for LLM Agents in Multi-Turn Gaming Environments
by: Zhang, Yuxuan, et al.
Published: (2025)
by: Zhang, Yuxuan, et al.
Published: (2025)
Evaluating Multi-Turn Bargain Skills in LLM-Based Seller Agent
by: Wang, Issue Yishu, et al.
Published: (2025)
by: Wang, Issue Yishu, et al.
Published: (2025)
Towards Trustworthy LLM-Based Recommendation via Rationale Integration
by: Park, Chung, et al.
Published: (2025)
by: Park, Chung, et al.
Published: (2025)
ProRL Agent: Rollout-as-a-Service for RL Training of Multi-Turn LLM Agents
by: Zhang, Hao, et al.
Published: (2026)
by: Zhang, Hao, et al.
Published: (2026)
Claw-Eval: Towards Trustworthy Evaluation of Autonomous Agents
by: Ye, Bowen, et al.
Published: (2026)
by: Ye, Bowen, et al.
Published: (2026)
Beyond the Strongest LLM: Multi-Turn Multi-Agent Orchestration vs. Single LLMs on Benchmarks
by: Tian, Aaron Xuxiang, et al.
Published: (2025)
by: Tian, Aaron Xuxiang, et al.
Published: (2025)
Evaluating LLM-based Agents for Multi-Turn Conversations: A Survey
by: Guan, Shengyue, et al.
Published: (2025)
by: Guan, Shengyue, et al.
Published: (2025)
RGD: Multi-LLM Based Agent Debugger via Refinement and Generation Guidance
by: Jin, Haolin, et al.
Published: (2024)
by: Jin, Haolin, et al.
Published: (2024)
Toward a Trustworthy Optimization Modeling Agent via Verifiable Synthetic Data Generation
by: Lima, Vinicius, et al.
Published: (2025)
by: Lima, Vinicius, et al.
Published: (2025)
RAGEN: Understanding Self-Evolution in LLM Agents via Multi-Turn Reinforcement Learning
by: Wang, Zihan, et al.
Published: (2025)
by: Wang, Zihan, et al.
Published: (2025)
Improving the Safety and Trustworthiness of Medical AI via Multi-Agent Evaluation Loops
by: Ghafoor, Zainab, et al.
Published: (2026)
by: Ghafoor, Zainab, et al.
Published: (2026)
Proximity-Based Multi-Turn Optimization: Practical Credit Assignment for LLM Agent Training
by: Fang, Yangyi, et al.
Published: (2026)
by: Fang, Yangyi, et al.
Published: (2026)
MEMO: Memory-Augmented Model Context Optimization for Robust Multi-Turn Multi-Agent LLM Games
by: Xie, Yunfei, et al.
Published: (2026)
by: Xie, Yunfei, et al.
Published: (2026)
MASteer: Multi-Agent Adaptive Steer Strategy for End-to-End LLM Trustworthiness Repair
by: Li, Changqing, et al.
Published: (2025)
by: Li, Changqing, et al.
Published: (2025)
From Helpful to Trustworthy: LLM Agents for Pair Programming
by: Ayon, Ragib Shahariar
Published: (2026)
by: Ayon, Ragib Shahariar
Published: (2026)
MLA-Trust: Benchmarking Trustworthiness of Multimodal LLM Agents in GUI Environments
by: Yang, Xiao, et al.
Published: (2025)
by: Yang, Xiao, et al.
Published: (2025)
TSR: Trajectory-Search Rollouts for Multi-Turn RL of LLM Agents
by: Djuhera, Aladin, et al.
Published: (2026)
by: Djuhera, Aladin, et al.
Published: (2026)
Scaling up Multi-Turn Off-Policy RL and Multi-Agent Tree Search for LLM Step-Provers
by: Xin, Ran, et al.
Published: (2025)
by: Xin, Ran, et al.
Published: (2025)
Building Coding Agents via Entropy-Enhanced Multi-Turn Preference Optimization
by: Yu, Jiahao, et al.
Published: (2025)
by: Yu, Jiahao, et al.
Published: (2025)
Metacognitive Reuse: Turning Recurring LLM Reasoning Into Concise Behaviors
by: Didolkar, Aniket, et al.
Published: (2025)
by: Didolkar, Aniket, et al.
Published: (2025)
TrajAD: Trajectory Anomaly Detection for Trustworthy LLM Agents
by: Liu, Yibing, et al.
Published: (2026)
by: Liu, Yibing, et al.
Published: (2026)
Facilitating Trustworthy Human-Agent Collaboration in LLM-based Multi-Agent System oriented Software Engineering
by: Ronanki, Krishna
Published: (2025)
by: Ronanki, Krishna
Published: (2025)
Sentinel Agents for Secure and Trustworthy Agentic AI in Multi-Agent Systems
by: Gosmar, Diego, et al.
Published: (2025)
by: Gosmar, Diego, et al.
Published: (2025)
Adaptive Stopping for Multi-Turn LLM Reasoning
by: Zhou, Xiaofan, et al.
Published: (2026)
by: Zhou, Xiaofan, et al.
Published: (2026)
The Echo Chamber Multi-Turn LLM Jailbreak
by: Alobaid, Ahmad, et al.
Published: (2026)
by: Alobaid, Ahmad, et al.
Published: (2026)
Mitigating Conversational Inertia in Multi-Turn Agents
by: Wan, Yang, et al.
Published: (2026)
by: Wan, Yang, et al.
Published: (2026)
ECG-Agent: On-Device Tool-Calling Agent for ECG Multi-Turn Dialogue
by: Chung, Hyunseung, et al.
Published: (2026)
by: Chung, Hyunseung, et al.
Published: (2026)
Drift-Bench: Diagnosing Cooperative Breakdowns in LLM Agents under Input Faults via Multi-Turn Interaction
by: Bao, Han, et al.
Published: (2026)
by: Bao, Han, et al.
Published: (2026)
Protect: Towards Robust Guardrailing Stack for Trustworthy Enterprise LLM Systems
by: Avinash, Karthik, et al.
Published: (2025)
by: Avinash, Karthik, et al.
Published: (2025)
Toward Automated and Trustworthy Scientific Analysis and Visualization with LLM-Generated Code
by: Chakroborti, Apu Kumar, et al.
Published: (2025)
by: Chakroborti, Apu Kumar, et al.
Published: (2025)
AIVV: Neuro-Symbolic LLM Agent-Integrated Verification and Validation for Trustworthy Autonomous Systems
by: Kwon, Jiyong, et al.
Published: (2026)
by: Kwon, Jiyong, et al.
Published: (2026)
Co-Sight: Enhancing LLM-Based Agents via Conflict-Aware Meta-Verification and Trustworthy Reasoning with Structured Facts
by: Zhang, Hongwei, et al.
Published: (2025)
by: Zhang, Hongwei, et al.
Published: (2025)
Agent Drift: Quantifying Behavioral Degradation in Multi-Agent LLM Systems Over Extended Interactions
by: Rath, Abhishek
Published: (2026)
by: Rath, Abhishek
Published: (2026)
Trustworthy AI Psychotherapy: Multi-Agent LLM Workflow for Counseling and Explainable Mental Disorder Diagnosis
by: Ozgun, Mithat Can, et al.
Published: (2025)
by: Ozgun, Mithat Can, et al.
Published: (2025)
Automating Deception: Scalable Multi-Turn LLM Jailbreaks
by: Kumarappan, Adarsh, et al.
Published: (2025)
by: Kumarappan, Adarsh, et al.
Published: (2025)
Towards Mitigating Excessive Forgetting in LLM Unlearning via Entanglement-Guidance with Proxy Constraint
by: Liu, Zhihao, et al.
Published: (2025)
by: Liu, Zhihao, et al.
Published: (2025)
Similar Items
-
Towards Trustworthy Legal AI through LLM Agents and Formal Reasoning
by: Chen, Linze, et al.
Published: (2025) -
Evaluating Memory in LLM Agents via Incremental Multi-Turn Interactions
by: Hu, Yuanzhe, et al.
Published: (2025) -
TrustAgent: Towards Safe and Trustworthy LLM-based Agents
by: Hua, Wenyue, et al.
Published: (2024) -
EvoEmo: Towards Evolved Emotional Policies for Adversarial LLM Agents in Multi-Turn Price Negotiation
by: Long, Yunbo, et al.
Published: (2025) -
General Modular Harness for LLM Agents in Multi-Turn Gaming Environments
by: Zhang, Yuxuan, et al.
Published: (2025)