Saved in:
| Main Authors: | Chen, Linze, Cai, Yufan, Hou, Zhe, Dong, Jin Song |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2511.21033 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Which Changes Matter? Towards Trustworthy Legal AI via Relevance-Sensitive Evaluation and Solver-Grounded Reasoning
by: Linze, Chen, et al.
Published: (2026)
by: Linze, Chen, et al.
Published: (2026)
The Fusion of Large Language Models and Formal Methods for Trustworthy AI Agents: A Roadmap
by: Zhang, Yedi, et al.
Published: (2024)
by: Zhang, Yedi, et al.
Published: (2024)
X-RAY: Mapping LLM Reasoning Capability via Formalized and Calibrated Probes
by: Tianxi, Gao, et al.
Published: (2026)
by: Tianxi, Gao, et al.
Published: (2026)
Uncertainty Reasoning with Large Language Models for Explainable Disease Diagnosis
by: Fan, Xiaoyang, et al.
Published: (2026)
by: Fan, Xiaoyang, et al.
Published: (2026)
Towards Large Language Model Aided Program Refinement
by: Cai, Yufan, et al.
Published: (2024)
by: Cai, Yufan, et al.
Published: (2024)
TrustAgent: Towards Safe and Trustworthy LLM-based Agents
by: Hua, Wenyue, et al.
Published: (2024)
by: Hua, Wenyue, et al.
Published: (2024)
Bridging Legal Interpretation and Formal Logic: Faithfulness, Assumption, and the Future of AI Legal Reasoning
by: Wang, Olivia Peiyu, et al.
Published: (2026)
by: Wang, Olivia Peiyu, et al.
Published: (2026)
On Verifiable Legal Reasoning: A Multi-Agent Framework with Formalized Knowledge Representations
by: Sadowski, Albert, et al.
Published: (2025)
by: Sadowski, Albert, et al.
Published: (2025)
Towards the Formalization of a Trustworthy AI for Mining Interpretable Models explOiting Sophisticated Algorithms
by: Guidotti, Riccardo, et al.
Published: (2025)
by: Guidotti, Riccardo, et al.
Published: (2025)
Trustworthy Agents for Electronic Health Records through Confidence Estimation
by: Song, Yongwoo, et al.
Published: (2025)
by: Song, Yongwoo, et al.
Published: (2025)
Towards Trustworthy Multi-Turn LLM Agents via Behavioral Guidance
by: Gürsun, Gonca
Published: (2025)
by: Gürsun, Gonca
Published: (2025)
MLA-Trust: Benchmarking Trustworthiness of Multimodal LLM Agents in GUI Environments
by: Yang, Xiao, et al.
Published: (2025)
by: Yang, Xiao, et al.
Published: (2025)
Toward Formalizing LLM-Based Agent Designs through Structural Context Modeling and Semantic Dynamics Analysis
by: Jia, Haoyu, et al.
Published: (2026)
by: Jia, Haoyu, et al.
Published: (2026)
Hybrid Retrieval-Augmented Generation Agent for Trustworthy Legal Question Answering in Judicial Forensics
by: Xi, Yueqing, et al.
Published: (2025)
by: Xi, Yueqing, et al.
Published: (2025)
Formal Architecture Descriptors as Navigation Primitives for AI Coding Agents
by: Jin, Ruoqi
Published: (2026)
by: Jin, Ruoqi
Published: (2026)
Challenges for Generative AI in Legal Reasoning
by: Linna, Eljas, et al.
Published: (2025)
by: Linna, Eljas, et al.
Published: (2025)
A Framework for Formalizing LLM Agent Security
by: Siu, Vincent, et al.
Published: (2026)
by: Siu, Vincent, et al.
Published: (2026)
Co-Sight: Enhancing LLM-Based Agents via Conflict-Aware Meta-Verification and Trustworthy Reasoning with Structured Facts
by: Zhang, Hongwei, et al.
Published: (2025)
by: Zhang, Hongwei, et al.
Published: (2025)
Cross-Border Legal Adaptation of Autonomous Vehicle Design based on Logic and Non-monotonic Reasoning
by: Yu, Zhe, et al.
Published: (2025)
by: Yu, Zhe, et al.
Published: (2025)
Claw-Eval: Towards Trustworthy Evaluation of Autonomous Agents
by: Ye, Bowen, et al.
Published: (2026)
by: Ye, Bowen, et al.
Published: (2026)
Distributed Legal Infrastructure for a Trustworthy Agentic Web
by: Chaffer, Tomer Jordi, et al.
Published: (2026)
by: Chaffer, Tomer Jordi, et al.
Published: (2026)
Domain-Partitioned Hybrid RAG for Legal Reasoning: Toward Modular and Explainable Legal AI for India
by: Goel, Rakshita, et al.
Published: (2025)
by: Goel, Rakshita, et al.
Published: (2025)
Trustworthiness of Legal Considerations for the Use of LLMs in Education
by: Alaswad, Sara, et al.
Published: (2025)
by: Alaswad, Sara, et al.
Published: (2025)
LeMAJ (Legal LLM-as-a-Judge): Bridging Legal Reasoning and LLM Evaluation
by: Enguehard, Joseph, et al.
Published: (2025)
by: Enguehard, Joseph, et al.
Published: (2025)
EvolveR: Self-Evolving LLM Agents through an Experience-Driven Lifecycle
by: Wu, Rong, et al.
Published: (2025)
by: Wu, Rong, et al.
Published: (2025)
FM-Agent: Scaling Formal Methods to Large Systems via LLM-Based Hoare-Style Reasoning
by: Ding, Haoran, et al.
Published: (2026)
by: Ding, Haoran, et al.
Published: (2026)
From Helpful to Trustworthy: LLM Agents for Pair Programming
by: Ayon, Ragib Shahariar
Published: (2026)
by: Ayon, Ragib Shahariar
Published: (2026)
MASLegalBench: Benchmarking Multi-Agent Systems in Deductive Legal Reasoning
by: Jing, Huihao, et al.
Published: (2025)
by: Jing, Huihao, et al.
Published: (2025)
Let's Reason Formally: Natural-Formal Hybrid Reasoning Enhances LLM's Math Capability
by: Wang, Ruida, et al.
Published: (2025)
by: Wang, Ruida, et al.
Published: (2025)
Vibe Coding an LLM-powered Theorem Prover
by: Hou, Zhe
Published: (2026)
by: Hou, Zhe
Published: (2026)
Green Shielding: A User-Centric Approach Towards Trustworthy AI
by: Li, Aaron J., et al.
Published: (2026)
by: Li, Aaron J., et al.
Published: (2026)
Toward Safe and Responsible AI Agents: A Three-Pillar Model for Transparency, Accountability, and Trustworthiness
by: Cheng, Edward C., et al.
Published: (2026)
by: Cheng, Edward C., et al.
Published: (2026)
Towards Urban Planing AI Agent in the Age of Agentic AI
by: Liu, Rui, et al.
Published: (2025)
by: Liu, Rui, et al.
Published: (2025)
Towards Verifiable Multimodal Deep Research: A Multi-Agent Harness for Interleaved Report Generation
by: Zhang, Chenghao, et al.
Published: (2026)
by: Zhang, Chenghao, et al.
Published: (2026)
Ares: Adaptive Reasoning Effort Selection for Efficient LLM Agents
by: Yang, Jingbo, et al.
Published: (2026)
by: Yang, Jingbo, et al.
Published: (2026)
F$^3$Set: Towards Analyzing Fast, Frequent, and Fine-grained Events from Videos
by: Liu, Zhaoyu, et al.
Published: (2025)
by: Liu, Zhaoyu, et al.
Published: (2025)
Toward Trustworthy Evaluation of Sustainability Rating Methodologies: A Human-AI Collaborative Framework for Benchmark Dataset Construction
by: Cai, Xiaoran, et al.
Published: (2026)
by: Cai, Xiaoran, et al.
Published: (2026)
TReMu: Towards Neuro-Symbolic Temporal Reasoning for LLM-Agents with Memory in Multi-Session Dialogues
by: Ge, Yubin, et al.
Published: (2025)
by: Ge, Yubin, et al.
Published: (2025)
Towards Trustworthy Knowledge Graph Reasoning: An Uncertainty Aware Perspective
by: Ni, Bo, et al.
Published: (2024)
by: Ni, Bo, et al.
Published: (2024)
Sentinel Agents for Secure and Trustworthy Agentic AI in Multi-Agent Systems
by: Gosmar, Diego, et al.
Published: (2025)
by: Gosmar, Diego, et al.
Published: (2025)
Similar Items
-
Which Changes Matter? Towards Trustworthy Legal AI via Relevance-Sensitive Evaluation and Solver-Grounded Reasoning
by: Linze, Chen, et al.
Published: (2026) -
The Fusion of Large Language Models and Formal Methods for Trustworthy AI Agents: A Roadmap
by: Zhang, Yedi, et al.
Published: (2024) -
X-RAY: Mapping LLM Reasoning Capability via Formalized and Calibrated Probes
by: Tianxi, Gao, et al.
Published: (2026) -
Uncertainty Reasoning with Large Language Models for Explainable Disease Diagnosis
by: Fan, Xiaoyang, et al.
Published: (2026) -
Towards Large Language Model Aided Program Refinement
by: Cai, Yufan, et al.
Published: (2024)