:: Library Catalog

Bibliographic Details
Main Authors:	Jiang, Zhaoyang; Fu, Zhizhong; Kim, Yunsoo; Mi, Jiacong; Li, Zicheng; Peng, Xuanqi; Wu, Honghan
Format:	Preprint
Published:	2026
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2605.06339

Similar Items

Better Accuracies, Worse Reasoning: A Step-Level Audit of Medical Chain-of-Thought Distillation
by: Jiang, Zhaoyang, et al.
Published: (2026)

HH-SAE: Discovering and Steering Hierarchical Knowledge of Complex Manifolds
by: Wu, Honghan, et al.
Published: (2026)

LoV3D: Grounding Cognitive Prognosis Reasoning in Longitudinal 3D Brain MRI via Regional Volume Assessments
by: Jiang, Zhaoyang, et al.
Published: (2026)

Hallucination Benchmark in Medical Visual Question Answering
by: Wu, Jinge, et al.
Published: (2024)

MedExQA: Medical Question Answering Benchmark with Multiple Explanations
by: Kim, Yunsoo, et al.
Published: (2024)

Enhancing Human-Computer Interaction in Chest X-ray Analysis using Vision and Language Model with Eye Gaze Patterns
by: Kim, Yunsoo, et al.
Published: (2024)

SLaVA-CXR: Small Language and Vision Assistant for Chest X-ray Report Automation
by: Wu, Jinge, et al.
Published: (2024)

RadEyeVideo: Enhancing general-domain Large Vision Language Model for chest X-ray analysis with video representations of eye gaze
by: Kim, Yunsoo, et al.
Published: (2025)

BioHopR: A Benchmark for Multi-Hop, Multi-Answer Reasoning in Biomedical Domain
by: Kim, Yunsoo, et al.
Published: (2025)

Sentiment analysis of preservice teachers' reflections using a large language model
by: Park, Yunsoo, et al.
Published: (2024)

TALEC: Teach Your LLM to Evaluate in Specific Domain with In-house Criteria by Criteria Division and Zero-shot Plus Few-shot
by: Zhang, Kaiqi, et al.
Published: (2024)

Graph-based LLM over Semi-Structured Population Data for Dynamic Policy Response
by: Shi, Daqian, et al.
Published: (2025)

MultiModal-Learning for Predicting Molecular Properties: A Framework Based on Image and Graph Structures
by: Wang, Zhuoyuan, et al.
Published: (2023)

An Efficient Recommendation Model Based on Knowledge Graph Attention-Assisted Network (KGATAX)
by: Wu, Zhizhong
Published: (2024)

Multiple Greedy Quasi-Newton Methods for Saddle Point Problems
by: Xiao, Minheng, et al.
Published: (2024)

Select before Act: Spatially Decoupled Action Repetition for Continuous Control
by: Nie, Buqing, et al.
Published: (2025)

OLIVIA: Online Learning via Inference-time Action Adaptation for Decision Making in LLM ReAct Agents
by: Yu, Sheldon, et al.
Published: (2026)

Adjusting the Output of Decision Transformer with Action Gradient
by: Lin, Rui, et al.
Published: (2025)

SAGE-LLM: Towards Safe and Generalizable LLM Controller with Fuzzy-CBF Verification and Graph-Structured Knowledge Retrieval for UAV Decision
by: Zhao, Wenzhe, et al.
Published: (2026)

Building Decision Making Models Through Language Model Regime
by: Zhang, Yu, et al.
Published: (2024)

DecisionLLM: Large Language Models for Long Sequence Decision Exploration
by: Lv, Xiaowei, et al.
Published: (2026)

FlexiAct: Towards Flexible Action Control in Heterogeneous Scenarios
by: Zhang, Shiyi, et al.
Published: (2025)

DipLLM: Fine-Tuning LLM for Strategic Decision-making in Diplomacy
by: Xu, Kaixuan, et al.
Published: (2025)

Towards Feedback-to-Plan Decisions for Self-Evolving LLM Agents in CUDA Kernel Generation
by: Chong, Yee Hin, et al.
Published: (2026)

The State-Action-Reward-State-Action Algorithm in Spatial Prisoner's Dilemma Game
by: Yang, Lanyu, et al.
Published: (2024)

HiMem: Hierarchical Long-Term Memory for LLM Long-Horizon Agents
by: Zhang, Ningning, et al.
Published: (2026)

ReCode: Unify Plan and Action for Universal Granularity Control
by: Yu, Zhaoyang, et al.
Published: (2025)

CoSLight: Co-optimizing Collaborator Selection and Decision-making to Enhance Traffic Signal Control
by: Ruan, Jingqing, et al.
Published: (2024)

Adversarial Attacks on VQA-NLE: Exposing and Alleviating Inconsistencies in Visual Question Answering Explanations
by: Yeh, Yahsin, et al.
Published: (2025)

Evaluating LLMs for Police Decision-Making: A Framework Based on Police Action Scenarios
by: Lee, Sangyub, et al.
Published: (2026)

LLMs for High-Frequency Decision-Making: Normalized Action Reward-Guided Consistency Policy Optimization
by: Zhao, Yang, et al.
Published: (2026)

ARGORA: Orchestrated Argumentation for Causally Grounded LLM Reasoning and Decision Making
by: Jin, Youngjin, et al.
Published: (2026)

Efficient DNN-Powered Software with Fair Sparse Models
by: Gao, Xuanqi, et al.
Published: (2024)

Domain-Specialized Tree of Thought through Plug-and-Play Predictors
by: Gao, Xuanqi, et al.
Published: (2026)

Who is a Better Player: LLM against LLM
by: Zhou, Yingjie, et al.
Published: (2025)

MCP-RADAR: A Multi-Dimensional Benchmark for Evaluating Tool Use Capabilities in Large Language Models
by: Gao, Xuanqi, et al.
Published: (2025)

Grounding Sim-to-Real Generalization in Dexterous Manipulation: An Empirical Study with Vision-Language-Action Models
by: Jin, Ruixing, et al.
Published: (2026)

Executable Code Actions Elicit Better LLM Agents
by: Wang, Xingyao, et al.
Published: (2024)

AI Agents for Inventory Control: Human-LLM-OR Complementarity
by: Baek, Jackie, et al.
Published: (2026)

Confidence-Aware Decision-Making and Control for Tool Selection
by: Meera, Ajith Anil, et al.
Published: (2024)