Saved in:
| Main Authors: | Jiang, Zhaoyang, Fu, Zhizhong, Kim, Yunsoo, Mi, Jiacong, Li, Zicheng, Peng, Xuanqi, Wu, Honghan |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.06339 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Better Accuracies, Worse Reasoning: A Step-Level Audit of Medical Chain-of-Thought Distillation
by: Jiang, Zhaoyang, et al.
Published: (2026)
by: Jiang, Zhaoyang, et al.
Published: (2026)
HH-SAE: Discovering and Steering Hierarchical Knowledge of Complex Manifolds
by: Wu, Honghan, et al.
Published: (2026)
by: Wu, Honghan, et al.
Published: (2026)
LoV3D: Grounding Cognitive Prognosis Reasoning in Longitudinal 3D Brain MRI via Regional Volume Assessments
by: Jiang, Zhaoyang, et al.
Published: (2026)
by: Jiang, Zhaoyang, et al.
Published: (2026)
Hallucination Benchmark in Medical Visual Question Answering
by: Wu, Jinge, et al.
Published: (2024)
by: Wu, Jinge, et al.
Published: (2024)
MedExQA: Medical Question Answering Benchmark with Multiple Explanations
by: Kim, Yunsoo, et al.
Published: (2024)
by: Kim, Yunsoo, et al.
Published: (2024)
Enhancing Human-Computer Interaction in Chest X-ray Analysis using Vision and Language Model with Eye Gaze Patterns
by: Kim, Yunsoo, et al.
Published: (2024)
by: Kim, Yunsoo, et al.
Published: (2024)
SLaVA-CXR: Small Language and Vision Assistant for Chest X-ray Report Automation
by: Wu, Jinge, et al.
Published: (2024)
by: Wu, Jinge, et al.
Published: (2024)
RadEyeVideo: Enhancing general-domain Large Vision Language Model for chest X-ray analysis with video representations of eye gaze
by: Kim, Yunsoo, et al.
Published: (2025)
by: Kim, Yunsoo, et al.
Published: (2025)
BioHopR: A Benchmark for Multi-Hop, Multi-Answer Reasoning in Biomedical Domain
by: Kim, Yunsoo, et al.
Published: (2025)
by: Kim, Yunsoo, et al.
Published: (2025)
Sentiment analysis of preservice teachers' reflections using a large language model
by: Park, Yunsoo, et al.
Published: (2024)
by: Park, Yunsoo, et al.
Published: (2024)
TALEC: Teach Your LLM to Evaluate in Specific Domain with In-house Criteria by Criteria Division and Zero-shot Plus Few-shot
by: Zhang, Kaiqi, et al.
Published: (2024)
by: Zhang, Kaiqi, et al.
Published: (2024)
Graph-based LLM over Semi-Structured Population Data for Dynamic Policy Response
by: Shi, Daqian, et al.
Published: (2025)
by: Shi, Daqian, et al.
Published: (2025)
MultiModal-Learning for Predicting Molecular Properties: A Framework Based on Image and Graph Structures
by: Wang, Zhuoyuan, et al.
Published: (2023)
by: Wang, Zhuoyuan, et al.
Published: (2023)
An Efficient Recommendation Model Based on Knowledge Graph Attention-Assisted Network (KGATAX)
by: Wu, Zhizhong
Published: (2024)
by: Wu, Zhizhong
Published: (2024)
Multiple Greedy Quasi-Newton Methods for Saddle Point Problems
by: Xiao, Minheng, et al.
Published: (2024)
by: Xiao, Minheng, et al.
Published: (2024)
Select before Act: Spatially Decoupled Action Repetition for Continuous Control
by: Nie, Buqing, et al.
Published: (2025)
by: Nie, Buqing, et al.
Published: (2025)
OLIVIA: Online Learning via Inference-time Action Adaptation for Decision Making in LLM ReAct Agents
by: Yu, Sheldon, et al.
Published: (2026)
by: Yu, Sheldon, et al.
Published: (2026)
Adjusting the Output of Decision Transformer with Action Gradient
by: Lin, Rui, et al.
Published: (2025)
by: Lin, Rui, et al.
Published: (2025)
SAGE-LLM: Towards Safe and Generalizable LLM Controller with Fuzzy-CBF Verification and Graph-Structured Knowledge Retrieval for UAV Decision
by: Zhao, Wenzhe, et al.
Published: (2026)
by: Zhao, Wenzhe, et al.
Published: (2026)
Building Decision Making Models Through Language Model Regime
by: Zhang, Yu, et al.
Published: (2024)
by: Zhang, Yu, et al.
Published: (2024)
DecisionLLM: Large Language Models for Long Sequence Decision Exploration
by: Lv, Xiaowei, et al.
Published: (2026)
by: Lv, Xiaowei, et al.
Published: (2026)
FlexiAct: Towards Flexible Action Control in Heterogeneous Scenarios
by: Zhang, Shiyi, et al.
Published: (2025)
by: Zhang, Shiyi, et al.
Published: (2025)
DipLLM: Fine-Tuning LLM for Strategic Decision-making in Diplomacy
by: Xu, Kaixuan, et al.
Published: (2025)
by: Xu, Kaixuan, et al.
Published: (2025)
Towards Feedback-to-Plan Decisions for Self-Evolving LLM Agents in CUDA Kernel Generation
by: Chong, Yee Hin, et al.
Published: (2026)
by: Chong, Yee Hin, et al.
Published: (2026)
The State-Action-Reward-State-Action Algorithm in Spatial Prisoner's Dilemma Game
by: Yang, Lanyu, et al.
Published: (2024)
by: Yang, Lanyu, et al.
Published: (2024)
HiMem: Hierarchical Long-Term Memory for LLM Long-Horizon Agents
by: Zhang, Ningning, et al.
Published: (2026)
by: Zhang, Ningning, et al.
Published: (2026)
ReCode: Unify Plan and Action for Universal Granularity Control
by: Yu, Zhaoyang, et al.
Published: (2025)
by: Yu, Zhaoyang, et al.
Published: (2025)
CoSLight: Co-optimizing Collaborator Selection and Decision-making to Enhance Traffic Signal Control
by: Ruan, Jingqing, et al.
Published: (2024)
by: Ruan, Jingqing, et al.
Published: (2024)
Adversarial Attacks on VQA-NLE: Exposing and Alleviating Inconsistencies in Visual Question Answering Explanations
by: Yeh, Yahsin, et al.
Published: (2025)
by: Yeh, Yahsin, et al.
Published: (2025)
Evaluating LLMs for Police Decision-Making: A Framework Based on Police Action Scenarios
by: Lee, Sangyub, et al.
Published: (2026)
by: Lee, Sangyub, et al.
Published: (2026)
LLMs for High-Frequency Decision-Making: Normalized Action Reward-Guided Consistency Policy Optimization
by: Zhao, Yang, et al.
Published: (2026)
by: Zhao, Yang, et al.
Published: (2026)
ARGORA: Orchestrated Argumentation for Causally Grounded LLM Reasoning and Decision Making
by: Jin, Youngjin, et al.
Published: (2026)
by: Jin, Youngjin, et al.
Published: (2026)
Efficient DNN-Powered Software with Fair Sparse Models
by: Gao, Xuanqi, et al.
Published: (2024)
by: Gao, Xuanqi, et al.
Published: (2024)
Domain-Specialized Tree of Thought through Plug-and-Play Predictors
by: Gao, Xuanqi, et al.
Published: (2026)
by: Gao, Xuanqi, et al.
Published: (2026)
Who is a Better Player: LLM against LLM
by: Zhou, Yingjie, et al.
Published: (2025)
by: Zhou, Yingjie, et al.
Published: (2025)
MCP-RADAR: A Multi-Dimensional Benchmark for Evaluating Tool Use Capabilities in Large Language Models
by: Gao, Xuanqi, et al.
Published: (2025)
by: Gao, Xuanqi, et al.
Published: (2025)
Grounding Sim-to-Real Generalization in Dexterous Manipulation: An Empirical Study with Vision-Language-Action Models
by: Jin, Ruixing, et al.
Published: (2026)
by: Jin, Ruixing, et al.
Published: (2026)
Executable Code Actions Elicit Better LLM Agents
by: Wang, Xingyao, et al.
Published: (2024)
by: Wang, Xingyao, et al.
Published: (2024)
AI Agents for Inventory Control: Human-LLM-OR Complementarity
by: Baek, Jackie, et al.
Published: (2026)
by: Baek, Jackie, et al.
Published: (2026)
Confidence-Aware Decision-Making and Control for Tool Selection
by: Meera, Ajith Anil, et al.
Published: (2024)
by: Meera, Ajith Anil, et al.
Published: (2024)
Similar Items
-
Better Accuracies, Worse Reasoning: A Step-Level Audit of Medical Chain-of-Thought Distillation
by: Jiang, Zhaoyang, et al.
Published: (2026) -
HH-SAE: Discovering and Steering Hierarchical Knowledge of Complex Manifolds
by: Wu, Honghan, et al.
Published: (2026) -
LoV3D: Grounding Cognitive Prognosis Reasoning in Longitudinal 3D Brain MRI via Regional Volume Assessments
by: Jiang, Zhaoyang, et al.
Published: (2026) -
Hallucination Benchmark in Medical Visual Question Answering
by: Wu, Jinge, et al.
Published: (2024) -
MedExQA: Medical Question Answering Benchmark with Multiple Explanations
by: Kim, Yunsoo, et al.
Published: (2024)