:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Zhao, Ruiqing, Li, Fengzhi, Zuo, Yuan, Liu, Rui, Liu, Yansong, Ma, Yunfei, Meng, Fanyu, Feng, Junlan
Format:	Preprint
Published:	2026
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2605.02545
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Beyond One-Size-Fits-All: Adaptive Subgraph Denoising for Zero-Shot Graph Learning with Large Language Models
by: Li, Fengzhi, et al.
Published: (2026)

CCR-Bench: A Comprehensive Benchmark for Evaluating LLMs on Complex Constraints, Control Flows, and Real-World Cases
by: Xue, Xiaona, et al.
Published: (2026)

LLMs as Zero-shot Graph Learners: Alignment of GNN Representations with LLM Token Embeddings
by: Wang, Duo, et al.
Published: (2024)

Thinking-Based Non-Thinking: Solving the Reward Hacking Problem in Training Hybrid Reasoning Models via Reinforcement Learning
by: Gan, Siyuan, et al.
Published: (2026)

HRBench: Benchmarking and Understanding Thinking-Mode Switch Strategies in Hybrid-Reasoning LLMs
by: Ning, Yansong, et al.
Published: (2026)

JT-Safe: Intrinsically Enhancing the Safety and Trustworthiness of LLMs
by: Feng, Junlan, et al.
Published: (2025)

Plan before Solving: Problem-Aware Strategy Routing for Mathematical Reasoning with LLMs
by: Qi, Shihao, et al.
Published: (2025)

USTBench: Benchmarking and Dissecting Spatiotemporal Reasoning of LLMs as Urban Agents
by: Lai, Siqi, et al.
Published: (2025)

Scene-Aware Explainable Multimodal Trajectory Prediction
by: Liu, Pei, et al.
Published: (2024)

Where Do Deep-Research Agents Go Wrong? Span-Level Error Localization in Agent Trajectories
by: Wang, Jiaming, et al.
Published: (2026)

Reconsidering Overthinking: Penalizing Internal and External Redundancy in CoT Reasoning
by: Hong, Jialiang, et al.
Published: (2025)

Are LLMs Capable of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Data
by: Liu, Xiao, et al.
Published: (2024)

RefTool: Reference-Guided Tool Creation for Knowledge-Intensive Reasoning
by: Liu, Xiao, et al.
Published: (2025)

Kill Two Birds with One Stone! Trajectory enabled Unified Online Detection of Adversarial Examples and Backdoor Attacks
by: Fu, Anmin, et al.
Published: (2025)

JT-DA: Enhancing Data Analysis with Tool-Integrated Table Reasoning Large Language Models
by: Chi, Ce, et al.
Published: (2025)

From Single to Societal: Analyzing Persona-Induced Bias in Multi-Agent Interactions
by: Li, Jiayi, et al.
Published: (2025)

Calibration-Aware Policy Optimization for Reasoning LLMs
by: Wang, Ziqi, et al.
Published: (2026)

UrbanKGent: A Unified Large Language Model Agent Framework for Urban Knowledge Graph Construction
by: Ning, Yansong, et al.
Published: (2024)

Hardness-Aware Dynamic Curriculum Learning for Robust Multimodal Emotion Recognition with Missing Modalities
by: Liu, Rui, et al.
Published: (2025)

Chain-of-Discussion: A Multi-Model Framework for Complex Evidence-Based Question Answering
by: Tao, Mingxu, et al.
Published: (2024)

JUREX-4E: Juridical Expert-Annotated Four-Element Knowledge Base for Legal Reasoning
by: Liu, Huanghai, et al.
Published: (2025)

Seeing and Reasoning with Confidence: Supercharging Multimodal LLMs with an Uncertainty-Aware Agentic Framework
by: Zhi, Zhuo, et al.
Published: (2025)

CohEx: A Generalized Framework for Cohort Explanation
by: Meng, Fanyu, et al.
Published: (2024)

Empowering Small Language Models with Factual Hallucination-Aware Reasoning for Financial Classification
by: Yuan, Han, et al.
Published: (2026)

From Superficial to Deep: Integrating External Knowledge for Follow-up Question Generation Using Knowledge Graph and LLM
by: Liu, Jianyu, et al.
Published: (2025)

SafeDialBench: A Fine-Grained Safety Evaluation Benchmark for Large Language Models in Multi-Turn Dialogues with Diverse Jailbreak Attacks
by: Cao, Hongye, et al.
Published: (2025)

Chart-RL: Policy Optimization Reinforcement Learning for Enhanced Visual Reasoning in Chart Question Answering with Vision Language Models
by: Bai, Yunfei, et al.
Published: (2026)

TReB: A Comprehensive Benchmark for Evaluating Table Reasoning Capabilities of Large Language Models
by: Li, Ce, et al.
Published: (2025)

CausalAbstain: Enhancing Multilingual LLMs with Causal Reasoning for Trustworthy Abstention
by: Sun, Yuxi, et al.
Published: (2025)

Efficient Reasoning via Reward Model
by: Wang, Yuhao, et al.
Published: (2025)

Automating Legal Interpretation with LLMs: Retrieval, Generation, and Evaluation
by: Luo, Kangcheng, et al.
Published: (2025)

Effective Learning for Small Reasoning Models: An Empirical Study on 0.5B Reasoning LLMs
by: Zhuang, Xialie, et al.
Published: (2025)

Enhancing Crash Frequency Modeling Based on Augmented Multi-Type Data by Hybrid VAE-Diffusion-Based Generative Neural Networks
by: Chen, Junlan, et al.
Published: (2025)

MoveGPT: Scaling Mobility Foundation Models with Spatially-Aware Mixture of Experts
by: Han, Chonghua, et al.
Published: (2025)

Forge: Quality-Aware Reinforcement Learning for NP-Hard Optimization in LLMs
by: Li, Xiaozhe, et al.
Published: (2026)

LLMs for High-Frequency Decision-Making: Normalized Action Reward-Guided Consistency Policy Optimization
by: Zhao, Yang, et al.
Published: (2026)

Structuring Scientific Innovation: A Framework for Modeling and Discovering Impactful Knowledge Combinations
by: Chen, Junlan, et al.
Published: (2025)

When Models Learn to Ask Why: Adaptive Causal Reasoning for Trustworthy Medical Vision-Language Models
by: Lin, Jianxin, et al.
Published: (2026)

SeaEvo: Advancing Algorithm Discovery with Strategy Space Evolution
by: Luo, Sichun, et al.
Published: (2026)

Rethinking Reasoning: A Survey on Reasoning-based Backdoors in LLMs
by: Hu, Man, et al.
Published: (2025)