Saved in:
| Main Authors: | Zhao, Ruiqing, Li, Fengzhi, Zuo, Yuan, Liu, Rui, Liu, Yansong, Ma, Yunfei, Meng, Fanyu, Feng, Junlan |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.02545 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Beyond One-Size-Fits-All: Adaptive Subgraph Denoising for Zero-Shot Graph Learning with Large Language Models
by: Li, Fengzhi, et al.
Published: (2026)
by: Li, Fengzhi, et al.
Published: (2026)
CCR-Bench: A Comprehensive Benchmark for Evaluating LLMs on Complex Constraints, Control Flows, and Real-World Cases
by: Xue, Xiaona, et al.
Published: (2026)
by: Xue, Xiaona, et al.
Published: (2026)
LLMs as Zero-shot Graph Learners: Alignment of GNN Representations with LLM Token Embeddings
by: Wang, Duo, et al.
Published: (2024)
by: Wang, Duo, et al.
Published: (2024)
Thinking-Based Non-Thinking: Solving the Reward Hacking Problem in Training Hybrid Reasoning Models via Reinforcement Learning
by: Gan, Siyuan, et al.
Published: (2026)
by: Gan, Siyuan, et al.
Published: (2026)
HRBench: Benchmarking and Understanding Thinking-Mode Switch Strategies in Hybrid-Reasoning LLMs
by: Ning, Yansong, et al.
Published: (2026)
by: Ning, Yansong, et al.
Published: (2026)
JT-Safe: Intrinsically Enhancing the Safety and Trustworthiness of LLMs
by: Feng, Junlan, et al.
Published: (2025)
by: Feng, Junlan, et al.
Published: (2025)
Plan before Solving: Problem-Aware Strategy Routing for Mathematical Reasoning with LLMs
by: Qi, Shihao, et al.
Published: (2025)
by: Qi, Shihao, et al.
Published: (2025)
USTBench: Benchmarking and Dissecting Spatiotemporal Reasoning of LLMs as Urban Agents
by: Lai, Siqi, et al.
Published: (2025)
by: Lai, Siqi, et al.
Published: (2025)
Scene-Aware Explainable Multimodal Trajectory Prediction
by: Liu, Pei, et al.
Published: (2024)
by: Liu, Pei, et al.
Published: (2024)
Where Do Deep-Research Agents Go Wrong? Span-Level Error Localization in Agent Trajectories
by: Wang, Jiaming, et al.
Published: (2026)
by: Wang, Jiaming, et al.
Published: (2026)
Reconsidering Overthinking: Penalizing Internal and External Redundancy in CoT Reasoning
by: Hong, Jialiang, et al.
Published: (2025)
by: Hong, Jialiang, et al.
Published: (2025)
Are LLMs Capable of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Data
by: Liu, Xiao, et al.
Published: (2024)
by: Liu, Xiao, et al.
Published: (2024)
RefTool: Reference-Guided Tool Creation for Knowledge-Intensive Reasoning
by: Liu, Xiao, et al.
Published: (2025)
by: Liu, Xiao, et al.
Published: (2025)
Kill Two Birds with One Stone! Trajectory enabled Unified Online Detection of Adversarial Examples and Backdoor Attacks
by: Fu, Anmin, et al.
Published: (2025)
by: Fu, Anmin, et al.
Published: (2025)
JT-DA: Enhancing Data Analysis with Tool-Integrated Table Reasoning Large Language Models
by: Chi, Ce, et al.
Published: (2025)
by: Chi, Ce, et al.
Published: (2025)
From Single to Societal: Analyzing Persona-Induced Bias in Multi-Agent Interactions
by: Li, Jiayi, et al.
Published: (2025)
by: Li, Jiayi, et al.
Published: (2025)
Calibration-Aware Policy Optimization for Reasoning LLMs
by: Wang, Ziqi, et al.
Published: (2026)
by: Wang, Ziqi, et al.
Published: (2026)
UrbanKGent: A Unified Large Language Model Agent Framework for Urban Knowledge Graph Construction
by: Ning, Yansong, et al.
Published: (2024)
by: Ning, Yansong, et al.
Published: (2024)
Hardness-Aware Dynamic Curriculum Learning for Robust Multimodal Emotion Recognition with Missing Modalities
by: Liu, Rui, et al.
Published: (2025)
by: Liu, Rui, et al.
Published: (2025)
Chain-of-Discussion: A Multi-Model Framework for Complex Evidence-Based Question Answering
by: Tao, Mingxu, et al.
Published: (2024)
by: Tao, Mingxu, et al.
Published: (2024)
JUREX-4E: Juridical Expert-Annotated Four-Element Knowledge Base for Legal Reasoning
by: Liu, Huanghai, et al.
Published: (2025)
by: Liu, Huanghai, et al.
Published: (2025)
Seeing and Reasoning with Confidence: Supercharging Multimodal LLMs with an Uncertainty-Aware Agentic Framework
by: Zhi, Zhuo, et al.
Published: (2025)
by: Zhi, Zhuo, et al.
Published: (2025)
CohEx: A Generalized Framework for Cohort Explanation
by: Meng, Fanyu, et al.
Published: (2024)
by: Meng, Fanyu, et al.
Published: (2024)
Empowering Small Language Models with Factual Hallucination-Aware Reasoning for Financial Classification
by: Yuan, Han, et al.
Published: (2026)
by: Yuan, Han, et al.
Published: (2026)
From Superficial to Deep: Integrating External Knowledge for Follow-up Question Generation Using Knowledge Graph and LLM
by: Liu, Jianyu, et al.
Published: (2025)
by: Liu, Jianyu, et al.
Published: (2025)
SafeDialBench: A Fine-Grained Safety Evaluation Benchmark for Large Language Models in Multi-Turn Dialogues with Diverse Jailbreak Attacks
by: Cao, Hongye, et al.
Published: (2025)
by: Cao, Hongye, et al.
Published: (2025)
Chart-RL: Policy Optimization Reinforcement Learning for Enhanced Visual Reasoning in Chart Question Answering with Vision Language Models
by: Bai, Yunfei, et al.
Published: (2026)
by: Bai, Yunfei, et al.
Published: (2026)
TReB: A Comprehensive Benchmark for Evaluating Table Reasoning Capabilities of Large Language Models
by: Li, Ce, et al.
Published: (2025)
by: Li, Ce, et al.
Published: (2025)
CausalAbstain: Enhancing Multilingual LLMs with Causal Reasoning for Trustworthy Abstention
by: Sun, Yuxi, et al.
Published: (2025)
by: Sun, Yuxi, et al.
Published: (2025)
Efficient Reasoning via Reward Model
by: Wang, Yuhao, et al.
Published: (2025)
by: Wang, Yuhao, et al.
Published: (2025)
Automating Legal Interpretation with LLMs: Retrieval, Generation, and Evaluation
by: Luo, Kangcheng, et al.
Published: (2025)
by: Luo, Kangcheng, et al.
Published: (2025)
Effective Learning for Small Reasoning Models: An Empirical Study on 0.5B Reasoning LLMs
by: Zhuang, Xialie, et al.
Published: (2025)
by: Zhuang, Xialie, et al.
Published: (2025)
Enhancing Crash Frequency Modeling Based on Augmented Multi-Type Data by Hybrid VAE-Diffusion-Based Generative Neural Networks
by: Chen, Junlan, et al.
Published: (2025)
by: Chen, Junlan, et al.
Published: (2025)
MoveGPT: Scaling Mobility Foundation Models with Spatially-Aware Mixture of Experts
by: Han, Chonghua, et al.
Published: (2025)
by: Han, Chonghua, et al.
Published: (2025)
Forge: Quality-Aware Reinforcement Learning for NP-Hard Optimization in LLMs
by: Li, Xiaozhe, et al.
Published: (2026)
by: Li, Xiaozhe, et al.
Published: (2026)
LLMs for High-Frequency Decision-Making: Normalized Action Reward-Guided Consistency Policy Optimization
by: Zhao, Yang, et al.
Published: (2026)
by: Zhao, Yang, et al.
Published: (2026)
Structuring Scientific Innovation: A Framework for Modeling and Discovering Impactful Knowledge Combinations
by: Chen, Junlan, et al.
Published: (2025)
by: Chen, Junlan, et al.
Published: (2025)
When Models Learn to Ask Why: Adaptive Causal Reasoning for Trustworthy Medical Vision-Language Models
by: Lin, Jianxin, et al.
Published: (2026)
by: Lin, Jianxin, et al.
Published: (2026)
SeaEvo: Advancing Algorithm Discovery with Strategy Space Evolution
by: Luo, Sichun, et al.
Published: (2026)
by: Luo, Sichun, et al.
Published: (2026)
Rethinking Reasoning: A Survey on Reasoning-based Backdoors in LLMs
by: Hu, Man, et al.
Published: (2025)
by: Hu, Man, et al.
Published: (2025)
Similar Items
-
Beyond One-Size-Fits-All: Adaptive Subgraph Denoising for Zero-Shot Graph Learning with Large Language Models
by: Li, Fengzhi, et al.
Published: (2026) -
CCR-Bench: A Comprehensive Benchmark for Evaluating LLMs on Complex Constraints, Control Flows, and Real-World Cases
by: Xue, Xiaona, et al.
Published: (2026) -
LLMs as Zero-shot Graph Learners: Alignment of GNN Representations with LLM Token Embeddings
by: Wang, Duo, et al.
Published: (2024) -
Thinking-Based Non-Thinking: Solving the Reward Hacking Problem in Training Hybrid Reasoning Models via Reinforcement Learning
by: Gan, Siyuan, et al.
Published: (2026) -
HRBench: Benchmarking and Understanding Thinking-Mode Switch Strategies in Hybrid-Reasoning LLMs
by: Ning, Yansong, et al.
Published: (2026)