Saved in:
| Main Authors: | Zeng, Yongcheng, Sun, Zexu, Ji, Bokai, Min, Erxue, Cai, Hengyi, Wang, Shuaiqiang, Yin, Dawei, Zhang, Haifeng, Chen, Xu, Wang, Jun |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.01037 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Cog-Rethinker: Hierarchical Metacognitive Reinforcement Learning for LLM Reasoning
by: Sun, Zexu, et al.
Published: (2025)
by: Sun, Zexu, et al.
Published: (2025)
Solving the Granularity Mismatch: Hierarchical Preference Learning for Long-Horizon LLM Agents
by: Gao, Heyang, et al.
Published: (2025)
by: Gao, Heyang, et al.
Published: (2025)
Staying in the Sweet Spot: Responsive Reasoning Evolution via Capability-Adaptive Hint Scaffolding
by: Li, Ziheng, et al.
Published: (2025)
by: Li, Ziheng, et al.
Published: (2025)
AgentSkiller: Scaling Generalist Agent Intelligence through Semantically Integrated Cross-Domain Data Synthesis
by: Sun, Zexu, et al.
Published: (2026)
by: Sun, Zexu, et al.
Published: (2026)
From Prompting to Alignment: A Generative Framework for Query Recommendation
by: Min, Erxue, et al.
Published: (2025)
by: Min, Erxue, et al.
Published: (2025)
MatchTIR: Fine-Grained Supervision for Tool-Integrated Reasoning via Bipartite Matching
by: Qu, Changle, et al.
Published: (2026)
by: Qu, Changle, et al.
Published: (2026)
From Exploration to Mastery: Enabling LLMs to Master Tools via Self-Driven Interactions
by: Qu, Changle, et al.
Published: (2024)
by: Qu, Changle, et al.
Published: (2024)
CTR-Guided Generative Query Suggestion in Conversational Search
by: Min, Erxue, et al.
Published: (2025)
by: Min, Erxue, et al.
Published: (2025)
Not All Preferences Are Created Equal: Stability-Aware and Gradient-Efficient Alignment for Reasoning Models
by: Wu, Hui, et al.
Published: (2026)
by: Wu, Hui, et al.
Published: (2026)
LLMs + Persona-Plug = Personalized LLMs
by: Liu, Jiongnan, et al.
Published: (2024)
by: Liu, Jiongnan, et al.
Published: (2024)
XL$^2$Bench: A Benchmark for Extremely Long Context Understanding with Long-range Dependencies
by: Ni, Xuanfan, et al.
Published: (2024)
by: Ni, Xuanfan, et al.
Published: (2024)
Efficient Thought Space Exploration Through Strategic Intervention
by: Li, Ziheng, et al.
Published: (2025)
by: Li, Ziheng, et al.
Published: (2025)
Towards Completeness-Oriented Tool Retrieval for Large Language Models
by: Qu, Changle, et al.
Published: (2024)
by: Qu, Changle, et al.
Published: (2024)
Tool Learning with Large Language Models: A Survey
by: Qu, Changle, et al.
Published: (2024)
by: Qu, Changle, et al.
Published: (2024)
Hyperbolic Knowledge Transfer in Cross-Domain Recommendation System
by: Yang, Xin, et al.
Published: (2024)
by: Yang, Xin, et al.
Published: (2024)
Towards Verifiable Text Generation with Evolving Memory and Self-Reflection
by: Sun, Hao, et al.
Published: (2023)
by: Sun, Hao, et al.
Published: (2023)
AdaSwitch: Balancing Exploration and Guidance in Knowledge Distillation via Adaptive Switching
by: Peng, Jingyu, et al.
Published: (2025)
by: Peng, Jingyu, et al.
Published: (2025)
Cross-model Control: Improving Multiple Large Language Models in One-time Training
by: Wu, Jiayi, et al.
Published: (2024)
by: Wu, Jiayi, et al.
Published: (2024)
AdaFuse: Accelerating Dynamic Adapter Inference via Token-Level Pre-Gating and Fused Kernel Optimization
by: Li, Qiyang, et al.
Published: (2026)
by: Li, Qiyang, et al.
Published: (2026)
PA-RAG: RAG Alignment via Multi-Perspective Preference Optimization
by: Wu, Jiayi, et al.
Published: (2024)
by: Wu, Jiayi, et al.
Published: (2024)
CurEvo: Curriculum-Guided Self-Evolution for Video Understanding
by: Zeng, Guiyi, et al.
Published: (2026)
by: Zeng, Guiyi, et al.
Published: (2026)
A Question Answering Dataset for Temporal-Sensitive Retrieval-Augmented Generation
by: Chen, Ziyang, et al.
Published: (2025)
by: Chen, Ziyang, et al.
Published: (2025)
Reasoning-to-Defend: Safety-Aware Reasoning Can Defend Large Language Models from Jailbreaking
by: Zhu, Junda, et al.
Published: (2025)
by: Zhu, Junda, et al.
Published: (2025)
AdaSwitch: Adaptive Switching between Small and Large Agents for Effective Cloud-Local Collaborative Learning
by: Sun, Hao, et al.
Published: (2024)
by: Sun, Hao, et al.
Published: (2024)
DaRec: A Disentangled Alignment Framework for Large Language Model and Recommender System
by: Yang, Xihong, et al.
Published: (2024)
by: Yang, Xihong, et al.
Published: (2024)
MARA: A Multimodal Adaptive Retrieval-Augmented Framework for Document Question Answering
by: Wu, Hui, et al.
Published: (2026)
by: Wu, Hui, et al.
Published: (2026)
Evolving LLMs' Self-Refinement Capability via Synergistic Training-Inference Optimization
by: Zeng, Yongcheng, et al.
Published: (2025)
by: Zeng, Yongcheng, et al.
Published: (2025)
Towards Next-Generation Recommender Systems: A Benchmark for Personalized Recommendation Assistant with LLMs
by: Huang, Jiani, et al.
Published: (2025)
by: Huang, Jiani, et al.
Published: (2025)
Token-level Direct Preference Optimization
by: Zeng, Yongcheng, et al.
Published: (2024)
by: Zeng, Yongcheng, et al.
Published: (2024)
Divide-Then-Aggregate: An Efficient Tool Learning Method via Parallel Tool Invocation
by: Zhu, Dongsheng, et al.
Published: (2025)
by: Zhu, Dongsheng, et al.
Published: (2025)
Grounding Long-Context Reasoning with Contextual Normalization for Retrieval-Augmented Generation
by: Chen, Jiamin, et al.
Published: (2025)
by: Chen, Jiamin, et al.
Published: (2025)
Reinforced Efficient Reasoning via Semantically Diverse Exploration
by: Zhao, Ziqi, et al.
Published: (2026)
by: Zhao, Ziqi, et al.
Published: (2026)
Leveraging LLMs to Evaluate Usefulness of Document
by: Wang, Xingzhu, et al.
Published: (2025)
by: Wang, Xingzhu, et al.
Published: (2025)
Mitigating Hallucinations in Large Vision-Language Models via Entity-Centric Multimodal Preference Optimization
by: Wu, Jiulong, et al.
Published: (2025)
by: Wu, Jiulong, et al.
Published: (2025)
Adversarial Yet Cooperative: Multi-Perspective Reasoning in Retrieved-Augmented Language Models
by: Xu, Can, et al.
Published: (2026)
by: Xu, Can, et al.
Published: (2026)
Behavior-Equivalent Token: Single-Token Replacement for Long Prompts in LLMs
by: Dong, Jiancheng, et al.
Published: (2025)
by: Dong, Jiancheng, et al.
Published: (2025)
GRAF: Multi-turn Jailbreaking via Global Refinement and Active Fabrication
by: Tang, Hua, et al.
Published: (2025)
by: Tang, Hua, et al.
Published: (2025)
DBR: Divergence-Based Regularization for Debiasing Natural Language Understanding Models
by: Li, Zihao, et al.
Published: (2025)
by: Li, Zihao, et al.
Published: (2025)
VideoRAG: Retrieval-Augmented Generation with Extreme Long-Context Videos
by: Ren, Xubin, et al.
Published: (2025)
by: Ren, Xubin, et al.
Published: (2025)
Knowing What LLMs DO NOT Know: A Simple Yet Effective Self-Detection Method
by: Zhao, Yukun, et al.
Published: (2023)
by: Zhao, Yukun, et al.
Published: (2023)
Similar Items
-
Cog-Rethinker: Hierarchical Metacognitive Reinforcement Learning for LLM Reasoning
by: Sun, Zexu, et al.
Published: (2025) -
Solving the Granularity Mismatch: Hierarchical Preference Learning for Long-Horizon LLM Agents
by: Gao, Heyang, et al.
Published: (2025) -
Staying in the Sweet Spot: Responsive Reasoning Evolution via Capability-Adaptive Hint Scaffolding
by: Li, Ziheng, et al.
Published: (2025) -
AgentSkiller: Scaling Generalist Agent Intelligence through Semantically Integrated Cross-Domain Data Synthesis
by: Sun, Zexu, et al.
Published: (2026) -
From Prompting to Alignment: A Generative Framework for Query Recommendation
by: Min, Erxue, et al.
Published: (2025)