Saved in:
| Main Authors: | Qiu, Yiwen, Wu, Linjuan, Liu, Yizhou, Yan, Yuchen, Ma, Jin, Tan, Xu, Hu, Yao, Zhang, Daoxin, Zhang, Wenqi, Lu, Weiming, Xiao, Jun, Shen, Yongliang |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.19656 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
LAPO: Internalizing Reasoning Efficiency via Length-Adaptive Policy Optimization
by: Wu, Xingyu, et al.
Published: (2025)
by: Wu, Xingyu, et al.
Published: (2025)
TimeToM: Temporal Space is the Key to Unlocking the Door of Large Language Models' Theory-of-Mind
by: Hou, Guiyang, et al.
Published: (2024)
by: Hou, Guiyang, et al.
Published: (2024)
AskToAct: Enhancing LLMs Tool Use via Self-Correcting Clarification
by: Zhang, Xuan, et al.
Published: (2025)
by: Zhang, Xuan, et al.
Published: (2025)
Self-Contrast: Better Reflection Through Inconsistent Solving Perspectives
by: Zhang, Wenqi, et al.
Published: (2024)
by: Zhang, Wenqi, et al.
Published: (2024)
SpatialLadder: Progressive Training for Spatial Reasoning in Vision-Language Models
by: Li, Hongxing, et al.
Published: (2025)
by: Li, Hongxing, et al.
Published: (2025)
Hierarchical Budget Policy Optimization for Adaptive Reasoning
by: Lyu, Shangke, et al.
Published: (2025)
by: Lyu, Shangke, et al.
Published: (2025)
CoVerRL: Breaking the Consensus Trap in Label-Free Reasoning via Generator-Verifier Co-Evolution
by: Pan, Teng, et al.
Published: (2026)
by: Pan, Teng, et al.
Published: (2026)
GroundAct: Can LLM Agents Ground Actions in Environmental States?
by: Wang, Zixuan, et al.
Published: (2025)
by: Wang, Zixuan, et al.
Published: (2025)
STaR-SQL: Self-Taught Reasoner for Text-to-SQL
by: He, Mingqian, et al.
Published: (2025)
by: He, Mingqian, et al.
Published: (2025)
Cooper: Co-Optimizing Policy and Reward Models in Reinforcement Learning for Large Language Models
by: Hong, Haitao, et al.
Published: (2025)
by: Hong, Haitao, et al.
Published: (2025)
Advancing Process Verification for Large Language Models via Tree-Based Preference Learning
by: He, Mingqian, et al.
Published: (2024)
by: He, Mingqian, et al.
Published: (2024)
Language as a Latent Variable for Reasoning Optimization
by: Wu, Linjuan, et al.
Published: (2026)
by: Wu, Linjuan, et al.
Published: (2026)
EgoSocialArena: Benchmarking the Social Intelligence of Large Language Models from a First-person Perspective
by: Hou, Guiyang, et al.
Published: (2024)
by: Hou, Guiyang, et al.
Published: (2024)
Data-Copilot: Bridging Billions of Data and Humans with Autonomous Workflow
by: Zhang, Wenqi, et al.
Published: (2023)
by: Zhang, Wenqi, et al.
Published: (2023)
EasySteer: A Unified Framework for High-Performance and Extensible LLM Steering
by: Xu, Haolei, et al.
Published: (2025)
by: Xu, Haolei, et al.
Published: (2025)
SVGenius: Benchmarking LLMs in SVG Understanding, Editing and Generation
by: Chen, Siqi, et al.
Published: (2025)
by: Chen, Siqi, et al.
Published: (2025)
Let LRMs Break Free from Overthinking via Self-Braking Tuning
by: Zhao, Haoran, et al.
Published: (2025)
by: Zhao, Haoran, et al.
Published: (2025)
Mind the Gap: Bridging Thought Leap for Improved Chain-of-Thought Tuning
by: Xu, Haolei, et al.
Published: (2025)
by: Xu, Haolei, et al.
Published: (2025)
UI-Copilot: Advancing Long-Horizon GUI Automation via Tool-Integrated Policy Optimization
by: Lu, Zhengxi, et al.
Published: (2026)
by: Lu, Zhengxi, et al.
Published: (2026)
GUI-G$^2$: Gaussian Reward Modeling for GUI Grounding
by: Tang, Fei, et al.
Published: (2025)
by: Tang, Fei, et al.
Published: (2025)
ClawGUI: A Unified Framework for Training, Evaluating, and Deploying GUI Agents
by: Tang, Fei, et al.
Published: (2026)
by: Tang, Fei, et al.
Published: (2026)
Active Confusion Expression in Large Language Models: Leveraging World Models toward Better Social Reasoning
by: Du, Jialu, et al.
Published: (2025)
by: Du, Jialu, et al.
Published: (2025)
Test-Time Reinforcement Learning for GUI Grounding via Region Consistency
by: Du, Yong, et al.
Published: (2025)
by: Du, Yong, et al.
Published: (2025)
DB-Explore: Automated Database Exploration and Instruction Synthesis for Text-to-SQL
by: Ma, Haoyuan, et al.
Published: (2025)
by: Ma, Haoyuan, et al.
Published: (2025)
Code-A1: Adversarial Evolving of Code LLM and Test LLM via Reinforcement Learning
by: Wang, Aozhe, et al.
Published: (2026)
by: Wang, Aozhe, et al.
Published: (2026)
KnowU-Bench: Towards Interactive, Proactive, and Personalized Mobile Agent Evaluation
by: Chen, Tongbo, et al.
Published: (2026)
by: Chen, Tongbo, et al.
Published: (2026)
Think Twice, Click Once: Enhancing GUI Grounding via Fast and Slow Systems
by: Tang, Fei, et al.
Published: (2025)
by: Tang, Fei, et al.
Published: (2025)
Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Model
by: Zhang, Wenqi, et al.
Published: (2024)
by: Zhang, Wenqi, et al.
Published: (2024)
GSM8K-V: Can Vision Language Models Solve Grade School Math Word Problems in Visual Contexts
by: Yuan, Fan, et al.
Published: (2025)
by: Yuan, Fan, et al.
Published: (2025)
GaVaMoE: Gaussian-Variational Gated Mixture of Experts for Explainable Recommendation
by: Tang, Fei, et al.
Published: (2024)
by: Tang, Fei, et al.
Published: (2024)
Insert or Attach: Taxonomy Completion via Box Embedding
by: Xue, Wei, et al.
Published: (2023)
by: Xue, Wei, et al.
Published: (2023)
A Survey on (M)LLM-Based GUI Agents
by: Tang, Fei, et al.
Published: (2025)
by: Tang, Fei, et al.
Published: (2025)
Milestone-Guided Policy Learning for Long-Horizon Language Agents
by: Wang, Zixuan, et al.
Published: (2026)
by: Wang, Zixuan, et al.
Published: (2026)
TaskBench: Benchmarking Large Language Models for Task Automation
by: Shen, Yongliang, et al.
Published: (2023)
by: Shen, Yongliang, et al.
Published: (2023)
InftyThink: Breaking the Length Limits of Long-Context Reasoning in Large Language Models
by: Yan, Yuchen, et al.
Published: (2025)
by: Yan, Yuchen, et al.
Published: (2025)
ViewSpatial-Bench: Evaluating Multi-perspective Spatial Localization in Vision-Language Models
by: Li, Dingming, et al.
Published: (2025)
by: Li, Dingming, et al.
Published: (2025)
MathFimer: Enhancing Mathematical Reasoning by Expanding Reasoning Steps through Fill-in-the-Middle Task
by: Yan, Yuchen, et al.
Published: (2025)
by: Yan, Yuchen, et al.
Published: (2025)
Listen, Pause, and Reason: Toward Perception-Grounded Hybrid Reasoning for Audio Understanding
by: Wang, Jieyi, et al.
Published: (2026)
by: Wang, Jieyi, et al.
Published: (2026)
Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for Embodied Interactive Tasks
by: Zhang, Wenqi, et al.
Published: (2025)
by: Zhang, Wenqi, et al.
Published: (2025)
UI-Zoomer: Uncertainty-Driven Adaptive Zoom-In for GUI Grounding
by: Tang, Fei, et al.
Published: (2026)
by: Tang, Fei, et al.
Published: (2026)
Similar Items
-
LAPO: Internalizing Reasoning Efficiency via Length-Adaptive Policy Optimization
by: Wu, Xingyu, et al.
Published: (2025) -
TimeToM: Temporal Space is the Key to Unlocking the Door of Large Language Models' Theory-of-Mind
by: Hou, Guiyang, et al.
Published: (2024) -
AskToAct: Enhancing LLMs Tool Use via Self-Correcting Clarification
by: Zhang, Xuan, et al.
Published: (2025) -
Self-Contrast: Better Reflection Through Inconsistent Solving Perspectives
by: Zhang, Wenqi, et al.
Published: (2024) -
SpatialLadder: Progressive Training for Spatial Reasoning in Vision-Language Models
by: Li, Hongxing, et al.
Published: (2025)