:: Library Catalog

Bibliographic Details
Main Authors:	Chen, Guoxin; Chen, Jie; Chen, Lei; Zhao, Jiale; Meng, Fanzhe; Zhao, Wayne Xin; Song, Ruihua; Chen, Cheng; Wen, Ji-Rong; Jia, Kai
Format:	Preprint
Published:	2026
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2604.13018

Similar Items

BeyondSWE: Can Current Code Agent Survive Beyond Single-Repo Bug Fixing?
by: Chen, Guoxin, et al.
Published: (2026)

Immersion in the GitHub Universe: Scaling Coding Agents to Mastery
by: Zhao, Jiale, et al.
Published: (2026)

IterResearch: Rethinking Long-Horizon Agents with Interaction Scaling
by: Chen, Guoxin, et al.
Published: (2025)

Decomposing the Entropy-Performance Exchange: The Missing Keys to Unlocking Effective Reinforcement Learning
by: Deng, Jia, et al.
Published: (2025)

Search-Based Interaction For Conversation Recommendation via Generative Reward Model Based Simulated User
by: Wang, Xiaolei, et al.
Published: (2025)

ReForm: Reflective Autoformalization with Prospective Bounded Sequence Optimization
by: Chen, Guoxin, et al.
Published: (2025)

SWE-Master: Unleashing the Potential of Software Engineering Agents via Post-Training
by: Song, Huatong, et al.
Published: (2026)

Computer Environments Elicit General Agentic Intelligence in LLMs
by: Cheng, Daixuan, et al.
Published: (2026)

Not Everything is All You Need: Toward Low-Redundant Optimization for Large Language Model Alignment
by: Chen, Zhipeng, et al.
Published: (2024)

R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning
by: Song, Huatong, et al.
Published: (2025)

KG-Agent: An Efficient Autonomous Agent Framework for Complex Reasoning over Knowledge Graph
by: Jiang, Jinhao, et al.
Published: (2024)

Unveiling the Flaws: Exploring Imperfections in Synthetic Data and Mitigation Strategies for Large Language Models
by: Chen, Jie, et al.
Published: (2024)

MARS: Co-evolving Dual-System Deep Research via Multi-Agent Reinforcement Learning
by: Chen, Guoxin, et al.
Published: (2025)

Low-rank Optimization Trajectories Modeling for LLM RLVR Acceleration
by: Chen, Zhipeng, et al.
Published: (2026)

Towards Event-oriented Long Video Understanding
by: Du, Yifan, et al.
Published: (2024)

Towards Long-horizon Agentic Multimodal Search
by: Du, Yifan, et al.
Published: (2026)

Extracting and Combining Abilities For Building Multi-lingual Ability-enhanced Large Language Models
by: Chen, Zhipeng, et al.
Published: (2024)

Challenging the Boundaries of Reasoning: An Olympiad-Level Math Benchmark for Large Language Models
by: Sun, Haoxiang, et al.
Published: (2025)

Universal Item Tokenization for Transferable Generative Recommendation
by: Zheng, Bowen, et al.
Published: (2025)

Adaptive Ability Decomposing for Unlocking Large Reasoning Model Effective Reinforcement Learning
by: Chen, Zhipeng, et al.
Published: (2026)

From Trial-and-Error to Improvement: A Systematic Analysis of LLM Exploration Mechanisms in RLVR
by: Deng, Jia, et al.
Published: (2025)

The Dawn After the Dark: An Empirical Study on Factuality Hallucination in Large Language Models
by: Li, Junyi, et al.
Published: (2024)

Towards Effective Code-Integrated Reasoning
by: Bai, Fei, et al.
Published: (2025)

A Survey on Large Language Model based Autonomous Agents
by: Wang, Lei, et al.
Published: (2023)

ChainLM: Empowering Large Language Models with Improved Chain-of-Thought Prompting
by: Cheng, Xiaoxue, et al.
Published: (2024)

Think More, Hallucinate Less: Mitigating Hallucinations via Dual Process of Fast and Slow Thinking
by: Cheng, Xiaoxue, et al.
Published: (2025)

Irrational Complex Rotations Empower Low-bit Optimizers
by: Tian, Zhen, et al.
Published: (2025)

Towards Effective and Efficient Continual Pre-training of Large Language Models
by: Chen, Jie, et al.
Published: (2024)

BAMBOO: A Comprehensive Benchmark for Evaluating Long Text Modeling Capacities of Large Language Models
by: Dong, Zican, et al.
Published: (2023)

Sticker-TTS: Learn to Utilize Historical Experience with a Sticker-driven Test-Time Scaling Framework
by: Chen, Jie, et al.
Published: (2025)

User Behavior Simulation with Large Language Model based Agents
by: Wang, Lei, et al.
Published: (2023)

Enhancing Graph Contrastive Learning with Reliable and Informative Augmentation for Recommendation
by: Zheng, Bowen, et al.
Published: (2024)

Adapting Large Language Models by Integrating Collaborative Semantics for Recommendation
by: Zheng, Bowen, et al.
Published: (2023)

MagicWorld: Towards Long-Horizon Stability for Interactive Video World Exploration
by: Li, Guangyuan, et al.
Published: (2025)

What Makes for Good Visual Instructions? Synthesizing Complex Visual Reasoning Instructions for Visual Instruction Tuning
by: Du, Yifan, et al.
Published: (2023)

Towards Coarse-to-Fine Evaluation of Inference Efficiency for Large Language Models
by: Chen, Yushuo, et al.
Published: (2024)

ICPC-Eval: Probing the Frontiers of LLM Reasoning with Competitive Programming Contests
by: Xu, Shiyi, et al.
Published: (2025)

Aletheia: Quantifying Cognitive Conviction in Reasoning Models via Regularized Inverse Confusion Matrix
by: Fu, Fanzhe
Published: (2026)

The Meta-Prompting Protocol: Orchestrating LLMs via Adversarial Feedback Loops
by: Fu, Fanzhe
Published: (2025)

DeepRec: Towards a Deep Dive Into the Item Space with Large Language Model Based Recommendation
by: Zheng, Bowen, et al.
Published: (2025)