:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Wang, Jiangyuan, Xiao, Kejun, Zhao, Huaipeng, Luo, Tao, Zeng, Xiaoyi
Format:	Preprint
Published:	2026
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2602.23716
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Shopping Companion: Benchmarking and Training LLM Agents for Long-Horizon Preference-Grounded E-Commerce Tasks
by: Yu, Zijian, et al.
Published: (2026)

Quality Over Clicks: Intrinsic Quality-Driven Iterative Reinforcement Learning for Cold-Start E-Commerce Query Suggestion
by: Sun, Qi, et al.
Published: (2026)

ShoppingBench: A Real-World Intent-Grounded Shopping Benchmark for LLM-based Agents
by: Wang, Jiangyuan, et al.
Published: (2025)

Multi-Agent Deep Research: Training Multi-Agent Systems with M-GRPO
by: Hong, Haoyang, et al.
Published: (2025)

O-Researcher: An Open Ended Deep Research Model via Multi-Agent Distillation and Agentic RL
by: Yao, Yi, et al.
Published: (2026)

Marco DeepResearch: Unlocking Efficient Deep Research Agents via Verification-Centric Design
by: Zhu, Bin, et al.
Published: (2026)

AgentDistill: Training-Free Agent Distillation with Generalizable MCP Boxes
by: Qiu, Jiahao, et al.
Published: (2025)

Thinking Broad, Acting Fast: Latent Reasoning Distillation from Multi-Perspective Chain-of-Thought for E-Commerce Relevance
by: Qiu, Baopu, et al.
Published: (2026)

Where Do Deep-Research Agents Go Wrong? Span-Level Error Localization in Agent Trajectories
by: Wang, Jiaming, et al.
Published: (2026)

Enterprise Deep Research: Steerable Multi-Agent Deep Research for Enterprise Analytics
by: Prabhakar, Akshara, et al.
Published: (2025)

ACC: Compiling Agent Trajectories for Long-Context Training
by: Su, Qisheng, et al.
Published: (2026)

DeepResearch-9K: A Challenging Benchmark Dataset of Deep-Research Agent
by: Wu, Tongzhou, et al.
Published: (2026)

AI Agent-Driven Framework for Automated Product Knowledge Graph Construction in E-Commerce
by: Peshevski, Dimitar, et al.
Published: (2025)

Explainable LLM-driven Multi-dimensional Distillation for E-Commerce Relevance Learning
by: Zhao, Gang, et al.
Published: (2024)

LiteResearcher: A Scalable Agentic RL Training Framework for Deep Research Agent
by: Li, Wanli, et al.
Published: (2026)

Cognitive Kernel-Pro: A Framework for Deep Research Agents and Agent Foundation Models Training
by: Fang, Tianqing, et al.
Published: (2025)

AgentArk: Distilling Multi-Agent Intelligence into a Single LLM Agent
by: Luo, Yinyi, et al.
Published: (2026)

IntentRL: Training Proactive User-intent Agents for Open-ended Deep Research via Reinforcement Learning
by: Luo, Haohao, et al.
Published: (2026)

ReportBench: Evaluating Deep Research Agents via Academic Survey Tasks
by: Li, Minghao, et al.
Published: (2025)

DEEPMED: Building a Medical DeepResearch Agent via Multi-hop Med-Search Data and Turn-Controlled Agentic Training & Inference
by: Wang, Zihan, et al.
Published: (2026)

Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills
by: Ni, Jingwei, et al.
Published: (2026)

ShopGym: An Integrated Framework for Realistic Simulation and Scalable Benchmarking of E-Commerce Web Agents
by: Savadikar, Chinmay, et al.
Published: (2026)

Self-Optimizing Multi-Agent Systems for Deep Research
by: Câmara, Arthur, et al.
Published: (2026)

Evaluating Stochasticity in Deep Research Agents
by: Zhai, Haotian, et al.
Published: (2026)

MAVEN-T: Reinforced Heterogeneous Distillation for Real-Time Multi-Agent Trajectory Prediction
by: Duan, Wenchang, et al.
Published: (2026)

VeriTrace: Evolving Mental Models for Deep Research Agents
by: Zhao, Haolang, et al.
Published: (2026)

Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL
by: Li, Weizhen, et al.
Published: (2025)

Deep Research Bench: Evaluating AI Web Research Agents
by: FutureSearch, et al.
Published: (2025)

Neural Interaction Energy for Multi-Agent Trajectory Prediction
by: Shen, Kaixin, et al.
Published: (2024)

MARS: Co-evolving Dual-System Deep Research via Multi-Agent Reinforcement Learning
by: Chen, Guoxin, et al.
Published: (2025)

AgentOrchestra: Orchestrating Multi-Agent Intelligence with the Tool-Environment-Agent(TEA) Protocol
by: Zhang, Wentao, et al.
Published: (2025)

SimGym: Traffic-Grounded Browser Agents for Offline A/B Testing in E-Commerce
by: Castelo, Alberto, et al.
Published: (2026)

MiroEval: Benchmarking Multimodal Deep Research Agents in Process and Outcome
by: Ye, Fangda, et al.
Published: (2026)

Auto Research with Specialist Agents Develops Effective and Non-Trivial Training Recipes
by: Ning, Jingjie, et al.
Published: (2026)

When Does Memory Help Multi-Trajectory Inference for Tool-Use LLM Agents?
by: Li, Xinzhe, et al.
Published: (2026)

Deep Research Agents: A Systematic Examination And Roadmap
by: Huang, Yuxuan, et al.
Published: (2025)

Deep Research Comparator: A Platform For Fine-grained Human Annotations of Deep Research Agents
by: Chandrahasan, Prahaladh, et al.
Published: (2025)

CyberJurors: A Multi-Agent Simulation Task for E-Commerce Disputes Verdict
by: Sun, Yanhui, et al.
Published: (2026)

SensingAgents: A Multi-Agent Collaborative Framework for Robust IMU Activity Recognition
by: Zheng, Naiyu, et al.
Published: (2026)

Attention-MoA: Enhancing Mixture-of-Agents via Inter-Agent Semantic Attention and Deep Residual Synthesis
by: Wen, Jianyu, et al.
Published: (2026)