Saved in:
| Main Authors: | Wang, Jiangyuan, Xiao, Kejun, Zhao, Huaipeng, Luo, Tao, Zeng, Xiaoyi |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.23716 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Shopping Companion: Benchmarking and Training LLM Agents for Long-Horizon Preference-Grounded E-Commerce Tasks
by: Yu, Zijian, et al.
Published: (2026)
by: Yu, Zijian, et al.
Published: (2026)
Quality Over Clicks: Intrinsic Quality-Driven Iterative Reinforcement Learning for Cold-Start E-Commerce Query Suggestion
by: Sun, Qi, et al.
Published: (2026)
by: Sun, Qi, et al.
Published: (2026)
ShoppingBench: A Real-World Intent-Grounded Shopping Benchmark for LLM-based Agents
by: Wang, Jiangyuan, et al.
Published: (2025)
by: Wang, Jiangyuan, et al.
Published: (2025)
Multi-Agent Deep Research: Training Multi-Agent Systems with M-GRPO
by: Hong, Haoyang, et al.
Published: (2025)
by: Hong, Haoyang, et al.
Published: (2025)
O-Researcher: An Open Ended Deep Research Model via Multi-Agent Distillation and Agentic RL
by: Yao, Yi, et al.
Published: (2026)
by: Yao, Yi, et al.
Published: (2026)
Marco DeepResearch: Unlocking Efficient Deep Research Agents via Verification-Centric Design
by: Zhu, Bin, et al.
Published: (2026)
by: Zhu, Bin, et al.
Published: (2026)
AgentDistill: Training-Free Agent Distillation with Generalizable MCP Boxes
by: Qiu, Jiahao, et al.
Published: (2025)
by: Qiu, Jiahao, et al.
Published: (2025)
Thinking Broad, Acting Fast: Latent Reasoning Distillation from Multi-Perspective Chain-of-Thought for E-Commerce Relevance
by: Qiu, Baopu, et al.
Published: (2026)
by: Qiu, Baopu, et al.
Published: (2026)
Where Do Deep-Research Agents Go Wrong? Span-Level Error Localization in Agent Trajectories
by: Wang, Jiaming, et al.
Published: (2026)
by: Wang, Jiaming, et al.
Published: (2026)
Enterprise Deep Research: Steerable Multi-Agent Deep Research for Enterprise Analytics
by: Prabhakar, Akshara, et al.
Published: (2025)
by: Prabhakar, Akshara, et al.
Published: (2025)
ACC: Compiling Agent Trajectories for Long-Context Training
by: Su, Qisheng, et al.
Published: (2026)
by: Su, Qisheng, et al.
Published: (2026)
DeepResearch-9K: A Challenging Benchmark Dataset of Deep-Research Agent
by: Wu, Tongzhou, et al.
Published: (2026)
by: Wu, Tongzhou, et al.
Published: (2026)
AI Agent-Driven Framework for Automated Product Knowledge Graph Construction in E-Commerce
by: Peshevski, Dimitar, et al.
Published: (2025)
by: Peshevski, Dimitar, et al.
Published: (2025)
Explainable LLM-driven Multi-dimensional Distillation for E-Commerce Relevance Learning
by: Zhao, Gang, et al.
Published: (2024)
by: Zhao, Gang, et al.
Published: (2024)
LiteResearcher: A Scalable Agentic RL Training Framework for Deep Research Agent
by: Li, Wanli, et al.
Published: (2026)
by: Li, Wanli, et al.
Published: (2026)
Cognitive Kernel-Pro: A Framework for Deep Research Agents and Agent Foundation Models Training
by: Fang, Tianqing, et al.
Published: (2025)
by: Fang, Tianqing, et al.
Published: (2025)
AgentArk: Distilling Multi-Agent Intelligence into a Single LLM Agent
by: Luo, Yinyi, et al.
Published: (2026)
by: Luo, Yinyi, et al.
Published: (2026)
IntentRL: Training Proactive User-intent Agents for Open-ended Deep Research via Reinforcement Learning
by: Luo, Haohao, et al.
Published: (2026)
by: Luo, Haohao, et al.
Published: (2026)
ReportBench: Evaluating Deep Research Agents via Academic Survey Tasks
by: Li, Minghao, et al.
Published: (2025)
by: Li, Minghao, et al.
Published: (2025)
DEEPMED: Building a Medical DeepResearch Agent via Multi-hop Med-Search Data and Turn-Controlled Agentic Training & Inference
by: Wang, Zihan, et al.
Published: (2026)
by: Wang, Zihan, et al.
Published: (2026)
Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills
by: Ni, Jingwei, et al.
Published: (2026)
by: Ni, Jingwei, et al.
Published: (2026)
ShopGym: An Integrated Framework for Realistic Simulation and Scalable Benchmarking of E-Commerce Web Agents
by: Savadikar, Chinmay, et al.
Published: (2026)
by: Savadikar, Chinmay, et al.
Published: (2026)
Self-Optimizing Multi-Agent Systems for Deep Research
by: Câmara, Arthur, et al.
Published: (2026)
by: Câmara, Arthur, et al.
Published: (2026)
Evaluating Stochasticity in Deep Research Agents
by: Zhai, Haotian, et al.
Published: (2026)
by: Zhai, Haotian, et al.
Published: (2026)
MAVEN-T: Reinforced Heterogeneous Distillation for Real-Time Multi-Agent Trajectory Prediction
by: Duan, Wenchang, et al.
Published: (2026)
by: Duan, Wenchang, et al.
Published: (2026)
VeriTrace: Evolving Mental Models for Deep Research Agents
by: Zhao, Haolang, et al.
Published: (2026)
by: Zhao, Haolang, et al.
Published: (2026)
Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL
by: Li, Weizhen, et al.
Published: (2025)
by: Li, Weizhen, et al.
Published: (2025)
Deep Research Bench: Evaluating AI Web Research Agents
by: FutureSearch, et al.
Published: (2025)
by: FutureSearch, et al.
Published: (2025)
Neural Interaction Energy for Multi-Agent Trajectory Prediction
by: Shen, Kaixin, et al.
Published: (2024)
by: Shen, Kaixin, et al.
Published: (2024)
MARS: Co-evolving Dual-System Deep Research via Multi-Agent Reinforcement Learning
by: Chen, Guoxin, et al.
Published: (2025)
by: Chen, Guoxin, et al.
Published: (2025)
AgentOrchestra: Orchestrating Multi-Agent Intelligence with the Tool-Environment-Agent(TEA) Protocol
by: Zhang, Wentao, et al.
Published: (2025)
by: Zhang, Wentao, et al.
Published: (2025)
SimGym: Traffic-Grounded Browser Agents for Offline A/B Testing in E-Commerce
by: Castelo, Alberto, et al.
Published: (2026)
by: Castelo, Alberto, et al.
Published: (2026)
MiroEval: Benchmarking Multimodal Deep Research Agents in Process and Outcome
by: Ye, Fangda, et al.
Published: (2026)
by: Ye, Fangda, et al.
Published: (2026)
Auto Research with Specialist Agents Develops Effective and Non-Trivial Training Recipes
by: Ning, Jingjie, et al.
Published: (2026)
by: Ning, Jingjie, et al.
Published: (2026)
When Does Memory Help Multi-Trajectory Inference for Tool-Use LLM Agents?
by: Li, Xinzhe, et al.
Published: (2026)
by: Li, Xinzhe, et al.
Published: (2026)
Deep Research Agents: A Systematic Examination And Roadmap
by: Huang, Yuxuan, et al.
Published: (2025)
by: Huang, Yuxuan, et al.
Published: (2025)
Deep Research Comparator: A Platform For Fine-grained Human Annotations of Deep Research Agents
by: Chandrahasan, Prahaladh, et al.
Published: (2025)
by: Chandrahasan, Prahaladh, et al.
Published: (2025)
CyberJurors: A Multi-Agent Simulation Task for E-Commerce Disputes Verdict
by: Sun, Yanhui, et al.
Published: (2026)
by: Sun, Yanhui, et al.
Published: (2026)
SensingAgents: A Multi-Agent Collaborative Framework for Robust IMU Activity Recognition
by: Zheng, Naiyu, et al.
Published: (2026)
by: Zheng, Naiyu, et al.
Published: (2026)
Attention-MoA: Enhancing Mixture-of-Agents via Inter-Agent Semantic Attention and Deep Residual Synthesis
by: Wen, Jianyu, et al.
Published: (2026)
by: Wen, Jianyu, et al.
Published: (2026)
Similar Items
-
Shopping Companion: Benchmarking and Training LLM Agents for Long-Horizon Preference-Grounded E-Commerce Tasks
by: Yu, Zijian, et al.
Published: (2026) -
Quality Over Clicks: Intrinsic Quality-Driven Iterative Reinforcement Learning for Cold-Start E-Commerce Query Suggestion
by: Sun, Qi, et al.
Published: (2026) -
ShoppingBench: A Real-World Intent-Grounded Shopping Benchmark for LLM-based Agents
by: Wang, Jiangyuan, et al.
Published: (2025) -
Multi-Agent Deep Research: Training Multi-Agent Systems with M-GRPO
by: Hong, Haoyang, et al.
Published: (2025) -
O-Researcher: An Open Ended Deep Research Model via Multi-Agent Distillation and Agentic RL
by: Yao, Yi, et al.
Published: (2026)