Saved in:
| Main Authors: | Min, Rui, Qiao, Zile, Xu, Ze, Zhai, Jiawen, Gao, Wenyu, Chen, Xuanzhong, Sun, Haozhen, Zhang, Zhen, Wang, Xinyu, Zhou, Hong, Yin, Wenbiao, Zhang, Bo, Zhou, Xuan, Yan, Ming, Jiang, Yong, Liu, Haicheng, Ding, Liang, Zou, Ling, Fung, Yi R., Li, Yalong, Xie, Pengjun |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2512.08868 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
AgentFrontier: Expanding the Capability Frontier of LLM Agents with ZPD-Guided Data Synthesis
by: Chen, Xuanzhong, et al.
Published: (2025)
by: Chen, Xuanzhong, et al.
Published: (2025)
EcomMMMU: Strategic Utilization of Visuals for Robust Multimodal E-commerce Models
by: Ling, Xinyi, et al.
Published: (2025)
by: Ling, Xinyi, et al.
Published: (2025)
WebResearcher: Unleashing unbounded reasoning capability in Long-Horizon Agents
by: Qiao, Zile, et al.
Published: (2025)
by: Qiao, Zile, et al.
Published: (2025)
OxyEcomBench: Benchmarking Multimodal Foundation Models across E-Commerce Ecosystems
by: Liu, Yong, et al.
Published: (2026)
by: Liu, Yong, et al.
Published: (2026)
IterResearch: Rethinking Long-Horizon Agents with Interaction Scaling
by: Chen, Guoxin, et al.
Published: (2025)
by: Chen, Guoxin, et al.
Published: (2025)
HiMA-Ecom: Enabling Joint Training of Hierarchical Multi-Agent E-commerce Assistants
by: Hu, Junxing, et al.
Published: (2025)
by: Hu, Junxing, et al.
Published: (2025)
EcomScriptBench: A Multi-task Benchmark for E-commerce Script Planning via Step-wise Intention-Driven Product Association
by: Wang, Weiqi, et al.
Published: (2025)
by: Wang, Weiqi, et al.
Published: (2025)
DynamicBench: Evaluating Real-Time Report Generation in Large Language Models
by: Li, Jingyao, et al.
Published: (2025)
by: Li, Jingyao, et al.
Published: (2025)
ChineseEcomQA: A Scalable E-commerce Concept Evaluation Benchmark for Large Language Models
by: Chen, Haibin, et al.
Published: (2025)
by: Chen, Haibin, et al.
Published: (2025)
Scaling Agents via Continual Pre-training
by: Su, Liangcai, et al.
Published: (2025)
by: Su, Liangcai, et al.
Published: (2025)
EcomEdit: An Automated E-commerce Knowledge Editing Framework for Enhanced Product and Purchase Intention Understanding
by: Lau, Ching Ming Samuel, et al.
Published: (2024)
by: Lau, Ching Ming Samuel, et al.
Published: (2024)
ProteinBench: A Holistic Evaluation of Protein Foundation Models
by: Ye, Fei, et al.
Published: (2024)
by: Ye, Fei, et al.
Published: (2024)
KBM: Delineating Knowledge Boundary for Adaptive Retrieval in Large Language Models
by: Zhang, Zhen, et al.
Published: (2024)
by: Zhang, Zhen, et al.
Published: (2024)
The Breakthrough and Confrontation of Mainland Chinese Opera Films in Hong Kong under the Cold War Framework (1953–1957)
by: Du, Jiachen, et al.
Published: (2025)
by: Du, Jiachen, et al.
Published: (2025)
Supportiveness-based Knowledge Rewriting for Retrieval-augmented Language Modeling
by: Qiao, Zile, et al.
Published: (2024)
by: Qiao, Zile, et al.
Published: (2024)
ZeroSearch: Incentivize the Search Capability of LLMs without Searching
by: Sun, Hao, et al.
Published: (2025)
by: Sun, Hao, et al.
Published: (2025)
BrowseConf: Confidence-Guided Test-Time Scaling for Web Agents
by: Ou, Litu, et al.
Published: (2025)
by: Ou, Litu, et al.
Published: (2025)
RareBench: Can LLMs Serve as Rare Diseases Specialists?
by: Chen, Xuanzhong, et al.
Published: (2024)
by: Chen, Xuanzhong, et al.
Published: (2024)
DecoupleSearch: Decouple Planning and Search via Hierarchical Reward Modeling
by: Sun, Hao, et al.
Published: (2025)
by: Sun, Hao, et al.
Published: (2025)
Mix-Ecom: Towards Mixed-Type E-Commerce Dialogues with Complex Domain Rules
by: Zhou, Chenyu, et al.
Published: (2025)
by: Zhou, Chenyu, et al.
Published: (2025)
AgentFold: Long-Horizon Web Agents with Proactive Context Management
by: Ye, Rui, et al.
Published: (2025)
by: Ye, Rui, et al.
Published: (2025)
WebShaper: Agentically Data Synthesizing via Information-Seeking Formalization
by: Tao, Zhengwei, et al.
Published: (2025)
by: Tao, Zhengwei, et al.
Published: (2025)
WebSailor-V2: Bridging the Chasm to Proprietary Agents via Synthetic Data and Scalable Reinforcement Learning
by: Li, Kuan, et al.
Published: (2025)
by: Li, Kuan, et al.
Published: (2025)
Simple Vertex Algebras Arising From Congruence Subgroups
by: Dai, Xuanzhong, et al.
Published: (2022)
by: Dai, Xuanzhong, et al.
Published: (2022)
Hall algebras associated to root categories
by: Zhang, Haicheng
Published: (2022)
by: Zhang, Haicheng
Published: (2022)
CodeCriticBench: A Holistic Code Critique Benchmark for Large Language Models
by: Zhang, Alexander, et al.
Published: (2025)
by: Zhang, Alexander, et al.
Published: (2025)
Nested Browser-Use Learning for Agentic Information Seeking
by: Li, Baixuan, et al.
Published: (2025)
by: Li, Baixuan, et al.
Published: (2025)
WebLeaper: Empowering Efficiency and Efficacy in WebAgent via Enabling Info-Rich Seeking
by: Tao, Zhengwei, et al.
Published: (2025)
by: Tao, Zhengwei, et al.
Published: (2025)
Valley3: Scaling Omni Foundation Models for E-commerce
by: Chen, Zeyu, et al.
Published: (2026)
by: Chen, Zeyu, et al.
Published: (2026)
C sequential optimization numbers
by: Hui, Zile
Published: (2024)
by: Hui, Zile
Published: (2024)
Extriangulated length categories: torsion classes and $τ$-tilting theory
by: Wang, Li, et al.
Published: (2025)
by: Wang, Li, et al.
Published: (2025)
WebWalker: Benchmarking LLMs in Web Traversal
by: Wu, Jialong, et al.
Published: (2025)
by: Wu, Jialong, et al.
Published: (2025)
LookBench: A Live and Holistic Open Benchmark for Fashion Image Retrieval
by: ai, Gensmo., et al.
Published: (2026)
by: ai, Gensmo., et al.
Published: (2026)
ParallelMuse: Agentic Parallel Thinking for Deep Information Seeking
by: Li, Baixuan, et al.
Published: (2025)
by: Li, Baixuan, et al.
Published: (2025)
Regist3R: Incremental Registration with Stereo Foundation Model
by: Liu, Sidun, et al.
Published: (2025)
by: Liu, Sidun, et al.
Published: (2025)
V2Flow: Unifying Visual Tokenization and Large Language Model Vocabularies for Autoregressive Image Generation
by: Zhang, Guiwei, et al.
Published: (2025)
by: Zhang, Guiwei, et al.
Published: (2025)
Unsupervised Detection of Fraudulent Transactions in E-commerce Using Contrastive Learning
by: Li, Xuan, et al.
Published: (2025)
by: Li, Xuan, et al.
Published: (2025)
IndustryBench: Probing the Industrial Knowledge Boundaries of LLMs
by: Bai, Songlin, et al.
Published: (2026)
by: Bai, Songlin, et al.
Published: (2026)
A degeneration formula of Donaldson-Thomas theory on Calabi-Yau 4-folds
by: Cao, Yalong, et al.
Published: (2024)
by: Cao, Yalong, et al.
Published: (2024)
Stable envelopes for critical loci
by: Cao, Yalong, et al.
Published: (2025)
by: Cao, Yalong, et al.
Published: (2025)
Similar Items
-
AgentFrontier: Expanding the Capability Frontier of LLM Agents with ZPD-Guided Data Synthesis
by: Chen, Xuanzhong, et al.
Published: (2025) -
EcomMMMU: Strategic Utilization of Visuals for Robust Multimodal E-commerce Models
by: Ling, Xinyi, et al.
Published: (2025) -
WebResearcher: Unleashing unbounded reasoning capability in Long-Horizon Agents
by: Qiao, Zile, et al.
Published: (2025) -
OxyEcomBench: Benchmarking Multimodal Foundation Models across E-Commerce Ecosystems
by: Liu, Yong, et al.
Published: (2026) -
IterResearch: Rethinking Long-Horizon Agents with Interaction Scaling
by: Chen, Guoxin, et al.
Published: (2025)