:: Library Catalog

Bibliographic Details
Main Authors:	Min, Rui; Qiao, Zile; Xu, Ze; Zhai, Jiawen; Gao, Wenyu; Chen, Xuanzhong; Sun, Haozhen; Zhang, Zhen; Wang, Xinyu; Zhou, Hong; Yin, Wenbiao; Zhang, Bo; Zhou, Xuan; Yan, Ming; Jiang, Yong; Liu, Haicheng; Ding, Liang; Zou, Ling; Fung, Yi R.; Li, Yalong; Xie, Pengjun
Format:	Preprint
Published:	2025
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2512.08868

Similar Items

AgentFrontier: Expanding the Capability Frontier of LLM Agents with ZPD-Guided Data Synthesis
by: Chen, Xuanzhong, et al.
Published: (2025)

EcomMMMU: Strategic Utilization of Visuals for Robust Multimodal E-commerce Models
by: Ling, Xinyi, et al.
Published: (2025)

WebResearcher: Unleashing unbounded reasoning capability in Long-Horizon Agents
by: Qiao, Zile, et al.
Published: (2025)

OxyEcomBench: Benchmarking Multimodal Foundation Models across E-Commerce Ecosystems
by: Liu, Yong, et al.
Published: (2026)

IterResearch: Rethinking Long-Horizon Agents with Interaction Scaling
by: Chen, Guoxin, et al.
Published: (2025)

HiMA-Ecom: Enabling Joint Training of Hierarchical Multi-Agent E-commerce Assistants
by: Hu, Junxing, et al.
Published: (2025)

EcomScriptBench: A Multi-task Benchmark for E-commerce Script Planning via Step-wise Intention-Driven Product Association
by: Wang, Weiqi, et al.
Published: (2025)

DynamicBench: Evaluating Real-Time Report Generation in Large Language Models
by: Li, Jingyao, et al.
Published: (2025)

ChineseEcomQA: A Scalable E-commerce Concept Evaluation Benchmark for Large Language Models
by: Chen, Haibin, et al.
Published: (2025)

Scaling Agents via Continual Pre-training
by: Su, Liangcai, et al.
Published: (2025)

EcomEdit: An Automated E-commerce Knowledge Editing Framework for Enhanced Product and Purchase Intention Understanding
by: Lau, Ching Ming Samuel, et al.
Published: (2024)

ProteinBench: A Holistic Evaluation of Protein Foundation Models
by: Ye, Fei, et al.
Published: (2024)

KBM: Delineating Knowledge Boundary for Adaptive Retrieval in Large Language Models
by: Zhang, Zhen, et al.
Published: (2024)

The Breakthrough and Confrontation of Mainland Chinese Opera Films in Hong Kong under the Cold War Framework (1953–1957)
by: Du, Jiachen, et al.
Published: (2025)

Supportiveness-based Knowledge Rewriting for Retrieval-augmented Language Modeling
by: Qiao, Zile, et al.
Published: (2024)

ZeroSearch: Incentivize the Search Capability of LLMs without Searching
by: Sun, Hao, et al.
Published: (2025)

BrowseConf: Confidence-Guided Test-Time Scaling for Web Agents
by: Ou, Litu, et al.
Published: (2025)

RareBench: Can LLMs Serve as Rare Diseases Specialists?
by: Chen, Xuanzhong, et al.
Published: (2024)

DecoupleSearch: Decouple Planning and Search via Hierarchical Reward Modeling
by: Sun, Hao, et al.
Published: (2025)

Mix-Ecom: Towards Mixed-Type E-Commerce Dialogues with Complex Domain Rules
by: Zhou, Chenyu, et al.
Published: (2025)

AgentFold: Long-Horizon Web Agents with Proactive Context Management
by: Ye, Rui, et al.
Published: (2025)

WebShaper: Agentically Data Synthesizing via Information-Seeking Formalization
by: Tao, Zhengwei, et al.
Published: (2025)

WebSailor-V2: Bridging the Chasm to Proprietary Agents via Synthetic Data and Scalable Reinforcement Learning
by: Li, Kuan, et al.
Published: (2025)

Simple Vertex Algebras Arising From Congruence Subgroups
by: Dai, Xuanzhong, et al.
Published: (2022)

Hall algebras associated to root categories
by: Zhang, Haicheng
Published: (2022)

CodeCriticBench: A Holistic Code Critique Benchmark for Large Language Models
by: Zhang, Alexander, et al.
Published: (2025)

Nested Browser-Use Learning for Agentic Information Seeking
by: Li, Baixuan, et al.
Published: (2025)

WebLeaper: Empowering Efficiency and Efficacy in WebAgent via Enabling Info-Rich Seeking
by: Tao, Zhengwei, et al.
Published: (2025)

Valley3: Scaling Omni Foundation Models for E-commerce
by: Chen, Zeyu, et al.
Published: (2026)

C sequential optimization numbers
by: Hui, Zile
Published: (2024)

Extriangulated length categories: torsion classes and $τ$-tilting theory
by: Wang, Li, et al.
Published: (2025)

WebWalker: Benchmarking LLMs in Web Traversal
by: Wu, Jialong, et al.
Published: (2025)

LookBench: A Live and Holistic Open Benchmark for Fashion Image Retrieval
by: Gensmo.ai, et al.
Published: (2026)

ParallelMuse: Agentic Parallel Thinking for Deep Information Seeking
by: Li, Baixuan, et al.
Published: (2025)

Regist3R: Incremental Registration with Stereo Foundation Model
by: Liu, Sidun, et al.
Published: (2025)

V2Flow: Unifying Visual Tokenization and Large Language Model Vocabularies for Autoregressive Image Generation
by: Zhang, Guiwei, et al.
Published: (2025)

Unsupervised Detection of Fraudulent Transactions in E-commerce Using Contrastive Learning
by: Li, Xuan, et al.
Published: (2025)

IndustryBench: Probing the Industrial Knowledge Boundaries of LLMs
by: Bai, Songlin, et al.
Published: (2026)

A degeneration formula of Donaldson-Thomas theory on Calabi-Yau 4-folds
by: Cao, Yalong, et al.
Published: (2024)

Stable envelopes for critical loci
by: Cao, Yalong, et al.
Published: (2025)