Saved in:
| Main Authors: | Fan, HuiMing, Wang, Xiao, Chu, Zheng, Wang, Qianyu, Wang, Zhuoyao, Liu, Ming, Qin, Bing, XingYu |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.28721 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
BrowseComp-$V^3$: A Visual, Vertical, and Verifiable Benchmark for Multimodal Browsing Agents
by: Zhang, Huanyao, et al.
Published: (2026)
by: Zhang, Huanyao, et al.
Published: (2026)
MM-BrowseComp: A Comprehensive Benchmark for Multimodal Browsing Agents
by: Li, Shilong, et al.
Published: (2025)
by: Li, Shilong, et al.
Published: (2025)
REDSearcher: A Scalable and Cost-Efficient Framework for Long-Horizon Search Agents
by: Chu, Zheng, et al.
Published: (2026)
by: Chu, Zheng, et al.
Published: (2026)
BrowseComp: A Simple Yet Challenging Benchmark for Browsing Agents
by: Wei, Jason, et al.
Published: (2025)
by: Wei, Jason, et al.
Published: (2025)
VisBrowse-Bench: Benchmarking Visual-Native Search for Multimodal Browsing Agents
by: Zhang, Zhengbo, et al.
Published: (2026)
by: Zhang, Zhengbo, et al.
Published: (2026)
K-BrowseComp: A Web Browsing Agent Benchmark Grounded in Korean Contexts
by: Lee, Nahyun, et al.
Published: (2026)
by: Lee, Nahyun, et al.
Published: (2026)
Search Theory and Browsing
by: Morse, Philip M.
Published: (1970)
by: Morse, Philip M.
Published: (1970)
W2T: LoRA Weights Already Know What They Can Do
by: Han, Xiaolong, et al.
Published: (2026)
by: Han, Xiaolong, et al.
Published: (2026)
LLM Agents Already Know When to Call Tools -- Even Without Reasoning
by: Sun, Chung-En, et al.
Published: (2026)
by: Sun, Chung-En, et al.
Published: (2026)
On Browsing: The Use of Search Theory in the Search for Information.
by: Morse, Philip M.
Published: (1970)
by: Morse, Philip M.
Published: (1970)
InteractComp: Evaluating Search Agents With Ambiguous Queries
by: Deng, Mingyi, et al.
Published: (2025)
by: Deng, Mingyi, et al.
Published: (2025)
MA-GTS: A Multi-Agent Framework for Solving Complex Graph Problems in Real-World Applications
by: Yuan, Zike, et al.
Published: (2025)
by: Yuan, Zike, et al.
Published: (2025)
AGPO: Asymmetric Group Policy Optimization for Verifiable Reasoning and Search Ads Relevance at JD
by: Xu, Yang, et al.
Published: (2026)
by: Xu, Yang, et al.
Published: (2026)
BrowseComp-ZH: Benchmarking Web Browsing Ability of Large Language Models in Chinese
by: Zhou, Peilin, et al.
Published: (2025)
by: Zhou, Peilin, et al.
Published: (2025)
MMSearch-Plus: Benchmarking Provenance-Aware Search for Multimodal Browsing Agents
by: Tao, Xijia, et al.
Published: (2025)
by: Tao, Xijia, et al.
Published: (2025)
What Will Be Already Exists
Published: (2021)
Published: (2021)
SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals
by: Yang, Ruihan, et al.
Published: (2024)
by: Yang, Ruihan, et al.
Published: (2024)
OceanDocs Search Guide: Browse, Discover, Simple and Advanced Search.
by: Simpson, Pauline
Published: (2016)
by: Simpson, Pauline
Published: (2016)
FinSearchComp: Towards a Realistic, Expert-Level Evaluation of Financial Search and Reasoning
by: Hu, Liang, et al.
Published: (2025)
by: Hu, Liang, et al.
Published: (2025)
MedBrowseComp: Benchmarking Medical Deep Research and Computer Use
by: Chen, Shan, et al.
Published: (2025)
by: Chen, Shan, et al.
Published: (2025)
HSCodeComp: A Realistic and Expert-level Benchmark for Deep Search Agents in Hierarchical Rule Application
by: Yang, Yiqian, et al.
Published: (2025)
by: Yang, Yiqian, et al.
Published: (2025)
Searching for gravitational-wave bursts with space-borne detectors
by: Wu, Zheng, et al.
Published: (2023)
by: Wu, Zheng, et al.
Published: (2023)
Browsing Lost Unformed Recollections: A Benchmark for Tip-of-the-Tongue Search and Reasoning
by: CH-Wang, Sky, et al.
Published: (2025)
by: CH-Wang, Sky, et al.
Published: (2025)
Self-Critique Guided Iterative Reasoning for Multi-hop Question Answering
by: Chu, Zheng, et al.
Published: (2025)
by: Chu, Zheng, et al.
Published: (2025)
FP-Agent: Fingerprinting AI Browsing Agents
by: Wang, Ethan, et al.
Published: (2026)
by: Wang, Ethan, et al.
Published: (2026)
BrowseComp-Plus: A More Fair and Transparent Evaluation Benchmark of Deep-Research Agent
by: Chen, Zijian, et al.
Published: (2025)
by: Chen, Zijian, et al.
Published: (2025)
Sonar-TS: Search-Then-Verify Natural Language Querying for Time Series Databases
by: Tan, Zhao, et al.
Published: (2026)
by: Tan, Zhao, et al.
Published: (2026)
The World Already Knows: Tracing Unity–Disunity in the Physical Record
by: van der Erve, Marcus, et al.
Published: (2025)
by: van der Erve, Marcus, et al.
Published: (2025)
Bridging Search and Recommendation through Latent Cross Reasoning
by: Shi, Teng, et al.
Published: (2025)
by: Shi, Teng, et al.
Published: (2025)
Benefit from Rich: Tackling Search Interaction Sparsity in Search Enhanced Recommendation
by: Shi, Teng, et al.
Published: (2025)
by: Shi, Teng, et al.
Published: (2025)
Recon, Answer, Verify: Agents in Search of Truth
by: Shukla, Satyam, et al.
Published: (2025)
by: Shukla, Satyam, et al.
Published: (2025)
GraCoRe: Benchmarking Graph Comprehension and Complex Reasoning in Large Language Models
by: Yuan, Zike, et al.
Published: (2024)
by: Yuan, Zike, et al.
Published: (2024)
Analytical Searching vs. Browsing in Hypertext Information Retrieval Systems.
by: Qiu, Liwen
Published: (1993)
by: Qiu, Liwen
Published: (1993)
Verifying Hierarchic Multipartite and Network Nonlocalities with a Unified Method
by: Luo, Ming-Xing, et al.
Published: (2024)
by: Luo, Ming-Xing, et al.
Published: (2024)
RustCompCert: A Verified and Verifying Compiler for a Sequential Subset of Rust
by: Wu, Jinhua, et al.
Published: (2026)
by: Wu, Jinhua, et al.
Published: (2026)
Why Learn What Physics Already Knows? Realizing Agile mmWave-based Human Pose Estimation via Physics-Guided Preprocessing
by: Zheng, Shuntian, et al.
Published: (2026)
by: Zheng, Shuntian, et al.
Published: (2026)
Quantum Circuit Transformation Based on Tabu Search
by: Jiang, Hui, et al.
Published: (2021)
by: Jiang, Hui, et al.
Published: (2021)
Scaling Flaws of Verifier-Guided Search in Mathematical Reasoning
by: Yu, Fei, et al.
Published: (2025)
by: Yu, Fei, et al.
Published: (2025)
Comparing Children's Use of Browsing and Keyword Searching on the Science Library Catalog.
by: Hirsh, Sandra G., et al.
Published: (1995)
by: Hirsh, Sandra G., et al.
Published: (1995)
Know What You Know: Metacognitive Entropy Calibration for Verifiable RL Reasoning
by: Zhao, Qiannian, et al.
Published: (2026)
by: Zhao, Qiannian, et al.
Published: (2026)
Similar Items
-
BrowseComp-$V^3$: A Visual, Vertical, and Verifiable Benchmark for Multimodal Browsing Agents
by: Zhang, Huanyao, et al.
Published: (2026) -
MM-BrowseComp: A Comprehensive Benchmark for Multimodal Browsing Agents
by: Li, Shilong, et al.
Published: (2025) -
REDSearcher: A Scalable and Cost-Efficient Framework for Long-Horizon Search Agents
by: Chu, Zheng, et al.
Published: (2026) -
BrowseComp: A Simple Yet Challenging Benchmark for Browsing Agents
by: Wei, Jason, et al.
Published: (2025) -
VisBrowse-Bench: Benchmarking Visual-Native Search for Multimodal Browsing Agents
by: Zhang, Zhengbo, et al.
Published: (2026)