:: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Yan, Yu, Sun, Sheng, Li, Mingfeng, Yang, Zheming, Zhu, Chiwei, Ma, Fei, Xu, Benfeng, Liu, Min, Li, Qi
Format:	Preprint
Published:	2026
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2601.04093
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

From Real to Synthetic: Synthesizing Millions of Diversified and Complicated User Instructions with Attributed Grounding
by: Zhu, Chiwei, et al.
Published: (2025)

Automated Creativity Evaluation for Large Language Models: A Reference-Based Approach
by: Li, Ruizhe, et al.
Published: (2025)

DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents
by: Du, Mingxuan, et al.
Published: (2025)

DeepResearch Bench II: Diagnosing Deep Research Agents via Rubrics from Expert Report
by: Li, Ruizhe, et al.
Published: (2026)

Red-teaming the Multimodal Reasoning: Jailbreaking Vision-Language Models via Cross-modal Entanglement Attacks
by: Yan, Yu, et al.
Published: (2026)

SafeSearch: Automated Red-Teaming of LLM-Based Search Agents
by: Dong, Jianshuo, et al.
Published: (2025)

MCP-AgentBench: Evaluating Real-World Language Agent Performance with MCP-Mediated Tools
by: Guo, Zikang, et al.
Published: (2025)

Ruby Teaming: Improving Quality Diversity Search with Memory for Automated Red Teaming
by: Han, Vernon Toh Yan, et al.
Published: (2024)

When Search Goes Wrong: Red-Teaming Web-Augmented Large Language Models
by: Ou, Haoran, et al.
Published: (2025)

Rationales Are Not Silver Bullets: Measuring the Impact of Rationales on Model Performance and Reliability
by: Zhu, Chiwei, et al.
Published: (2025)

FS-Researcher: Test-Time Scaling for Long-Horizon Research Tasks with File-System-Based Agents
by: Zhu, Chiwei, et al.
Published: (2026)

WildGraphBench: Benchmarking GraphRAG with Wild-Source Corpora
by: Wang, Pengyu, et al.
Published: (2026)

A-RAG: Scaling Agentic Retrieval-Augmented Generation via Hierarchical Retrieval Interfaces
by: Du, Mingxuan, et al.
Published: (2026)

Benchmarking LLMs in an Embodied Environment for Blue Team Threat Hunting
by: Liu, Xiaoqun, et al.
Published: (2025)

Genesis: Evolving Attack Strategies for LLM Web Agent Red-Teaming
by: Zhang, Zheng, et al.
Published: (2025)

Operationalizing a Threat Model for Red-Teaming Large Language Models (LLMs)
by: Verma, Apurv, et al.
Published: (2024)

ARMs: Adaptive Red-Teaming Agent against Multimodal Models with Plug-and-Play Attacks
by: Chen, Zhaorun, et al.
Published: (2025)

When LLMs Go Online: The Emerging Threat of Web-Enabled LLMs
by: Kim, Hanna, et al.
Published: (2024)

Rethinking the Threat and Accessibility of Adversarial Attacks against Face Recognition Systems
by: Cao, Yuxin, et al.
Published: (2024)

Wiki Live Challenge: Challenging Deep Research Agents with Expert-Level Wikipedia Articles
by: Wang, Shaohan, et al.
Published: (2026)

CulturalTeaming: AI-Assisted Interactive Red-Teaming for Challenging LLMs' (Lack of) Multicultural Knowledge
by: Chiu, Yu Ying, et al.
Published: (2024)

ZeroSearch: Incentivize the Search Capability of LLMs without Searching
by: Sun, Hao, et al.
Published: (2025)

Dispatching and Pricing in Two-Sided Spatial Queues
by: Xu, Ang, et al.
Published: (2025)

Towards LLM-Based Automatic Playtest
by: Zhao, Yan, et al.
Published: (2025)

T-MAP: Red-Teaming LLM Agents with Trajectory-aware Evolutionary Search
by: Lee, Hyomin, et al.
Published: (2026)

ATAAT: Adaptive Threat-Aware Adversarial Tuning Framework against Backdoor Attacks on Vision-Language-Action Models
by: Chen, Kewei, et al.
Published: (2026)

ScholarSearch: Benchmarking Scholar Searching Ability of LLMs
by: Zhou, Junting, et al.
Published: (2025)

AutoRedTeamer: Autonomous Red Teaming with Lifelong Attack Integration
by: Zhou, Andy, et al.
Published: (2025)

An Efficient Proximity Graph-based Approach to Table Union Search
by: Xie, Yiming, et al.
Published: (2025)

WebOperator: Action-Aware Tree Search for Autonomous Agents in Web Environment
by: Dihan, Mahir Labib, et al.
Published: (2025)

MUZZLE: Adaptive Agentic Red-Teaming of Web Agents Against Indirect Prompt Injection Attacks
by: Syros, Georgios, et al.
Published: (2026)

OpenRT: An Open-Source Red Teaming Framework for Multimodal LLMs
by: Wang, Xin, et al.
Published: (2026)

WebANNS: Fast and Efficient Approximate Nearest Neighbor Search in Web Browsers
by: Liu, Mugeng, et al.
Published: (2025)

Searching the Web: Introduction to Search Techniques on the Web. [Videotape.]
Published: (1997)

Red Teaming Visual Language Models
by: Li, Mukai, et al.
Published: (2024)

FreezeVLA: Action-Freezing Attacks against Vision-Language-Action Models
by: Wang, Xin, et al.
Published: (2025)

When LLMs Team Up: A Coordinated Attack Framework for Automated Cyber Intrusions
by: Qi, Minfeng, et al.
Published: (2026)

Coherence Fraction in Grover Search Algorithm
by: Zhou, Si-Qi, et al.
Published: (2025)

Topic Knowledge and Online Catalog Search Formulation.
by: Allen, Bryce
Published: (1991)

An Index-based Approach for Efficient and Effective Web Content Extraction
by: Chen, Yihan, et al.
Published: (2025)