:: Library Catalog

Bibliographic Details
Main Authors:	Yang, Linyi, Weng, Yixuan
Format:	Preprint
Published:	2025
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2510.12194

Similar Items

AgentStudio: A Toolkit for Building General Virtual Agents
by: Zheng, Longtao, et al.
Published: (2024)

CycleResearcher: Improving Automated Research via Automated Review
by: Weng, Yixuan, et al.
Published: (2024)

Deep Research Agents: A Systematic Examination And Roadmap
by: Huang, Yuxuan, et al.
Published: (2025)

AI Scientists Fail Without Strong Implementation Capability
by: Zhu, Minjun, et al.
Published: (2025)

See-Control: A Multimodal Agent Framework for Smartphone Interaction with a Robotic Arm
by: Zhao, Haoyu, et al.
Published: (2025)

MineStudio: A Streamlined Package for Minecraft AI Agent Development
by: Cai, Shaofei, et al.
Published: (2024)

Learning to Intervene on Concept Bottlenecks
by: Steinmann, David, et al.
Published: (2023)

StepShield: When, Not Whether to Intervene on Rogue Agents
by: Felicia, Gloria, et al.
Published: (2026)

A Flexible Multi-Agent LLM-Human Framework for Fast Human Validated Tool Building
by: Xavier, Daull, et al.
Published: (2025)

AutoGen Studio: A No-Code Developer Tool for Building and Debugging Multi-Agent Systems
by: Dibia, Victor, et al.
Published: (2024)

Can AI Perceive Physical Danger and Intervene?
by: Jindal, Abhishek, et al.
Published: (2025)

DeepReview: Improving LLM-based Paper Review with Human-like Deep Thinking Process
by: Zhu, Minjun, et al.
Published: (2025)

SimWorld Studio: Automatic Environment Generation with Evolving Coding Agent for Embodied Agent Learning
by: Kang, Haoqiang, et al.
Published: (2026)

DEEPMED: Building a Medical DeepResearch Agent via Multi-hop Med-Search Data and Turn-Controlled Agentic Training & Inference
by: Wang, Zihan, et al.
Published: (2026)

Deep Research Comparator: A Platform For Fine-grained Human Annotations of Deep Research Agents
by: Chandrahasan, Prahaladh, et al.
Published: (2025)

How Far Are AI Scientists from Changing the World?
by: Xie, Qiujie, et al.
Published: (2025)

From Understanding the World to Intervening in It: A Unified Multi-Scale Framework for Embodied Cognition
by: Wang, Maijunxian
Published: (2025)

Building from Scratch: A Multi-Agent Framework with Human-in-the-Loop for Multilingual Legal Terminology Mapping
by: Meng, Lingyi, et al.
Published: (2025)

ActionStudio: A Lightweight Framework for Data and Training of Large Action Models
by: Zhang, Jianguo, et al.
Published: (2025)

LiteResearcher: A Scalable Agentic RL Training Framework for Deep Research Agent
by: Li, Wanli, et al.
Published: (2026)

BioMedArena: An Open-source Toolkit for Building and Evaluating Biomedical Deep Research Agents
by: Wu, Jinge, et al.
Published: (2026)

Adaptive Collaboration with Humans: Metacognitive Policy Optimization for Multi-Agent LLMs with Continual Learning
by: Yang, Wei, et al.
Published: (2026)

How Likely Do LLMs with CoT Mimic Human Reasoning?
by: Bao, Guangsheng, et al.
Published: (2024)

BrowserAgent: Building Web Agents with Human-Inspired Web Browsing Actions
by: Yu, Tao, et al.
Published: (2025)

How to Build AI Agents by Augmenting LLMs with Codified Human Expert Domain Knowledge? A Software Engineering Framework
by: uulu, Choro Ulan, et al.
Published: (2026)

V-CEM: Bridging Performance and Intervenability in Concept-based Models
by: De Santis, Francesco, et al.
Published: (2025)

CentaurTA Studio: A Self-Improving Human-Agent Collaboration System for Thematic Analysis
by: Wang, Lei, et al.
Published: (2026)

Identifying Intervenable and Interpretable Features via Orthogonality Regularization
by: Miller, Moritz, et al.
Published: (2026)

Intervene-All-Paths: Unified Mitigation of LVLM Hallucinations across Alignment Formats
by: Qian, Jiaye, et al.
Published: (2025)

Abduct, Act, Predict: Scaffolding Causal Inference for Automated Failure Attribution in Multi-Agent Systems
by: West, Alva, et al.
Published: (2025)

Judging by Appearances? Auditing and Intervening Vision-Language Models for Bail Prediction
by: Basu, Sagnik, et al.
Published: (2025)

Cognitive Kernel-Pro: A Framework for Deep Research Agents and Agent Foundation Models Training
by: Fang, Tianqing, et al.
Published: (2025)

AMA: Adaptive Memory via Multi-Agent Collaboration
by: Huang, Weiquan, et al.
Published: (2026)

XBOUND: Exploring Capability Boundaries of Device-Control Agents at the State Level
by: Zhang, Shaoqing, et al.
Published: (2025)

Deep Researcher Agent: An Autonomous Framework for 24/7 Deep Learning Experimentation with Zero-Cost Monitoring
by: Zhang, Xiangyue
Published: (2026)

ClawGym: A Scalable Framework for Building Effective Claw Agents
by: Bai, Fei, et al.
Published: (2026)

Concept Layers: Enhancing Interpretability and Intervenability via LLM Conceptualization
by: Bidusa, Or Raphael, et al.
Published: (2025)

LawThinker: A Deep Research Legal Agent in Dynamic Environments
by: Yang, Xinyu, et al.
Published: (2026)

DeepResearch-9K: A Challenging Benchmark Dataset of Deep-Research Agent
by: Wu, Tongzhou, et al.
Published: (2026)

IntervenGen: Interventional Data Generation for Robust and Data-Efficient Robot Imitation Learning
by: Hoque, Ryan, et al.
Published: (2024)