Saved in:
| Main Authors: | Yang, Linyi, Weng, Yixuan |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.12194 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
AgentStudio: A Toolkit for Building General Virtual Agents
by: Zheng, Longtao, et al.
Published: (2024)
by: Zheng, Longtao, et al.
Published: (2024)
CycleResearcher: Improving Automated Research via Automated Review
by: Weng, Yixuan, et al.
Published: (2024)
by: Weng, Yixuan, et al.
Published: (2024)
Deep Research Agents: A Systematic Examination And Roadmap
by: Huang, Yuxuan, et al.
Published: (2025)
by: Huang, Yuxuan, et al.
Published: (2025)
AI Scientists Fail Without Strong Implementation Capability
by: Zhu, Minjun, et al.
Published: (2025)
by: Zhu, Minjun, et al.
Published: (2025)
See-Control: A Multimodal Agent Framework for Smartphone Interaction with a Robotic Arm
by: Zhao, Haoyu, et al.
Published: (2025)
by: Zhao, Haoyu, et al.
Published: (2025)
MineStudio: A Streamlined Package for Minecraft AI Agent Development
by: Cai, Shaofei, et al.
Published: (2024)
by: Cai, Shaofei, et al.
Published: (2024)
Learning to Intervene on Concept Bottlenecks
by: Steinmann, David, et al.
Published: (2023)
by: Steinmann, David, et al.
Published: (2023)
StepShield: When, Not Whether to Intervene on Rogue Agents
by: Felicia, Gloria, et al.
Published: (2026)
by: Felicia, Gloria, et al.
Published: (2026)
A Flexible Multi-Agent LLM-Human Framework for Fast Human Validated Tool Building
by: Xavier, Daull, et al.
Published: (2025)
by: Xavier, Daull, et al.
Published: (2025)
AutoGen Studio: A No-Code Developer Tool for Building and Debugging Multi-Agent Systems
by: Dibia, Victor, et al.
Published: (2024)
by: Dibia, Victor, et al.
Published: (2024)
Can AI Perceive Physical Danger and Intervene?
by: Jindal, Abhishek, et al.
Published: (2025)
by: Jindal, Abhishek, et al.
Published: (2025)
DeepReview: Improving LLM-based Paper Review with Human-like Deep Thinking Process
by: Zhu, Minjun, et al.
Published: (2025)
by: Zhu, Minjun, et al.
Published: (2025)
SimWorld Studio: Automatic Environment Generation with Evolving Coding Agent for Embodied Agent Learning
by: Kang, Haoqiang, et al.
Published: (2026)
by: Kang, Haoqiang, et al.
Published: (2026)
DEEPMED: Building a Medical DeepResearch Agent via Multi-hop Med-Search Data and Turn-Controlled Agentic Training & Inference
by: Wang, Zihan, et al.
Published: (2026)
by: Wang, Zihan, et al.
Published: (2026)
Deep Research Comparator: A Platform For Fine-grained Human Annotations of Deep Research Agents
by: Chandrahasan, Prahaladh, et al.
Published: (2025)
by: Chandrahasan, Prahaladh, et al.
Published: (2025)
How Far Are AI Scientists from Changing the World?
by: Xie, Qiujie, et al.
Published: (2025)
by: Xie, Qiujie, et al.
Published: (2025)
From Understanding the World to Intervening in It: A Unified Multi-Scale Framework for Embodied Cognition
by: Wang, Maijunxian
Published: (2025)
by: Wang, Maijunxian
Published: (2025)
Building from Scratch: A Multi-Agent Framework with Human-in-the-Loop for Multilingual Legal Terminology Mapping
by: Meng, Lingyi, et al.
Published: (2025)
by: Meng, Lingyi, et al.
Published: (2025)
ActionStudio: A Lightweight Framework for Data and Training of Large Action Models
by: Zhang, Jianguo, et al.
Published: (2025)
by: Zhang, Jianguo, et al.
Published: (2025)
LiteResearcher: A Scalable Agentic RL Training Framework for Deep Research Agent
by: Li, Wanli, et al.
Published: (2026)
by: Li, Wanli, et al.
Published: (2026)
BioMedArena: An Open-source Toolkit for Building and Evaluating Biomedical Deep Research Agents
by: Wu, Jinge, et al.
Published: (2026)
by: Wu, Jinge, et al.
Published: (2026)
Adaptive Collaboration with Humans: Metacognitive Policy Optimization for Multi-Agent LLMs with Continual Learning
by: Yang, Wei, et al.
Published: (2026)
by: Yang, Wei, et al.
Published: (2026)
How Likely Do LLMs with CoT Mimic Human Reasoning?
by: Bao, Guangsheng, et al.
Published: (2024)
by: Bao, Guangsheng, et al.
Published: (2024)
BrowserAgent: Building Web Agents with Human-Inspired Web Browsing Actions
by: Yu, Tao, et al.
Published: (2025)
by: Yu, Tao, et al.
Published: (2025)
How to Build AI Agents by Augmenting LLMs with Codified Human Expert Domain Knowledge? A Software Engineering Framework
by: uulu, Choro Ulan, et al.
Published: (2026)
by: uulu, Choro Ulan, et al.
Published: (2026)
V-CEM: Bridging Performance and Intervenability in Concept-based Models
by: De Santis, Francesco, et al.
Published: (2025)
by: De Santis, Francesco, et al.
Published: (2025)
CentaurTA Studio: A Self-Improving Human-Agent Collaboration System for Thematic Analysis
by: Wang, Lei, et al.
Published: (2026)
by: Wang, Lei, et al.
Published: (2026)
Identifying Intervenable and Interpretable Features via Orthogonality Regularization
by: Miller, Moritz, et al.
Published: (2026)
by: Miller, Moritz, et al.
Published: (2026)
Intervene-All-Paths: Unified Mitigation of LVLM Hallucinations across Alignment Formats
by: Qian, Jiaye, et al.
Published: (2025)
by: Qian, Jiaye, et al.
Published: (2025)
Abduct, Act, Predict: Scaffolding Causal Inference for Automated Failure Attribution in Multi-Agent Systems
by: West, Alva, et al.
Published: (2025)
by: West, Alva, et al.
Published: (2025)
Judging by Appearances? Auditing and Intervening Vision-Language Models for Bail Prediction
by: Basu, Sagnik, et al.
Published: (2025)
by: Basu, Sagnik, et al.
Published: (2025)
Cognitive Kernel-Pro: A Framework for Deep Research Agents and Agent Foundation Models Training
by: Fang, Tianqing, et al.
Published: (2025)
by: Fang, Tianqing, et al.
Published: (2025)
AMA: Adaptive Memory via Multi-Agent Collaboration
by: Huang, Weiquan, et al.
Published: (2026)
by: Huang, Weiquan, et al.
Published: (2026)
XBOUND: Exploring Capability Boundaries of Device-Control Agents at the State Level
by: Zhang, Shaoqing, et al.
Published: (2025)
by: Zhang, Shaoqing, et al.
Published: (2025)
Deep Researcher Agent: An Autonomous Framework for 24/7 Deep Learning Experimentation with Zero-Cost Monitoring
by: Zhang, Xiangyue
Published: (2026)
by: Zhang, Xiangyue
Published: (2026)
ClawGym: A Scalable Framework for Building Effective Claw Agents
by: Bai, Fei, et al.
Published: (2026)
by: Bai, Fei, et al.
Published: (2026)
Concept Layers: Enhancing Interpretability and Intervenability via LLM Conceptualization
by: Bidusa, Or Raphael, et al.
Published: (2025)
by: Bidusa, Or Raphael, et al.
Published: (2025)
LawThinker: A Deep Research Legal Agent in Dynamic Environments
by: Yang, Xinyu, et al.
Published: (2026)
by: Yang, Xinyu, et al.
Published: (2026)
DeepResearch-9K: A Challenging Benchmark Dataset of Deep-Research Agent
by: Wu, Tongzhou, et al.
Published: (2026)
by: Wu, Tongzhou, et al.
Published: (2026)
IntervenGen: Interventional Data Generation for Robust and Data-Efficient Robot Imitation Learning
by: Hoque, Ryan, et al.
Published: (2024)
by: Hoque, Ryan, et al.
Published: (2024)
Similar Items
-
AgentStudio: A Toolkit for Building General Virtual Agents
by: Zheng, Longtao, et al.
Published: (2024) -
CycleResearcher: Improving Automated Research via Automated Review
by: Weng, Yixuan, et al.
Published: (2024) -
Deep Research Agents: A Systematic Examination And Roadmap
by: Huang, Yuxuan, et al.
Published: (2025) -
AI Scientists Fail Without Strong Implementation Capability
by: Zhu, Minjun, et al.
Published: (2025) -
See-Control: A Multimodal Agent Framework for Smartphone Interaction with a Robotic Arm
by: Zhao, Haoyu, et al.
Published: (2025)