Saved in:
| Main Authors: | Zhu, Zihao, Wu, Bingzhe, Zhang, Zhengyou, Han, Lei, Liu, Qingshan, Wu, Baoyuan |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2408.04449 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
VDC: Versatile Data Cleanser based on Visual-Linguistic Inconsistency by Multimodal Large Language Models
by: Zhu, Zihao, et al.
Published: (2023)
by: Zhu, Zihao, et al.
Published: (2023)
The Authorization-Execution Gap Is a Major Safety and Security Problem in Open-World Agents
by: Wu, Baoyuan, et al.
Published: (2026)
by: Wu, Baoyuan, et al.
Published: (2026)
Spurious Feature Eraser: Stabilizing Test-Time Adaptation for Vision-Language Foundation Model
by: Ma, Huan, et al.
Published: (2024)
by: Ma, Huan, et al.
Published: (2024)
MFE-ETP: A Comprehensive Evaluation Benchmark for Multi-modal Foundation Models on Embodied Task Planning
by: Zhang, Min, et al.
Published: (2024)
by: Zhang, Min, et al.
Published: (2024)
ICAT: Incident-Case-Grounded Adaptive Testing for Physical-Risk Prediction in Embodied World Models
by: Lai, Zhenglin, et al.
Published: (2026)
by: Lai, Zhenglin, et al.
Published: (2026)
HMGIE: Hierarchical and Multi-Grained Inconsistency Evaluation for Vision-Language Data Cleansing
by: Zhu, Zihao, et al.
Published: (2024)
by: Zhu, Zihao, et al.
Published: (2024)
Unveiling Covert Toxicity in Multimodal Data via Toxicity Association Graphs: A Graph-Based Metric and Interpretable Detection Framework
by: Wu, Guanzong, et al.
Published: (2026)
by: Wu, Guanzong, et al.
Published: (2026)
BrandFusion: A Multi-Agent Framework for Seamless Brand Integration in Text-to-Video Generation
by: Zhu, Zihao, et al.
Published: (2026)
by: Zhu, Zihao, et al.
Published: (2026)
MADRA: Multi-Agent Debate for Risk-Aware Embodied Planning
by: Wang, Junjian, et al.
Published: (2025)
by: Wang, Junjian, et al.
Published: (2025)
ET-Plan-Bench: Embodied Task-level Planning Benchmark Towards Spatial-Temporal Cognition with Foundation Models
by: Zhang, Lingfeng, et al.
Published: (2024)
by: Zhang, Lingfeng, et al.
Published: (2024)
STMA: A Spatio-Temporal Memory Agent for Long-Horizon Embodied Task Planning
by: Lei, Mingcong, et al.
Published: (2025)
by: Lei, Mingcong, et al.
Published: (2025)
AdvChain: Adversarial Chain-of-Thought Tuning for Robust Safety Alignment of Large Reasoning Models
by: Zhu, Zihao, et al.
Published: (2025)
by: Zhu, Zihao, et al.
Published: (2025)
AgenticCache: Cache-Driven Asynchronous Planning for Embodied AI Agents
by: Kim, Hojoon, et al.
Published: (2026)
by: Kim, Hojoon, et al.
Published: (2026)
ESearch-R1: Learning Cost-Aware MLLM Agents for Interactive Embodied Search via Reinforcement Learning
by: Zhou, Weijie, et al.
Published: (2025)
by: Zhou, Weijie, et al.
Published: (2025)
Attacks in Adversarial Machine Learning: A Systematic Survey from the Life-cycle Perspective
by: Wu, Baoyuan, et al.
Published: (2023)
by: Wu, Baoyuan, et al.
Published: (2023)
A Survey on Robotics with Foundation Models: toward Embodied AI
by: Xu, Zhiyuan, et al.
Published: (2024)
by: Xu, Zhiyuan, et al.
Published: (2024)
BrainMem: Brain-Inspired Evolving Memory for Embodied Agent Task Planning
by: Ma, Xiaoyu, et al.
Published: (2026)
by: Ma, Xiaoyu, et al.
Published: (2026)
SENTINEL: A Multi-Level Formal Framework for Safety Evaluation of Foundation Model-based Embodied Agents
by: Zhan, Simon Sinong, et al.
Published: (2025)
by: Zhan, Simon Sinong, et al.
Published: (2025)
The Great March 100: 100 Detail-oriented Tasks for Evaluating Embodied AI Agents
by: Wang, Ziyu, et al.
Published: (2026)
by: Wang, Ziyu, et al.
Published: (2026)
To Think or Not to Think: Exploring the Unthinking Vulnerability in Large Reasoning Models
by: Zhu, Zihao, et al.
Published: (2025)
by: Zhu, Zihao, et al.
Published: (2025)
MAGMA: A Multi-Graph based Agentic Memory Architecture for AI Agents
by: Jiang, Dongming, et al.
Published: (2026)
by: Jiang, Dongming, et al.
Published: (2026)
Automatic Cognitive Task Generation for In-Situ Evaluation of Embodied Agents
by: He, Xinyi, et al.
Published: (2026)
by: He, Xinyi, et al.
Published: (2026)
Physical Reasoning and Object Planning for Household Embodied Agents
by: Agrawal, Ayush, et al.
Published: (2023)
by: Agrawal, Ayush, et al.
Published: (2023)
Plan Verification for LLM-Based Embodied Task Completion Agents
by: Hariharan, Ananth, et al.
Published: (2025)
by: Hariharan, Ananth, et al.
Published: (2025)
SDA-PLANNER: State-Dependency Aware Adaptive Planner for Embodied Task Planning
by: Shen, Zichao, et al.
Published: (2025)
by: Shen, Zichao, et al.
Published: (2025)
Reliable Poisoned Sample Detection against Backdoor Attacks Enhanced by Sharpness Aware Minimization
by: Zhang, Mingda, et al.
Published: (2024)
by: Zhang, Mingda, et al.
Published: (2024)
EmbodiSkill: Skill-Aware Reflection for Self-Evolving Embodied Agents
by: Ju, Ruofei, et al.
Published: (2026)
by: Ju, Ruofei, et al.
Published: (2026)
Towards Objectively Benchmarking Social Intelligence for Language Agents at Action Level
by: Wang, Chenxu, et al.
Published: (2024)
by: Wang, Chenxu, et al.
Published: (2024)
CycloneMAE: A Scalable Multi-Task Learning Model for Global Tropical Cyclone Probabilistic Forecasting
by: Hang, Renlong, et al.
Published: (2026)
by: Hang, Renlong, et al.
Published: (2026)
Context-Aware Planning and Environment-Aware Memory for Instruction Following Embodied Agents
by: Kim, Byeonghwi, et al.
Published: (2023)
by: Kim, Byeonghwi, et al.
Published: (2023)
FauForensics: Boosting Audio-Visual Deepfake Detection with Facial Action Units
by: Wang, Jian, et al.
Published: (2025)
by: Wang, Jian, et al.
Published: (2025)
SafeScientist: Toward Risk-Aware Scientific Discoveries by LLM Agents
by: Zhu, Kunlun, et al.
Published: (2025)
by: Zhu, Kunlun, et al.
Published: (2025)
Describe, Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task Agents
by: Wang, Zihao, et al.
Published: (2023)
by: Wang, Zihao, et al.
Published: (2023)
SocialNav: Training Human-Inspired Foundation Model for Socially-Aware Embodied Navigation
by: Chen, Ziyi, et al.
Published: (2025)
by: Chen, Ziyi, et al.
Published: (2025)
Embodied AI Agents: Modeling the World
by: Fung, Pascale, et al.
Published: (2025)
by: Fung, Pascale, et al.
Published: (2025)
SafeAgentBench: A Benchmark for Safe Task Planning of Embodied LLM Agents
by: Yin, Sheng, et al.
Published: (2024)
by: Yin, Sheng, et al.
Published: (2024)
LLM3:Large Language Model-based Task and Motion Planning with Motion Failure Reasoning
by: Wang, Shu, et al.
Published: (2024)
by: Wang, Shu, et al.
Published: (2024)
Towards Responsible Generative AI: A Reference Architecture for Designing Foundation Model based Agents
by: Lu, Qinghua, et al.
Published: (2023)
by: Lu, Qinghua, et al.
Published: (2023)
Training Cross-Morphology Embodied AI Agents: From Practical Challenges to Theoretical Foundations
by: Liu, Shaoshan, et al.
Published: (2025)
by: Liu, Shaoshan, et al.
Published: (2025)
TPS-Bench: Evaluating AI Agents' Tool Planning \& Scheduling Abilities in Compounding Tasks
by: Xu, Hanwen, et al.
Published: (2025)
by: Xu, Hanwen, et al.
Published: (2025)
Similar Items
-
VDC: Versatile Data Cleanser based on Visual-Linguistic Inconsistency by Multimodal Large Language Models
by: Zhu, Zihao, et al.
Published: (2023) -
The Authorization-Execution Gap Is a Major Safety and Security Problem in Open-World Agents
by: Wu, Baoyuan, et al.
Published: (2026) -
Spurious Feature Eraser: Stabilizing Test-Time Adaptation for Vision-Language Foundation Model
by: Ma, Huan, et al.
Published: (2024) -
MFE-ETP: A Comprehensive Evaluation Benchmark for Multi-modal Foundation Models on Embodied Task Planning
by: Zhang, Min, et al.
Published: (2024) -
ICAT: Incident-Case-Grounded Adaptive Testing for Physical-Risk Prediction in Embodied World Models
by: Lai, Zhenglin, et al.
Published: (2026)