:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Zhu, Zihao, Wu, Bingzhe, Zhang, Zhengyou, Han, Lei, Liu, Qingshan, Wu, Baoyuan
Format:	Preprint
Published:	2024
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2408.04449
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

VDC: Versatile Data Cleanser based on Visual-Linguistic Inconsistency by Multimodal Large Language Models
by: Zhu, Zihao, et al.
Published: (2023)

The Authorization-Execution Gap Is a Major Safety and Security Problem in Open-World Agents
by: Wu, Baoyuan, et al.
Published: (2026)

Spurious Feature Eraser: Stabilizing Test-Time Adaptation for Vision-Language Foundation Model
by: Ma, Huan, et al.
Published: (2024)

MFE-ETP: A Comprehensive Evaluation Benchmark for Multi-modal Foundation Models on Embodied Task Planning
by: Zhang, Min, et al.
Published: (2024)

ICAT: Incident-Case-Grounded Adaptive Testing for Physical-Risk Prediction in Embodied World Models
by: Lai, Zhenglin, et al.
Published: (2026)

HMGIE: Hierarchical and Multi-Grained Inconsistency Evaluation for Vision-Language Data Cleansing
by: Zhu, Zihao, et al.
Published: (2024)

Unveiling Covert Toxicity in Multimodal Data via Toxicity Association Graphs: A Graph-Based Metric and Interpretable Detection Framework
by: Wu, Guanzong, et al.
Published: (2026)

BrandFusion: A Multi-Agent Framework for Seamless Brand Integration in Text-to-Video Generation
by: Zhu, Zihao, et al.
Published: (2026)

MADRA: Multi-Agent Debate for Risk-Aware Embodied Planning
by: Wang, Junjian, et al.
Published: (2025)

ET-Plan-Bench: Embodied Task-level Planning Benchmark Towards Spatial-Temporal Cognition with Foundation Models
by: Zhang, Lingfeng, et al.
Published: (2024)

STMA: A Spatio-Temporal Memory Agent for Long-Horizon Embodied Task Planning
by: Lei, Mingcong, et al.
Published: (2025)

AdvChain: Adversarial Chain-of-Thought Tuning for Robust Safety Alignment of Large Reasoning Models
by: Zhu, Zihao, et al.
Published: (2025)

AgenticCache: Cache-Driven Asynchronous Planning for Embodied AI Agents
by: Kim, Hojoon, et al.
Published: (2026)

ESearch-R1: Learning Cost-Aware MLLM Agents for Interactive Embodied Search via Reinforcement Learning
by: Zhou, Weijie, et al.
Published: (2025)

Attacks in Adversarial Machine Learning: A Systematic Survey from the Life-cycle Perspective
by: Wu, Baoyuan, et al.
Published: (2023)

A Survey on Robotics with Foundation Models: toward Embodied AI
by: Xu, Zhiyuan, et al.
Published: (2024)

BrainMem: Brain-Inspired Evolving Memory for Embodied Agent Task Planning
by: Ma, Xiaoyu, et al.
Published: (2026)

SENTINEL: A Multi-Level Formal Framework for Safety Evaluation of Foundation Model-based Embodied Agents
by: Zhan, Simon Sinong, et al.
Published: (2025)

The Great March 100: 100 Detail-oriented Tasks for Evaluating Embodied AI Agents
by: Wang, Ziyu, et al.
Published: (2026)

To Think or Not to Think: Exploring the Unthinking Vulnerability in Large Reasoning Models
by: Zhu, Zihao, et al.
Published: (2025)

MAGMA: A Multi-Graph based Agentic Memory Architecture for AI Agents
by: Jiang, Dongming, et al.
Published: (2026)

Automatic Cognitive Task Generation for In-Situ Evaluation of Embodied Agents
by: He, Xinyi, et al.
Published: (2026)

Physical Reasoning and Object Planning for Household Embodied Agents
by: Agrawal, Ayush, et al.
Published: (2023)

Plan Verification for LLM-Based Embodied Task Completion Agents
by: Hariharan, Ananth, et al.
Published: (2025)

SDA-PLANNER: State-Dependency Aware Adaptive Planner for Embodied Task Planning
by: Shen, Zichao, et al.
Published: (2025)

Reliable Poisoned Sample Detection against Backdoor Attacks Enhanced by Sharpness Aware Minimization
by: Zhang, Mingda, et al.
Published: (2024)

EmbodiSkill: Skill-Aware Reflection for Self-Evolving Embodied Agents
by: Ju, Ruofei, et al.
Published: (2026)

Towards Objectively Benchmarking Social Intelligence for Language Agents at Action Level
by: Wang, Chenxu, et al.
Published: (2024)

CycloneMAE: A Scalable Multi-Task Learning Model for Global Tropical Cyclone Probabilistic Forecasting
by: Hang, Renlong, et al.
Published: (2026)

Context-Aware Planning and Environment-Aware Memory for Instruction Following Embodied Agents
by: Kim, Byeonghwi, et al.
Published: (2023)

FauForensics: Boosting Audio-Visual Deepfake Detection with Facial Action Units
by: Wang, Jian, et al.
Published: (2025)

SafeScientist: Toward Risk-Aware Scientific Discoveries by LLM Agents
by: Zhu, Kunlun, et al.
Published: (2025)

Describe, Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task Agents
by: Wang, Zihao, et al.
Published: (2023)

SocialNav: Training Human-Inspired Foundation Model for Socially-Aware Embodied Navigation
by: Chen, Ziyi, et al.
Published: (2025)

Embodied AI Agents: Modeling the World
by: Fung, Pascale, et al.
Published: (2025)

SafeAgentBench: A Benchmark for Safe Task Planning of Embodied LLM Agents
by: Yin, Sheng, et al.
Published: (2024)

LLM3:Large Language Model-based Task and Motion Planning with Motion Failure Reasoning
by: Wang, Shu, et al.
Published: (2024)

Towards Responsible Generative AI: A Reference Architecture for Designing Foundation Model based Agents
by: Lu, Qinghua, et al.
Published: (2023)

Training Cross-Morphology Embodied AI Agents: From Practical Challenges to Theoretical Foundations
by: Liu, Shaoshan, et al.
Published: (2025)

TPS-Bench: Evaluating AI Agents' Tool Planning \& Scheduling Abilities in Compounding Tasks
by: Xu, Hanwen, et al.
Published: (2025)