| Main authors: | Lu, Xiaoya; Zhou, Yijin; Chen, Zeren; Wang, Ruocheng; Sima, Bingrui; Zhou, Enshen; Sheng, Lu; Liu, Dongrui; Shao, Jing |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Online access: | https://arxiv.org/abs/2603.14367 |
Similar items
IS-Bench: Evaluating Interactive Safety of VLM-Driven Embodied Agents in Daily Household Tasks
by: Lu, Xiaoya, et al.
Published: (2025)
INFA-Guard: Mitigating Malicious Propagation via Infection-Aware Safeguarding in LLM-Based Multi-Agent Systems
by: Zhou, Yijin, et al.
Published: (2026)
Geometrically-Constrained Agent for Spatial Reasoning
by: Chen, Zeren, et al.
Published: (2025)
ProGuard: Towards Proactive Multimodal Safeguard
by: Yu, Shaohan, et al.
Published: (2025)
MP5: A Multi-modal Open-ended Embodied System in Minecraft via Active Perception
by: Qin, Yiran, et al.
Published: (2023)
Systematic Reward Gap Optimization for Mitigating VLM Hallucinations
by: He, Lehan, et al.
Published: (2024)
X-Boundary: Establishing Exact Safety Boundary to Shield LLMs from Multi-Turn Jailbreaks without Compromising Usability
by: Lu, Xiaoya, et al.
Published: (2025)
LlavaGuard: An Open VLM-based Framework for Safeguarding Vision Datasets and Models
by: Helff, Lukas, et al.
Published: (2024)
LLMs Deceive Unintentionally: Emergent Misalignment in Dishonesty from Misaligned Samples to Biased Human-AI Interactions
by: Hu, Xuhao, et al.
Published: (2025)
MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated-World Control
by: Zhou, Enshen, et al.
Published: (2024)
RH20T-P: A Primitive-Level Robotic Dataset Towards Composable Generalization Agents
by: Chen, Zeren, et al.
Published: (2024)
VLM-Guard: Safeguarding Vision-Language Models via Fulfilling Safety Alignment Gap
by: Liu, Qin, et al.
Published: (2025)
Self-Guard: Empower the LLM to Safeguard Itself
by: Wang, Zezhong, et al.
Published: (2023)
GuardDoor: Safeguarding Against Malicious Diffusion Editing via Protective Backdoors
by: Zeng, Yaopei, et al.
Published: (2025)
CrossGuard: Safeguarding MLLMs against Joint-Modal Implicit Malicious Attacks
by: Zhang, Xu, et al.
Published: (2025)
Prune4Web: DOM Tree Pruning Programming for Web Agent
by: Zhang, Jiayuan, et al.
Published: (2025)
RiOSWorld: Benchmarking the Risk of Multimodal Computer-Use Agents
by: Yang, Jingyi, et al.
Published: (2025)
GuardReasoner: Towards Reasoning-based LLM Safeguards
by: Liu, Yue, et al.
Published: (2025)
GrandGuard: Taxonomy, Benchmark, and Safeguards for Elderly-Chatbot Interaction Safety
by: Fan, Changxuan, et al.
Published: (2026)
HELP: Hierarchical Embodied Language Planner for Household Tasks
by: Korchemnyi, Alexandr V., et al.
Published: (2025)
VisCRA: A Visual Chain Reasoning Attack for Jailbreaking Multimodal Large Language Models
by: Sima, Bingrui, et al.
Published: (2025)
Octavius: Mitigating Task Interference in MLLMs via LoRA-MoE
by: Chen, Zeren, et al.
Published: (2023)
TrinityGuard: A Unified Framework for Safeguarding Multi-Agent Systems
by: Wang, Kai, et al.
Published: (2026)
PropGuard: Safeguarding LLM-MAS via Propagation-Aware Exploration and Remediation
by: Yan, Bingyu, et al.
Published: (2026)
SafeAgent: Safeguarding LLM Agents via an Automated Risk Simulator
by: Zhou, Xueyang, et al.
Published: (2025)
ShieldVLM: Safeguarding the Multimodal Implicit Toxicity via Deliberative Reasoning with LVLMs
by: Cui, Shiyao, et al.
Published: (2025)
Growth characteristics of Dahurian larch (Larix gmelinii) in northeast China during 1965-2015
by: Jia, Bingrui, et al.
Published: (2017)
Frontier AI Risk Management Framework in Practice: A Risk Analysis Technical Report v1.5
by: Liu, Dongrui, et al.
Published: (2026)
Code-as-Monitor: Constraint-aware Visual Programming for Reactive and Proactive Robotic Failure Detection
by: Zhou, Enshen, et al.
Published: (2024)
LLMs know their vulnerabilities: Uncover Safety Gaps through Natural Distribution Shifts
by: Ren, Qibing, et al.
Published: (2024)
GLiGuard: Schema-Conditioned Classification for LLM Safeguard
by: Zaratiana, Urchade, et al.
Published: (2026)
Brokerage and patronage: Regional chambers of commerce and firm subsidies in China
by: Zeren Li, et al.
Published: (2025)
AlignBot: Aligning VLM-powered Customized Task Planning with User Reminders Through Fine-Tuning for Household Robots
by: Zhaxizhuoma, Zhaxizhuoma, et al.
Published: (2024)
ADVEDM: Fine-grained Adversarial Attack against VLM-based Embodied Agents
by: Wang, Yichen, et al.
Published: (2025)
Risks of AI Scientists: Prioritizing Safeguarding Over Autonomy
by: Tang, Xiangru, et al.
Published: (2024)
TIGeR: Tool-Integrated Geometric Reasoning in Vision-Language Models for Robotics
by: Han, Yi, et al.
Published: (2025)
COLLEAGUE.SKILL: Automated AI Skill Generation via Expert Knowledge Distillation
by: Zhou, Tianyi, et al.
Published: (2026)
MindPower: Enabling Theory-of-Mind Reasoning in VLM-based Embodied Agents
by: Zhang, Ruoxuan, et al.
Published: (2025)
LLM-Empowered Embodied Agent for Memory-Augmented Task Planning in Household Robotics
by: Glocker, Marc, et al.
Published: (2025)
Compromising Embodied Agents with Contextual Backdoor Attacks
by: Liu, Aishan, et al.
Published: (2024)