Saved in:
| Main Authors: | X, Tencent Robotics, Team, HY Vision, :, Yu, Xumin, Liu, Zuyan, Wang, Ziyi, Zhang, He, Rao, Yongming, Liu, Fangfu, Zhang, Yani, Zhao, Ruowen, Wang, Oran, Liang, Yves, Lin, Haitao, Wang, Minghui, Dong, Yubo, Cheng, Kevin, Ni, Bolin, Huang, Rui, Hu, Han, Zhang, Zhengyou, Linus, Yao, Shunyu |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.07430 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
HY-WU (Part I): An Extensible Functional Neural Memory Framework and An Instantiation in Text-Guided Image Editing
by: Tencent HY Team
Published: (2026)
by: Tencent HY Team
Published: (2026)
AniMatrix: An Anime Video Generation Model that Thinks in Art, Not Physics
by: Tencent HY Team
Published: (2026)
by: Tencent HY Team
Published: (2026)
Script-a-Video: Deep Structured Audio-visual Captions via Factorized Streams and Relational Grounding
by: Tencent Hunyuan Team
Published: (2026)
by: Tencent Hunyuan Team
Published: (2026)
GeoVista: Web-Augmented Agentic Visual Reasoning for Geolocalization
by: Wang, Yikun, et al.
Published: (2025)
by: Wang, Yikun, et al.
Published: (2025)
Vision Generalist Model: A Survey
by: Wang, Ziyi, et al.
Published: (2025)
by: Wang, Ziyi, et al.
Published: (2025)
EARBench: Towards Evaluating Physical Risk Awareness for Task Planning of Foundation Model-based Embodied AI Agents
by: Zhu, Zihao, et al.
Published: (2024)
by: Zhu, Zihao, et al.
Published: (2024)
SparseMM: Head Sparsity Emerges from Visual Concept Responses in MLLMs
by: Wang, Jiahui, et al.
Published: (2025)
by: Wang, Jiahui, et al.
Published: (2025)
Physics3D: Learning Physical Properties of 3D Gaussians via Video Diffusion
by: Liu, Fangfu, et al.
Published: (2024)
by: Liu, Fangfu, et al.
Published: (2024)
BadRobot: Jailbreaking Embodied LLMs in the Physical World
by: Zhang, Hangtao, et al.
Published: (2024)
by: Zhang, Hangtao, et al.
Published: (2024)
866σ CERN IRR: Relational Time + Anti-Hallucination KI
by: IRR_Vision
Published: (2025)
by: IRR_Vision
Published: (2025)
EmbodiedBrain: Expanding Performance Boundaries of Task Planning for Embodied Intelligence
by: Zou, Ding, et al.
Published: (2025)
by: Zou, Ding, et al.
Published: (2025)
EmbodiSkill: Skill-Aware Reflection for Self-Evolving Embodied Agents
by: Ju, Ruofei, et al.
Published: (2026)
by: Ju, Ruofei, et al.
Published: (2026)
HY-World 2.0: A Multi-Modal World Model for Reconstructing, Generating, and Simulating 3D Worlds
by: HY-World, Team, et al.
Published: (2026)
by: HY-World, Team, et al.
Published: (2026)
Embodied Agent Interface: Benchmarking LLMs for Embodied Decision Making
by: Li, Manling, et al.
Published: (2024)
by: Li, Manling, et al.
Published: (2024)
Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for Embodied Interactive Tasks
by: Zhang, Wenqi, et al.
Published: (2025)
by: Zhang, Wenqi, et al.
Published: (2025)
Embodied Navigation Foundation Model
by: Zhang, Jiazhao, et al.
Published: (2025)
by: Zhang, Jiazhao, et al.
Published: (2025)
TrackVLA: Embodied Visual Tracking in the Wild
by: Wang, Shaoan, et al.
Published: (2025)
by: Wang, Shaoan, et al.
Published: (2025)
EmbodiedGen: Towards a Generative 3D World Engine for Embodied Intelligence
by: Wang, Xinjie, et al.
Published: (2025)
by: Wang, Xinjie, et al.
Published: (2025)
Embodied3DBench: Benchmarking Low-Level Embodied Spatial Intelligence of Vision Language Models
by: Zhang, Jiyao, et al.
Published: (2026)
by: Zhang, Jiyao, et al.
Published: (2026)
ADVEDM:Fine-grained Adversarial Attack against VLM-based Embodied Agents
by: Wang, Yichen, et al.
Published: (2025)
by: Wang, Yichen, et al.
Published: (2025)
Embodied Science: Closing the Discovery Loop with Agentic Embodied AI
by: Zhuang, Xiang, et al.
Published: (2026)
by: Zhuang, Xiang, et al.
Published: (2026)
Embodied Image Compression
by: Li, Chunyi, et al.
Published: (2025)
by: Li, Chunyi, et al.
Published: (2025)
KUNPENG: An Embodied Large Model for Intelligent Maritime
by: Wang, Naiyao, et al.
Published: (2024)
by: Wang, Naiyao, et al.
Published: (2024)
ESearch-R1: Learning Cost-Aware MLLM Agents for Interactive Embodied Search via Reinforcement Learning
by: Zhou, Weijie, et al.
Published: (2025)
by: Zhou, Weijie, et al.
Published: (2025)
Lifelong Embodied Navigation Learning
by: Wang, Xudong, et al.
Published: (2026)
by: Wang, Xudong, et al.
Published: (2026)
HY3D-Bench: Generation of 3D Assets
by: Hunyuan3D, Team, et al.
Published: (2026)
by: Hunyuan3D, Team, et al.
Published: (2026)
EmbodiedPlace: Learning Mixture-of-Features with Embodied Constraints for Visual Place Recognition
by: Liu, Bingxi, et al.
Published: (2025)
by: Liu, Bingxi, et al.
Published: (2025)
EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents
by: Yang, Rui, et al.
Published: (2025)
by: Yang, Rui, et al.
Published: (2025)
Insight-V++: Towards Advanced Long-Chain Visual Reasoning with Multimodal Large Language Models
by: Dong, Yuhao, et al.
Published: (2026)
by: Dong, Yuhao, et al.
Published: (2026)
Embodied Tree of Thoughts: Deliberate Manipulation Planning with Embodied World Model
by: Xu, Wenjiang, et al.
Published: (2025)
by: Xu, Wenjiang, et al.
Published: (2025)
Compromising Embodied Agents with Contextual Backdoor Attacks
by: Liu, Aishan, et al.
Published: (2024)
by: Liu, Aishan, et al.
Published: (2024)
AGENTSAFE: Benchmarking the Safety of Embodied Agents on Hazardous Instructions
by: Ying, Zonghao, et al.
Published: (2025)
by: Ying, Zonghao, et al.
Published: (2025)
Ella: Embodied Social Agents with Lifelong Memory
by: Zhang, Hongxin, et al.
Published: (2025)
by: Zhang, Hongxin, et al.
Published: (2025)
Universal Actions for Enhanced Embodied Foundation Models
by: Zheng, Jinliang, et al.
Published: (2025)
by: Zheng, Jinliang, et al.
Published: (2025)
Large Model Enabled Embodied Intelligence for 6G Integrated Perception, Communication, and Computation Network
by: Li, Zhuoran, et al.
Published: (2025)
by: Li, Zhuoran, et al.
Published: (2025)
EmbodiedOcc++: Boosting Embodied 3D Occupancy Prediction with Plane Regularization and Uncertainty Sampler
by: Wang, Hao, et al.
Published: (2025)
by: Wang, Hao, et al.
Published: (2025)
Molecular Association Chemistry Enables High‐Voltage Fast‐Charging Lithium Batteries
by: Junfeng Huang, et al.
Published: (2025)
by: Junfeng Huang, et al.
Published: (2025)
Embodied Arena: A Comprehensive, Unified, and Evolving Evaluation Platform for Embodied AI
by: Ni, Fei, et al.
Published: (2025)
by: Ni, Fei, et al.
Published: (2025)
Toward Embodied AGI: A Review of Embodied AI and the Road Ahead
by: Wang, Yequan, et al.
Published: (2025)
by: Wang, Yequan, et al.
Published: (2025)
ERA: Transforming VLMs into Embodied Agents via Embodied Prior Learning and Online Reinforcement Learning
by: Chen, Hanyang, et al.
Published: (2025)
by: Chen, Hanyang, et al.
Published: (2025)
Similar Items
-
HY-WU (Part I): An Extensible Functional Neural Memory Framework and An Instantiation in Text-Guided Image Editing
by: Tencent HY Team
Published: (2026) -
AniMatrix: An Anime Video Generation Model that Thinks in Art, Not Physics
by: Tencent HY Team
Published: (2026) -
Script-a-Video: Deep Structured Audio-visual Captions via Factorized Streams and Relational Grounding
by: Tencent Hunyuan Team
Published: (2026) -
GeoVista: Web-Augmented Agentic Visual Reasoning for Geolocalization
by: Wang, Yikun, et al.
Published: (2025) -
Vision Generalist Model: A Survey
by: Wang, Ziyi, et al.
Published: (2025)