Saved in:
Bibliographic Details
Main Authors: Chen, Shuai, Chen, Hao, Bei, Yuanchen, Zhao, Tianyang, Zhou, Zhibo, Huang, Feiran
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2601.08876
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866911374445117440
author Chen, Shuai
Chen, Hao
Bei, Yuanchen
Zhao, Tianyang
Zhou, Zhibo
Huang, Feiran
author_facet Chen, Shuai
Chen, Hao
Bei, Yuanchen
Zhao, Tianyang
Zhou, Zhibo
Huang, Feiran
contents Semantic information in embodied AI is inherently multi-source and multi-stage, making it challenging to fully leverage for achieving stable perception-to-action loops in real-world environments. Early studies have combined manual engineering with deep neural networks, achieving notable progress in specific semantic-related embodied tasks. However, as embodied agents encounter increasingly complex environments and open-ended tasks, the demand for more generalizable and robust semantic processing capabilities has become imperative. Recent advances in foundation models (FMs) address this challenge through their cross-domain generalization abilities and rich semantic priors, reshaping the landscape of embodied AI research. In this survey, we propose the Semantic Lifecycle as a unified framework to characterize the evolution of semantic knowledge within embodied AI driven by foundation models. Departing from traditional paradigms that treat semantic processing as isolated modules or disjoint tasks, our framework offers a holistic perspective that captures the continuous flow and maintenance of semantic knowledge. Guided by this embodied semantic lifecycle, we further analyze and compare recent advances across three key stages: acquisition, representation, and storage. Finally, we summarize existing challenges and outline promising directions for future research.
format Preprint
id arxiv_https___arxiv_org_abs_2601_08876
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle The Semantic Lifecycle in Embodied AI: Acquisition, Representation and Storage via Foundation Models
Chen, Shuai
Chen, Hao
Bei, Yuanchen
Zhao, Tianyang
Zhou, Zhibo
Huang, Feiran
Computer Vision and Pattern Recognition
Semantic information in embodied AI is inherently multi-source and multi-stage, making it challenging to fully leverage for achieving stable perception-to-action loops in real-world environments. Early studies have combined manual engineering with deep neural networks, achieving notable progress in specific semantic-related embodied tasks. However, as embodied agents encounter increasingly complex environments and open-ended tasks, the demand for more generalizable and robust semantic processing capabilities has become imperative. Recent advances in foundation models (FMs) address this challenge through their cross-domain generalization abilities and rich semantic priors, reshaping the landscape of embodied AI research. In this survey, we propose the Semantic Lifecycle as a unified framework to characterize the evolution of semantic knowledge within embodied AI driven by foundation models. Departing from traditional paradigms that treat semantic processing as isolated modules or disjoint tasks, our framework offers a holistic perspective that captures the continuous flow and maintenance of semantic knowledge. Guided by this embodied semantic lifecycle, we further analyze and compare recent advances across three key stages: acquisition, representation, and storage. Finally, we summarize existing challenges and outline promising directions for future research.
title The Semantic Lifecycle in Embodied AI: Acquisition, Representation and Storage via Foundation Models
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2601.08876