Saved in:
Bibliographic Details
Main Authors: Zhu, Zheng, Wang, Xiaofeng, Zhao, Wangbo, Min, Chen, Li, Bohan, Deng, Nianchen, Dou, Min, Wang, Yuqi, Shi, Botian, Wang, Kai, Zhang, Chi, You, Yang, Zhang, Zhaoxiang, Zhao, Dawei, Xiao, Liang, Zhao, Jian, Lu, Jiwen, Huang, Guan
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2405.03520
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866912672968081408
author Zhu, Zheng
Wang, Xiaofeng
Zhao, Wangbo
Min, Chen
Li, Bohan
Deng, Nianchen
Dou, Min
Wang, Yuqi
Shi, Botian
Wang, Kai
Zhang, Chi
You, Yang
Zhang, Zhaoxiang
Zhao, Dawei
Xiao, Liang
Zhao, Jian
Lu, Jiwen
Huang, Guan
author_facet Zhu, Zheng
Wang, Xiaofeng
Zhao, Wangbo
Min, Chen
Li, Bohan
Deng, Nianchen
Dou, Min
Wang, Yuqi
Shi, Botian
Wang, Kai
Zhang, Chi
You, Yang
Zhang, Zhaoxiang
Zhao, Dawei
Xiao, Liang
Zhao, Jian
Lu, Jiwen
Huang, Guan
contents General world models represent a crucial pathway toward achieving Artificial General Intelligence (AGI), serving as the cornerstone for various applications ranging from virtual environments to decision-making systems. Recently, the emergence of the Sora model has attained significant attention due to its remarkable simulation capabilities, which exhibits an incipient comprehension of physical laws. In this survey, we embark on a comprehensive exploration of the latest advancements in world models. Our analysis navigates through the forefront of generative methodologies in video generation, where world models stand as pivotal constructs facilitating the synthesis of highly realistic visual content. Additionally, we scrutinize the burgeoning field of autonomous-driving world models, meticulously delineating their indispensable role in reshaping transportation and urban mobility. Furthermore, we delve into the intricacies inherent in world models deployed within autonomous agents, shedding light on their profound significance in enabling intelligent interactions within dynamic environmental contexts. At last, we examine challenges and limitations of world models, and discuss their potential future directions. We hope this survey can serve as a foundational reference for the research community and inspire continued innovation. This survey will be regularly updated at: https://github.com/GigaAI-research/General-World-Models-Survey.
format Preprint
id arxiv_https___arxiv_org_abs_2405_03520
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Is Sora a World Simulator? A Comprehensive Survey on General World Models and Beyond
Zhu, Zheng
Wang, Xiaofeng
Zhao, Wangbo
Min, Chen
Li, Bohan
Deng, Nianchen
Dou, Min
Wang, Yuqi
Shi, Botian
Wang, Kai
Zhang, Chi
You, Yang
Zhang, Zhaoxiang
Zhao, Dawei
Xiao, Liang
Zhao, Jian
Lu, Jiwen
Huang, Guan
Computer Vision and Pattern Recognition
General world models represent a crucial pathway toward achieving Artificial General Intelligence (AGI), serving as the cornerstone for various applications ranging from virtual environments to decision-making systems. Recently, the emergence of the Sora model has attained significant attention due to its remarkable simulation capabilities, which exhibits an incipient comprehension of physical laws. In this survey, we embark on a comprehensive exploration of the latest advancements in world models. Our analysis navigates through the forefront of generative methodologies in video generation, where world models stand as pivotal constructs facilitating the synthesis of highly realistic visual content. Additionally, we scrutinize the burgeoning field of autonomous-driving world models, meticulously delineating their indispensable role in reshaping transportation and urban mobility. Furthermore, we delve into the intricacies inherent in world models deployed within autonomous agents, shedding light on their profound significance in enabling intelligent interactions within dynamic environmental contexts. At last, we examine challenges and limitations of world models, and discuss their potential future directions. We hope this survey can serve as a foundational reference for the research community and inspire continued innovation. This survey will be regularly updated at: https://github.com/GigaAI-research/General-World-Models-Survey.
title Is Sora a World Simulator? A Comprehensive Survey on General World Models and Beyond
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2405.03520