Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Li, Chengxuan, Huang, Di, Lu, Zeyu, Xiao, Yang, Pei, Qingqi, Bai, Lei
Format: Preprint
Veröffentlicht: 2024
Schlagworte:
Online-Zugang:https://arxiv.org/abs/2403.16407
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
_version_ 1866910381479297024
author Li, Chengxuan
Huang, Di
Lu, Zeyu
Xiao, Yang
Pei, Qingqi
Bai, Lei
author_facet Li, Chengxuan
Huang, Di
Lu, Zeyu
Xiao, Yang
Pei, Qingqi
Bai, Lei
contents Video generation is a rapidly advancing research area, garnering significant attention due to its broad range of applications. One critical aspect of this field is the generation of long-duration videos, which presents unique challenges and opportunities. This paper presents the first survey of recent advancements in long video generation and summarises them into two key paradigms: divide and conquer temporal autoregressive. We delve into the common models employed in each paradigm, including aspects of network design and conditioning techniques. Furthermore, we offer a comprehensive overview and classification of the datasets and evaluation metrics which are crucial for advancing long video generation research. Concluding with a summary of existing studies, we also discuss the emerging challenges and future directions in this dynamic field. We hope that this survey will serve as an essential reference for researchers and practitioners in the realm of long video generation.
format Preprint
id arxiv_https___arxiv_org_abs_2403_16407
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle A Survey on Long Video Generation: Challenges, Methods, and Prospects
Li, Chengxuan
Huang, Di
Lu, Zeyu
Xiao, Yang
Pei, Qingqi
Bai, Lei
Computer Vision and Pattern Recognition
Video generation is a rapidly advancing research area, garnering significant attention due to its broad range of applications. One critical aspect of this field is the generation of long-duration videos, which presents unique challenges and opportunities. This paper presents the first survey of recent advancements in long video generation and summarises them into two key paradigms: divide and conquer temporal autoregressive. We delve into the common models employed in each paradigm, including aspects of network design and conditioning techniques. Furthermore, we offer a comprehensive overview and classification of the datasets and evaluation metrics which are crucial for advancing long video generation research. Concluding with a summary of existing studies, we also discuss the emerging challenges and future directions in this dynamic field. We hope that this survey will serve as an essential reference for researchers and practitioners in the realm of long video generation.
title A Survey on Long Video Generation: Challenges, Methods, and Prospects
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2403.16407