Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Lin, Jiajing, Wang, Zhenzhong, Hou, Yongjie, Tang, Yuzhou, Jiang, Min
Format: Preprint
Veröffentlicht: 2024
Schlagworte:
Online-Zugang:https://arxiv.org/abs/2409.07179
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
_version_ 1866909311434752000
author Lin, Jiajing
Wang, Zhenzhong
Hou, Yongjie
Tang, Yuzhou
Jiang, Min
author_facet Lin, Jiajing
Wang, Zhenzhong
Hou, Yongjie
Tang, Yuzhou
Jiang, Min
contents 4D content generation focuses on creating dynamic 3D objects that change over time. Existing methods primarily rely on pre-trained video diffusion models, utilizing sampling processes or reference videos. However, these approaches face significant challenges. Firstly, the generated 4D content often fails to adhere to real-world physics since video diffusion models do not incorporate physical priors. Secondly, the extensive sampling process and the large number of parameters in diffusion models result in exceedingly time-consuming generation processes. To address these issues, we introduce Phy124, a novel, fast, and physics-driven method for controllable 4D content generation from a single image. Phy124 integrates physical simulation directly into the 4D generation process, ensuring that the resulting 4D content adheres to natural physical laws. Phy124 also eliminates the use of diffusion models during the 4D dynamics generation phase, significantly speeding up the process. Phy124 allows for the control of 4D dynamics, including movement speed and direction, by manipulating external forces. Extensive experiments demonstrate that Phy124 generates high-fidelity 4D content with significantly reduced inference times, achieving stateof-the-art performance. The code and generated 4D content are available at the provided link: https://anonymous.4open.science/r/BBF2/.
format Preprint
id arxiv_https___arxiv_org_abs_2409_07179
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Phy124: Fast Physics-Driven 4D Content Generation from a Single Image
Lin, Jiajing
Wang, Zhenzhong
Hou, Yongjie
Tang, Yuzhou
Jiang, Min
Computer Vision and Pattern Recognition
4D content generation focuses on creating dynamic 3D objects that change over time. Existing methods primarily rely on pre-trained video diffusion models, utilizing sampling processes or reference videos. However, these approaches face significant challenges. Firstly, the generated 4D content often fails to adhere to real-world physics since video diffusion models do not incorporate physical priors. Secondly, the extensive sampling process and the large number of parameters in diffusion models result in exceedingly time-consuming generation processes. To address these issues, we introduce Phy124, a novel, fast, and physics-driven method for controllable 4D content generation from a single image. Phy124 integrates physical simulation directly into the 4D generation process, ensuring that the resulting 4D content adheres to natural physical laws. Phy124 also eliminates the use of diffusion models during the 4D dynamics generation phase, significantly speeding up the process. Phy124 allows for the control of 4D dynamics, including movement speed and direction, by manipulating external forces. Extensive experiments demonstrate that Phy124 generates high-fidelity 4D content with significantly reduced inference times, achieving stateof-the-art performance. The code and generated 4D content are available at the provided link: https://anonymous.4open.science/r/BBF2/.
title Phy124: Fast Physics-Driven 4D Content Generation from a Single Image
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2409.07179