:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Guo, Ying, Gan, Qijun, Zhang, Yifu, Liu, Jinlai, Hu, Yifei, Xie, Pan, Qian, Dongjun, Zhang, Yu, Li, Ruiqi, Zhang, Yuqi, Lu, Ruibiao, Mei, Xiaofeng, Han, Bo, Yin, Xiang, Peng, Bingyue, Yuan, Zehuan
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2602.08682
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Waver: Wave Your Way to Lifelike Video Generation
by: Zhang, Yifu, et al.
Published: (2025)

InfinityHuman: Towards Long-Term Audio-Driven Human
by: Li, Xiaodi, et al.
Published: (2025)

HumanDiT: Pose-Guided Diffusion Transformer for Long-form Human Motion Video Generation
by: Gan, Qijun, et al.
Published: (2025)

Generative Refinement Networks for Visual Synthesis
by: Han, Jian, et al.
Published: (2026)

Infinity: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis
by: Han, Jian, et al.
Published: (2024)

VC-LLM: Automated Advertisement Video Creation from Raw Footage using Multi-modal LLMs
by: Qian, Dongjun, et al.
Published: (2025)

FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation
by: Zhang, Shilong, et al.
Published: (2025)

OmniAvatar: Efficient Audio-Driven Avatar Video Generation with Adaptive Body Animation
by: Gan, Qijun, et al.
Published: (2025)

EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditions
by: Chen, Zhiyuan, et al.
Published: (2024)

InfinityStar: Unified Spacetime AutoRegressive Modeling for Visual Generation
by: Liu, Jinlai, et al.
Published: (2025)

HLLM: Enhancing Sequential Recommendations via Hierarchical Large Language Models for Item and User Modeling
by: Chen, Junyi, et al.
Published: (2024)

Motion Generation Review: Exploring Deep Learning for Lifelike Animation with Manifold
by: Zhao, Jiayi, et al.
Published: (2024)

Optimal Estimation and Uncertainty Quantification for Stochastic Inverse Problems via Variational Bayesian Methods
by: Song, Ruibiao, et al.
Published: (2025)

UniAvatar: Taming Lifelike Audio-Driven Talking Head Generation with Comprehensive Motion and Lighting Control
by: Sun, Wenzhang, et al.
Published: (2024)

RAP: Real-time Audio-driven Portrait Animation with Video Diffusion Transformer
by: Du, Fangyu, et al.
Published: (2025)

VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time
by: Xu, Sicheng, et al.
Published: (2024)

BindWeave: Subject-Consistent Video Generation via Cross-Modal Integration
by: Li, Zhaoyang, et al.
Published: (2025)

Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction
by: Tian, Keyu, et al.
Published: (2024)

FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds
by: Zhang, Yiming, et al.
Published: (2024)

Audio-Synchronized Visual Animation
by: Zhang, Lin, et al.
Published: (2024)

Graph neural network for colliding particles with an application to sea ice floe modeling
by: Zhu, Ruibiao
Published: (2026)

Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation
by: Sun, Peize, et al.
Published: (2024)

Teller: Real-Time Streaming Audio-Driven Portrait Animation with Autoregressive Motion Generation
by: Zhen, Dingcheng, et al.
Published: (2025)

VASA-3D: Lifelike Audio-Driven Gaussian Head Avatars from a Single Image
by: Xu, Sicheng, et al.
Published: (2025)

HLLM-Creator: Hierarchical LLM-based Personalized Creative Generation
by: Chen, Junyi, et al.
Published: (2025)

OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding?
by: Li, Yifei, et al.
Published: (2025)

AnimaX: Animating the Inanimate in 3D with Joint Video-Pose Diffusion Models
by: Huang, Zehuan, et al.
Published: (2025)

UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformer
by: Wang, Xiang, et al.
Published: (2025)

Goku: Flow Based Video Generative Foundation Models
by: Chen, Shoufa, et al.
Published: (2025)

UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation
by: Wang, Xiang, et al.
Published: (2024)

Role of Decidual Natural Killer Cells in the Pathogenesis of Preeclampsia
by: Shuang Yue, et al.
Published: (2024)

Lifelike Agility and Play in Quadrupedal Robots using Reinforcement Learning and Generative Pre-trained Models
by: Han, Lei, et al.
Published: (2023)

From Mannequin to Human: A Pose-Aware and Identity-Preserving Video Generation Framework for Lifelike Clothing Display
by: Mu, Xiangyu, et al.
Published: (2025)

ALIVE: An Avatar-Lecture Interactive Video Engine with Content-Aware Retrieval for Real-Time Interaction
by: Islam, Md Zabirul, et al.
Published: (2025)

Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
by: Hu, Li, et al.
Published: (2023)

Bridging Your Imagination with Audio-Video Generation via a Unified Director
by: Zhang, Jiaxu, et al.
Published: (2025)

StreamingTalker: Audio-driven 3D Facial Animation with Autoregressive Diffusion Model
by: Yang, Yifan, et al.
Published: (2025)

Every Token Counts: Generalizing 16M Ultra-Long Context in Large Language Models
by: Hu, Xiang, et al.
Published: (2025)

AnimateAnything: Consistent and Controllable Animation for Video Generation
by: Lei, Guojun, et al.
Published: (2024)

Animate Your Motion: Turning Still Images into Dynamic Videos
by: Li, Mingxiao, et al.
Published: (2024)