:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Wu, Xiaoran, Huang, Zien, Yu, Chonghan
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition Artificial Intelligence
Online Access:	https://arxiv.org/abs/2410.14715
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

VTG-GPT: Tuning-Free Zero-Shot Video Temporal Grounding with GPT
by: Xu, Yifang, et al.
Published: (2024)

GenFusion: Closing the Loop between Reconstruction and Generation via Videos
by: Wu, Sibo, et al.
Published: (2025)

EvAnimate: Event-conditioned Image-to-Video Generation for Human Animation
by: Qu, Qiang, et al.
Published: (2025)

GPTSee: Enhancing Moment Retrieval and Highlight Detection via Description-Based Similarity Features
by: Sun, Yunzhuo, et al.
Published: (2024)

MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model
by: Niu, Muyao, et al.
Published: (2024)

MIRAGE: A Multi-modal Benchmark for Spatial Perception, Reasoning, and Intelligence
by: Liu, Chonghan, et al.
Published: (2025)

StableAnimator++: Overcoming Pose Misalignment and Face Distortion for Human Image Animation
by: Tu, Shuyuan, et al.
Published: (2025)

EverAnimate: Minute-Scale Human Animation via Latent Flow Restoration
by: Li, Wuyang, et al.
Published: (2026)

StableAnimator: High-Quality Identity-Preserving Human Image Animation
by: Tu, Shuyuan, et al.
Published: (2024)

LTOS: Layout-controllable Text-Object Synthesis via Adaptive Cross-attention Fusions
by: Zhao, Xiaoran, et al.
Published: (2024)

Embedded Representation Learning Network for Animating Styled Video Portrait
by: Wang, Tianyong, et al.
Published: (2024)

Multi-identity Human Image Animation with Structural Video Diffusion
by: Wang, Zhenzhi, et al.
Published: (2025)

Taming Hallucinations: Boosting MLLMs' Video Understanding via Counterfactual Video Generation
by: Huang, Zhe, et al.
Published: (2025)

LoopAnimate: Loopable Salient Object Animation
by: Wang, Fanyi, et al.
Published: (2024)

Spotlighting Partially Visible Cinematic Language for Video-to-Audio Generation via Self-distillation
by: Huang, Feizhen, et al.
Published: (2025)

iHuman: Instant Animatable Digital Humans From Monocular Videos
by: Paudel, Pramish, et al.
Published: (2024)

OmniAvatar: Efficient Audio-Driven Avatar Video Generation with Adaptive Body Animation
by: Gan, Qijun, et al.
Published: (2025)

Representing Animatable Avatar via Factorized Neural Fields
by: Song, Chunjin, et al.
Published: (2024)

VideoAR: Autoregressive Video Generation via Next-Frame & Scale Prediction
by: Ji, Longbin, et al.
Published: (2026)

Enhanced Convolutional Neural Networks for Improved Image Classification
by: Yang, Xiaoran, et al.
Published: (2025)

LHM: Large Animatable Human Reconstruction Model from a Single Image in Seconds
by: Qiu, Lingteng, et al.
Published: (2025)

Animate Your Thoughts: Decoupled Reconstruction of Dynamic Natural Vision from Slow Brain Activity
by: Lu, Yizhuo, et al.
Published: (2024)

AniGS: Animatable Gaussian Avatar from a Single Image with Inconsistent Gaussian Reconstruction
by: Qiu, Lingteng, et al.
Published: (2024)

Learning Spectral Diffusion Prior for Hyperspectral Image Reconstruction
by: Yu, Mingyang, et al.
Published: (2025)

Is Visual Realism Enough? Evaluating Gait Biometric Fidelity in Generative AI Human Animation
by: DeAndres-Tame, Ivan, et al.
Published: (2025)

ILDiff: Generate Transparent Animated Stickers by Implicit Layout Distillation
by: Zhang, Ting, et al.
Published: (2024)

Scientific Image Synthesis: Benchmarking, Methodologies, and Downstream Utility
by: Lin, Honglin, et al.
Published: (2026)

Zero-shot High-fidelity and Pose-controllable Character Animation
by: Zhu, Bingwen, et al.
Published: (2024)

Implicit Preference Alignment for Human Image Animation
by: Wang, Yuanzhi, et al.
Published: (2026)

PhysDreamer: Physics-Based Interaction with 3D Objects via Video Generation
by: Zhang, Tianyuan, et al.
Published: (2024)

Generative AI for Cel-Animation: A Survey
by: Tang, Yolo Y., et al.
Published: (2025)

EA-RAS: Towards Efficient and Accurate End-to-End Reconstruction of Anatomical Skeleton
by: Peng, Zhiheng, et al.
Published: (2024)

VideoMaMa: Mask-Guided Video Matting via Generative Prior
by: Lim, Sangbeom, et al.
Published: (2026)

TDMM-LM: Bridging Facial Understanding and Animation via Language Models
by: Song, Luchuan, et al.
Published: (2026)

Progressive Image Restoration via Text-Conditioned Video Generation
by: Kang, Peng, et al.
Published: (2025)

RealGeneral: Unifying Visual Generation via Temporal In-Context Learning with Video Models
by: Lin, Yijing, et al.
Published: (2025)

Detecting AI-Generated Video via Frame Consistency
by: Ma, Long, et al.
Published: (2024)

Generative Animations: A Multi-Model Pipeline for Prompt-Driven Motion Synthesis
by: Khurana, Mannat, et al.
Published: (2026)

Synergistic Global-space Camera and Human Reconstruction from Videos
by: Zhao, Yizhou, et al.
Published: (2024)

Animate Any Character in Any World
by: Wang, Yitong, et al.
Published: (2025)