| Main Authors: | Wu, Xiaoran, Huang, Zien, Yu, Chonghan |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2410.14715 |
Similar Items
VTG-GPT: Tuning-Free Zero-Shot Video Temporal Grounding with GPT
by: Xu, Yifang, et al.
Published: (2024)
GenFusion: Closing the Loop between Reconstruction and Generation via Videos
by: Wu, Sibo, et al.
Published: (2025)
EvAnimate: Event-conditioned Image-to-Video Generation for Human Animation
by: Qu, Qiang, et al.
Published: (2025)
GPTSee: Enhancing Moment Retrieval and Highlight Detection via Description-Based Similarity Features
by: Sun, Yunzhuo, et al.
Published: (2024)
MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model
by: Niu, Muyao, et al.
Published: (2024)
MIRAGE: A Multi-modal Benchmark for Spatial Perception, Reasoning, and Intelligence
by: Liu, Chonghan, et al.
Published: (2025)
StableAnimator++: Overcoming Pose Misalignment and Face Distortion for Human Image Animation
by: Tu, Shuyuan, et al.
Published: (2025)
EverAnimate: Minute-Scale Human Animation via Latent Flow Restoration
by: Li, Wuyang, et al.
Published: (2026)
StableAnimator: High-Quality Identity-Preserving Human Image Animation
by: Tu, Shuyuan, et al.
Published: (2024)
LTOS: Layout-controllable Text-Object Synthesis via Adaptive Cross-attention Fusions
by: Zhao, Xiaoran, et al.
Published: (2024)
Embedded Representation Learning Network for Animating Styled Video Portrait
by: Wang, Tianyong, et al.
Published: (2024)
Multi-identity Human Image Animation with Structural Video Diffusion
by: Wang, Zhenzhi, et al.
Published: (2025)
Taming Hallucinations: Boosting MLLMs' Video Understanding via Counterfactual Video Generation
by: Huang, Zhe, et al.
Published: (2025)
LoopAnimate: Loopable Salient Object Animation
by: Wang, Fanyi, et al.
Published: (2024)
Spotlighting Partially Visible Cinematic Language for Video-to-Audio Generation via Self-distillation
by: Huang, Feizhen, et al.
Published: (2025)
iHuman: Instant Animatable Digital Humans From Monocular Videos
by: Paudel, Pramish, et al.
Published: (2024)
OmniAvatar: Efficient Audio-Driven Avatar Video Generation with Adaptive Body Animation
by: Gan, Qijun, et al.
Published: (2025)
Representing Animatable Avatar via Factorized Neural Fields
by: Song, Chunjin, et al.
Published: (2024)
VideoAR: Autoregressive Video Generation via Next-Frame & Scale Prediction
by: Ji, Longbin, et al.
Published: (2026)
Enhanced Convolutional Neural Networks for Improved Image Classification
by: Yang, Xiaoran, et al.
Published: (2025)
LHM: Large Animatable Human Reconstruction Model from a Single Image in Seconds
by: Qiu, Lingteng, et al.
Published: (2025)
Animate Your Thoughts: Decoupled Reconstruction of Dynamic Natural Vision from Slow Brain Activity
by: Lu, Yizhuo, et al.
Published: (2024)
AniGS: Animatable Gaussian Avatar from a Single Image with Inconsistent Gaussian Reconstruction
by: Qiu, Lingteng, et al.
Published: (2024)
Learning Spectral Diffusion Prior for Hyperspectral Image Reconstruction
by: Yu, Mingyang, et al.
Published: (2025)
Is Visual Realism Enough? Evaluating Gait Biometric Fidelity in Generative AI Human Animation
by: DeAndres-Tame, Ivan, et al.
Published: (2025)
ILDiff: Generate Transparent Animated Stickers by Implicit Layout Distillation
by: Zhang, Ting, et al.
Published: (2024)
Scientific Image Synthesis: Benchmarking, Methodologies, and Downstream Utility
by: Lin, Honglin, et al.
Published: (2026)
Zero-shot High-fidelity and Pose-controllable Character Animation
by: Zhu, Bingwen, et al.
Published: (2024)
Implicit Preference Alignment for Human Image Animation
by: Wang, Yuanzhi, et al.
Published: (2026)
PhysDreamer: Physics-Based Interaction with 3D Objects via Video Generation
by: Zhang, Tianyuan, et al.
Published: (2024)
Generative AI for Cel-Animation: A Survey
by: Tang, Yolo Y., et al.
Published: (2025)
EA-RAS: Towards Efficient and Accurate End-to-End Reconstruction of Anatomical Skeleton
by: Peng, Zhiheng, et al.
Published: (2024)
VideoMaMa: Mask-Guided Video Matting via Generative Prior
by: Lim, Sangbeom, et al.
Published: (2026)
TDMM-LM: Bridging Facial Understanding and Animation via Language Models
by: Song, Luchuan, et al.
Published: (2026)
Progressive Image Restoration via Text-Conditioned Video Generation
by: Kang, Peng, et al.
Published: (2025)
RealGeneral: Unifying Visual Generation via Temporal In-Context Learning with Video Models
by: Lin, Yijing, et al.
Published: (2025)
Detecting AI-Generated Video via Frame Consistency
by: Ma, Long, et al.
Published: (2024)
Generative Animations: A Multi-Model Pipeline for Prompt-Driven Motion Synthesis
by: Khurana, Mannat, et al.
Published: (2026)
Synergistic Global-space Camera and Human Reconstruction from Videos
by: Zhao, Yizhou, et al.
Published: (2024)
Animate Any Character in Any World
by: Wang, Yitong, et al.
Published: (2025)