Saved in:
| Main Authors: | Wu, Jingchao, Kang, Zejian, Liu, Haibo, Fei, Yuanchen, Huang, Xiangru |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2512.11321 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
AudioFace: Language-Assisted Speech-Driven Facial Animation with Multimodal Language Models
by: Zheng, Kai, et al.
Published: (2026)
by: Zheng, Kai, et al.
Published: (2026)
SemanticFace: Semantic Facial Action Estimation via Semantic Distillation in Interpretable Space
by: Kang, Zejian, et al.
Published: (2026)
by: Kang, Zejian, et al.
Published: (2026)
SuperFace: Preference-Aligned Facial Expression Estimation Beyond Pseudo Supervision
by: Kang, Zejian, et al.
Published: (2026)
by: Kang, Zejian, et al.
Published: (2026)
KSDiff: Keyframe-Augmented Speech-Aware Dual-Path Diffusion for Facial Animation
by: Lyu, Tianle, et al.
Published: (2025)
by: Lyu, Tianle, et al.
Published: (2025)
KeyVID: Keyframe-Aware Video Diffusion for Audio-Synchronized Visual Animation
by: Wang, Xingrui, et al.
Published: (2025)
by: Wang, Xingrui, et al.
Published: (2025)
Placing Human Animations into 3D Scenes by Learning Interaction- and Geometry-Driven Keyframes
by: Mullen Jr, James F., et al.
Published: (2022)
by: Mullen Jr, James F., et al.
Published: (2022)
Exploring the Role of Synthetic Data Augmentation in Controllable Human-Centric Video Generation
by: Fei, Yuanchen, et al.
Published: (2026)
by: Fei, Yuanchen, et al.
Published: (2026)
Occlusion-Aware Physics-Semantic Keyframe Selection for Robust Video Editing
by: Liu, Lin, et al.
Published: (2026)
by: Liu, Lin, et al.
Published: (2026)
DanceCamAnimator: Keyframe-Based Controllable 3D Dance Camera Synthesis
by: Wang, Zixuan, et al.
Published: (2024)
by: Wang, Zixuan, et al.
Published: (2024)
Learning Semantic Facial Descriptors for Accurate Face Animation
by: Zhu, Lei, et al.
Published: (2025)
by: Zhu, Lei, et al.
Published: (2025)
Coarse-to-Fine 3D Keyframe Transporter
by: Zhu, Xupeng, et al.
Published: (2025)
by: Zhu, Xupeng, et al.
Published: (2025)
Controllable Human-centric Keyframe Interpolation with Generative Prior
by: Guo, Zujin, et al.
Published: (2025)
by: Guo, Zujin, et al.
Published: (2025)
The devil is in the details: Enhancing Video Virtual Try-On via Keyframe-Driven Details Injection
by: He, Qingdong, et al.
Published: (2025)
by: He, Qingdong, et al.
Published: (2025)
CineVerse: Consistent Keyframe Synthesis for Cinematic Scene Composition
by: Phung, Quynh, et al.
Published: (2025)
by: Phung, Quynh, et al.
Published: (2025)
Threading Keyframe with Narratives: MLLMs as Strong Long Video Comprehenders
by: Fang, Bo, et al.
Published: (2025)
by: Fang, Bo, et al.
Published: (2025)
KFFocus: Highlighting Keyframes for Enhanced Video Understanding
by: Nie, Ming, et al.
Published: (2025)
by: Nie, Ming, et al.
Published: (2025)
Agentic Keyframe Search for Video Question Answering
by: Fan, Sunqi, et al.
Published: (2025)
by: Fan, Sunqi, et al.
Published: (2025)
SKIP: Sparse Keyframe Interpolation Paradigm for Efficient Embodied World Models
by: He, Ziheng, et al.
Published: (2026)
by: He, Ziheng, et al.
Published: (2026)
Generative Motion Infilling From Imprecisely Timed Keyframes
by: Goel, Purvi, et al.
Published: (2025)
by: Goel, Purvi, et al.
Published: (2025)
KS-APR: Keyframe Selection for Robust Absolute Pose Regression
by: Liu, Changkun, et al.
Published: (2023)
by: Liu, Changkun, et al.
Published: (2023)
Keyframe-Based Feed-Forward Visual Odometry
by: Dai, Weichen, et al.
Published: (2026)
by: Dai, Weichen, et al.
Published: (2026)
Large Model based Sequential Keyframe Extraction for Video Summarization
by: Tan, Kailong, et al.
Published: (2024)
by: Tan, Kailong, et al.
Published: (2024)
Less is More: Improving Motion Diffusion Models with Sparse Keyframes
by: Bae, Jinseok, et al.
Published: (2025)
by: Bae, Jinseok, et al.
Published: (2025)
SignSparK: Efficient Multilingual Sign Language Production via Sparse Keyframe Learning
by: Low, Jianhe, et al.
Published: (2026)
by: Low, Jianhe, et al.
Published: (2026)
Compact Keyframe-Optimized Multi-Agent Gaussian Splatting SLAM
by: Li, Monica M. Q., et al.
Published: (2026)
by: Li, Monica M. Q., et al.
Published: (2026)
SparseOIT: Improving Order-Independent Transparency 3DGS via Active Set Method
by: Yang, Wentao, et al.
Published: (2026)
by: Yang, Wentao, et al.
Published: (2026)
VTAgent: Agentic Keyframe Anchoring for Evidence-Aware Video TextVQA
by: He, Haibin, et al.
Published: (2026)
by: He, Haibin, et al.
Published: (2026)
Range-Agnostic Multi-View Depth Estimation With Keyframe Selection
by: Conti, Andrea, et al.
Published: (2024)
by: Conti, Andrea, et al.
Published: (2024)
Generative Inbetweening: Adapting Image-to-Video Models for Keyframe Interpolation
by: Wang, Xiaojuan, et al.
Published: (2024)
by: Wang, Xiaojuan, et al.
Published: (2024)
From Captions to Keyframes: KeyScore for Multimodal Frame Scoring and Video-Language Understanding
by: Lin, Shih-Yao, et al.
Published: (2025)
by: Lin, Shih-Yao, et al.
Published: (2025)
Decomposing Queries into Tool Calls for Long-Video Keyframe Retrieval
by: Shlapentokh-Rothman, Michal, et al.
Published: (2026)
by: Shlapentokh-Rothman, Michal, et al.
Published: (2026)
AdaFlow: Efficient Long Video Editing via Adaptive Attention Slimming And Keyframe Selection
by: Zhang, Shuheng, et al.
Published: (2025)
by: Zhang, Shuheng, et al.
Published: (2025)
Adaptive Keyframe Sampling for Long Video Understanding
by: Tang, Xi, et al.
Published: (2025)
by: Tang, Xi, et al.
Published: (2025)
FOCUS: Efficient Keyframe Selection for Long Video Understanding
by: Zhu, Zirui, et al.
Published: (2025)
by: Zhu, Zirui, et al.
Published: (2025)
KeyVideoLLM: Towards Large-scale Video Keyframe Selection
by: Liang, Hao, et al.
Published: (2024)
by: Liang, Hao, et al.
Published: (2024)
Solving Short-Term Relocalization Problems In Monocular Keyframe Visual SLAM Using Spatial And Semantic Data
by: Kamal, Azmyin Md., et al.
Published: (2024)
by: Kamal, Azmyin Md., et al.
Published: (2024)
SparkVSR: Interactive Video Super-Resolution via Sparse Keyframe Propagation
by: Yu, Jiongze, et al.
Published: (2026)
by: Yu, Jiongze, et al.
Published: (2026)
Less Is More, but Where? Dynamic Token Compression via LLM-Guided Keyframe Prior
by: Li, Yulin, et al.
Published: (2025)
by: Li, Yulin, et al.
Published: (2025)
ToonComposer: Streamlining Cartoon Production with Generative Post-Keyframing
by: Li, Lingen, et al.
Published: (2025)
by: Li, Lingen, et al.
Published: (2025)
DreamFrame: Enhancing Video Understanding via Automatically Generated QA and Style-Consistent Keyframes
by: Song, Zhende, et al.
Published: (2024)
by: Song, Zhende, et al.
Published: (2024)
Similar Items
-
AudioFace: Language-Assisted Speech-Driven Facial Animation with Multimodal Language Models
by: Zheng, Kai, et al.
Published: (2026) -
SemanticFace: Semantic Facial Action Estimation via Semantic Distillation in Interpretable Space
by: Kang, Zejian, et al.
Published: (2026) -
SuperFace: Preference-Aligned Facial Expression Estimation Beyond Pseudo Supervision
by: Kang, Zejian, et al.
Published: (2026) -
KSDiff: Keyframe-Augmented Speech-Aware Dual-Path Diffusion for Facial Animation
by: Lyu, Tianle, et al.
Published: (2025) -
KeyVID: Keyframe-Aware Video Diffusion for Audio-Synchronized Visual Animation
by: Wang, Xingrui, et al.
Published: (2025)