:: Library Catalog

Bibliographic Details
Main Authors:	Wu, Jingchao; Kang, Zejian; Liu, Haibo; Fei, Yuanchen; Huang, Xiangru
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2512.11321

Similar Items

AudioFace: Language-Assisted Speech-Driven Facial Animation with Multimodal Language Models
by: Zheng, Kai, et al.
Published: (2026)

SemanticFace: Semantic Facial Action Estimation via Semantic Distillation in Interpretable Space
by: Kang, Zejian, et al.
Published: (2026)

SuperFace: Preference-Aligned Facial Expression Estimation Beyond Pseudo Supervision
by: Kang, Zejian, et al.
Published: (2026)

KSDiff: Keyframe-Augmented Speech-Aware Dual-Path Diffusion for Facial Animation
by: Lyu, Tianle, et al.
Published: (2025)

KeyVID: Keyframe-Aware Video Diffusion for Audio-Synchronized Visual Animation
by: Wang, Xingrui, et al.
Published: (2025)

Placing Human Animations into 3D Scenes by Learning Interaction- and Geometry-Driven Keyframes
by: Mullen Jr, James F., et al.
Published: (2022)

Exploring the Role of Synthetic Data Augmentation in Controllable Human-Centric Video Generation
by: Fei, Yuanchen, et al.
Published: (2026)

Occlusion-Aware Physics-Semantic Keyframe Selection for Robust Video Editing
by: Liu, Lin, et al.
Published: (2026)

DanceCamAnimator: Keyframe-Based Controllable 3D Dance Camera Synthesis
by: Wang, Zixuan, et al.
Published: (2024)

Learning Semantic Facial Descriptors for Accurate Face Animation
by: Zhu, Lei, et al.
Published: (2025)

Coarse-to-Fine 3D Keyframe Transporter
by: Zhu, Xupeng, et al.
Published: (2025)

Controllable Human-centric Keyframe Interpolation with Generative Prior
by: Guo, Zujin, et al.
Published: (2025)

The devil is in the details: Enhancing Video Virtual Try-On via Keyframe-Driven Details Injection
by: He, Qingdong, et al.
Published: (2025)

CineVerse: Consistent Keyframe Synthesis for Cinematic Scene Composition
by: Phung, Quynh, et al.
Published: (2025)

Threading Keyframe with Narratives: MLLMs as Strong Long Video Comprehenders
by: Fang, Bo, et al.
Published: (2025)

KFFocus: Highlighting Keyframes for Enhanced Video Understanding
by: Nie, Ming, et al.
Published: (2025)

Agentic Keyframe Search for Video Question Answering
by: Fan, Sunqi, et al.
Published: (2025)

SKIP: Sparse Keyframe Interpolation Paradigm for Efficient Embodied World Models
by: He, Ziheng, et al.
Published: (2026)

Generative Motion Infilling From Imprecisely Timed Keyframes
by: Goel, Purvi, et al.
Published: (2025)

KS-APR: Keyframe Selection for Robust Absolute Pose Regression
by: Liu, Changkun, et al.
Published: (2023)

Keyframe-Based Feed-Forward Visual Odometry
by: Dai, Weichen, et al.
Published: (2026)

Large Model based Sequential Keyframe Extraction for Video Summarization
by: Tan, Kailong, et al.
Published: (2024)

Less is More: Improving Motion Diffusion Models with Sparse Keyframes
by: Bae, Jinseok, et al.
Published: (2025)

SignSparK: Efficient Multilingual Sign Language Production via Sparse Keyframe Learning
by: Low, Jianhe, et al.
Published: (2026)

Compact Keyframe-Optimized Multi-Agent Gaussian Splatting SLAM
by: Li, Monica M. Q., et al.
Published: (2026)

SparseOIT: Improving Order-Independent Transparency 3DGS via Active Set Method
by: Yang, Wentao, et al.
Published: (2026)

VTAgent: Agentic Keyframe Anchoring for Evidence-Aware Video TextVQA
by: He, Haibin, et al.
Published: (2026)

Range-Agnostic Multi-View Depth Estimation With Keyframe Selection
by: Conti, Andrea, et al.
Published: (2024)

Generative Inbetweening: Adapting Image-to-Video Models for Keyframe Interpolation
by: Wang, Xiaojuan, et al.
Published: (2024)

From Captions to Keyframes: KeyScore for Multimodal Frame Scoring and Video-Language Understanding
by: Lin, Shih-Yao, et al.
Published: (2025)

Decomposing Queries into Tool Calls for Long-Video Keyframe Retrieval
by: Shlapentokh-Rothman, Michal, et al.
Published: (2026)

AdaFlow: Efficient Long Video Editing via Adaptive Attention Slimming And Keyframe Selection
by: Zhang, Shuheng, et al.
Published: (2025)

Adaptive Keyframe Sampling for Long Video Understanding
by: Tang, Xi, et al.
Published: (2025)

FOCUS: Efficient Keyframe Selection for Long Video Understanding
by: Zhu, Zirui, et al.
Published: (2025)

KeyVideoLLM: Towards Large-scale Video Keyframe Selection
by: Liang, Hao, et al.
Published: (2024)

Solving Short-Term Relocalization Problems In Monocular Keyframe Visual SLAM Using Spatial And Semantic Data
by: Kamal, Azmyin Md., et al.
Published: (2024)

SparkVSR: Interactive Video Super-Resolution via Sparse Keyframe Propagation
by: Yu, Jiongze, et al.
Published: (2026)

Less Is More, but Where? Dynamic Token Compression via LLM-Guided Keyframe Prior
by: Li, Yulin, et al.
Published: (2025)

ToonComposer: Streamlining Cartoon Production with Generative Post-Keyframing
by: Li, Lingen, et al.
Published: (2025)

DreamFrame: Enhancing Video Understanding via Automatically Generated QA and Style-Consistent Keyframes
by: Song, Zhende, et al.
Published: (2024)