:: Library Catalog

Image de couverture de livre

Enregistré dans:

Détails bibliographiques
Auteurs principaux:	Tan, Kailong, Zhou, Yuxiang, Xia, Qianchen, Liu, Rui, Chen, Yong
Format:	Preprint
Publié:	2024
Sujets:	Computer Vision and Pattern Recognition
Accès en ligne:	https://arxiv.org/abs/2401.04962
Tags:	Ajouter un tag Pas de tags, Soyez le premier à ajouter un tag!

Documents similaires

Generative Inbetweening: Adapting Image-to-Video Models for Keyframe Interpolation
par: Wang, Xiaojuan, et autres
Publié: (2024)

KeyVideoLLM: Towards Large-scale Video Keyframe Selection
par: Liang, Hao, et autres
Publié: (2024)

Video Summarization with Large Language Models
par: Lee, Min Jung, et autres
Publié: (2025)

KFFocus: Highlighting Keyframes for Enhanced Video Understanding
par: Nie, Ming, et autres
Publié: (2025)

Agentic Keyframe Search for Video Question Answering
par: Fan, Sunqi, et autres
Publié: (2025)

KeyframeFace: Language-Driven Facial Animation via Semantic Keyframes
par: Wu, Jingchao, et autres
Publié: (2025)

AdaFlow: Efficient Long Video Editing via Adaptive Attention Slimming And Keyframe Selection
par: Zhang, Shuheng, et autres
Publié: (2025)

FOCUS: Efficient Keyframe Selection for Long Video Understanding
par: Zhu, Zirui, et autres
Publié: (2025)

VTAgent: Agentic Keyframe Anchoring for Evidence-Aware Video TextVQA
par: He, Haibin, et autres
Publié: (2026)

PRISM: Perceptual Recognition for Identifying Standout Moments in Human-Centric Keyframe Extraction
par: Cakmak, Mert Can, et autres
Publié: (2025)

Minimal Clips, Maximum Salience: Long Video Summarization via Key Moment Extraction
par: Pennec, Galann, et autres
Publié: (2025)

Occlusion-Aware Physics-Semantic Keyframe Selection for Robust Video Editing
par: Liu, Lin, et autres
Publié: (2026)

KeyVID: Keyframe-Aware Video Diffusion for Audio-Synchronized Visual Animation
par: Wang, Xingrui, et autres
Publié: (2025)

Threading Keyframe with Narratives: MLLMs as Strong Long Video Comprehenders
par: Fang, Bo, et autres
Publié: (2025)

KTV: Keyframes and Key Tokens Selection for Efficient Training-Free Video LLMs
par: Song, Baiyang, et autres
Publié: (2026)

10K is Enough: An Ultra-Lightweight Binarized Network for Infrared Small-Target Detection
par: Xin, Biqiao, et autres
Publié: (2025)

From Captions to Keyframes: KeyScore for Multimodal Frame Scoring and Video-Language Understanding
par: Lin, Shih-Yao, et autres
Publié: (2025)

Scaling Up Video Summarization Pretraining with Large Language Models
par: Argaw, Dawit Mureja, et autres
Publié: (2024)

Decomposing Queries into Tool Calls for Long-Video Keyframe Retrieval
par: Shlapentokh-Rothman, Michal, et autres
Publié: (2026)

Where to Focus: Query-Modulated Multimodal Keyframe Selection for Long Video Understanding
par: Wang, Shaoguang, et autres
Publié: (2026)

The devil is in the details: Enhancing Video Virtual Try-On via Keyframe-Driven Details Injection
par: He, Qingdong, et autres
Publié: (2025)

Less is More: Improving Motion Diffusion Models with Sparse Keyframes
par: Bae, Jinseok, et autres
Publié: (2025)

DreamFrame: Enhancing Video Understanding via Automatically Generated QA and Style-Consistent Keyframes
par: Song, Zhende, et autres
Publié: (2024)

Prompts to Summaries: Zero-Shot Language-Guided Video Summarization with Large Language and Video Models
par: Barbara, Mario, et autres
Publié: (2025)

Scene Detection Policies and Keyframe Extraction Strategies for Large-Scale Video Analysis
par: Korolkov, Vasilii
Publié: (2025)

SKIP: Sparse Keyframe Interpolation Paradigm for Efficient Embodied World Models
par: He, Ziheng, et autres
Publié: (2026)

Adaptive Keyframe Sampling for Long Video Understanding
par: Tang, Xi, et autres
Publié: (2025)

Controllable Human-centric Keyframe Interpolation with Generative Prior
par: Guo, Zujin, et autres
Publié: (2025)

Realizing Video Summarization from the Path of Language-based Semantic Understanding
par: Mu, Kuan-Chen, et autres
Publié: (2024)

A Large Language Model for Disaster Structural Reconnaissance Summarization
par: Gao, Yuqing, et autres
Publié: (2026)

CSTA: CNN-based Spatiotemporal Attention for Video Summarization
par: Son, Jaewon, et autres
Publié: (2024)

KS-APR: Keyframe Selection for Robust Absolute Pose Regression
par: Liu, Changkun, et autres
Publié: (2023)

Dense Video Captioning using Graph-based Sentence Summarization
par: Zhang, Zhiwang, et autres
Publié: (2025)

Show, Tell and Summarize: Dense Video Captioning Using Visual Cue Aided Sentence Summarization
par: Zhang, Zhiwang, et autres
Publié: (2025)

AdaRD-key: Adaptive Relevance-Diversity Keyframe Sampling for Long-form Video understanding
par: Zhang, Xian, et autres
Publié: (2025)

Fetal Brain Imaging: A Composite Neural Network Approach for Keyframe Detection in Ultrasound Videos
par: Zamojski, Aleksander, et autres
Publié: (2026)

CineVerse: Consistent Keyframe Synthesis for Cinematic Scene Composition
par: Phung, Quynh, et autres
Publié: (2025)

Generative Motion Infilling From Imprecisely Timed Keyframes
par: Goel, Purvi, et autres
Publié: (2025)

Deep Hashing with Semantic Hash Centers for Image Retrieval
par: Chen, Li, et autres
Publié: (2025)

Video Summarization using Denoising Diffusion Probabilistic Model
par: Shang, Zirui, et autres
Publié: (2024)