:: Library Catalog

Bibliographic Details
Main Authors:	Ali, Muhammad Kashif; Im, Eun Woo; Kim, Dongjin; Kim, Tae Hyun; Gupta, Vivek; Luo, Haonan; Li, Tianrui
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2508.18859

Similar Items

Harnessing Meta-Learning for Improving Full-Frame Video Stabilization
by: Ali, Muhammad Kashif, et al.
Published: (2024)

Self-Aug: Query and Entropy Adaptive Decoding for Large Vision-Language Models
by: Im, Eun Woo, et al.
Published: (2025)

IDF: Iterative Dynamic Filtering Networks for Generalizable Image Denoising
by: Kim, Dongjin, et al.
Published: (2025)

Deep Variational Bayesian Modeling of Haze Degradation Process
by: Im, Eun Woo, et al.
Published: (2024)

Continuous Degradation Modeling via Latent Flow Matching for Real-World Super-Resolution
by: Kim, Hyeonjae, et al.
Published: (2026)

Diffusion-Based sRGB Real Noise Generation via Prompt-Driven Noise Representation Learning
by: Ko, Jaekyun, et al.
Published: (2026)

VidHalluc: Evaluating Temporal Hallucinations in Multimodal Large Language Models for Video Understanding
by: Li, Chaoyu, et al.
Published: (2024)

Rethinking LiDAR Domain Generalization: Single Source as Multiple Density Domains
by: Kim, Jaeyeul, et al.
Published: (2023)

REPrune: Channel Pruning via Kernel Representative Selection
by: Park, Mincheol, et al.
Published: (2024)

Dynamic Full-body Motion Agent with Object Interaction via Blending Pre-trained Modular Controllers
by: Nam, Sanghyeok, et al.
Published: (2026)

LAN: Learning to Adapt Noise for Image Denoising
by: Kim, Changjin, et al.
Published: (2024)

TV-LiVE: Training-Free, Text-Guided Video Editing via Layer Informed Vitality Exploitation
by: Kim, Min-Jung, et al.
Published: (2025)

Sali4Vid: Saliency-Aware Video Reweighting and Adaptive Caption Retrieval for Dense Video Captioning
by: Jeon, MinJu, et al.
Published: (2025)

FrameMind: Frame-Interleaved Video Reasoning via Reinforcement Learning
by: Ge, Haonan, et al.
Published: (2025)

Dense Hand-Object(HO) GraspNet with Full Grasping Taxonomy and Dynamics
by: Cho, Woojin, et al.
Published: (2024)

Learning-based Axial Video Motion Magnification
by: Byung-Ki, Kwon, et al.
Published: (2023)

Event-Based Video Frame Interpolation With Cross-Modal Asymmetric Bidirectional Motion Fields
by: Kim, Taewoo, et al.
Published: (2025)

DiffuseSlide: Training-Free High Frame Rate Video Generation Diffusion
by: Hwang, Geunmin, et al.
Published: (2025)

Watch Video, Catch Keyword: Context-aware Keyword Attention for Moment Retrieval and Highlight Detection
by: Um, Sung Jin, et al.
Published: (2025)

Contrast-Guided Cross-Modal Distillation for Thermal Object Detection
by: Kim, SiWoo, et al.
Published: (2025)

Adversarial Training with OCR Modality Perturbation for Scene-Text Visual Question Answering
by: Shen, Zhixuan, et al.
Published: (2024)

ELITE: Efficient Gaussian Head Avatar from a Monocular Video via Learned Initialization and TEst-time Generative Adaptation
by: Youwang, Kim, et al.
Published: (2026)

Learning Event-guided Exposure-agnostic Video Frame Interpolation via Adaptive Feature Blending
by: Jung, Junsik, et al.
Published: (2025)

UCMNet: Uncertainty-Aware Context Memory Network for Under-Display Camera Image Restoration
by: Kim, Daehyun, et al.
Published: (2026)

Cascaded Diffusion Framework for Probabilistic Coarse-to-Fine Hand Pose Estimation
by: Woo, Taeyun, et al.
Published: (2025)

mEOL: Training-Free Instruction-Guided Multimodal Embedder for Vector Graphics and Image Retrieval
by: Kim, Kyeong Seon, et al.
Published: (2026)

Learning Correlation-aware Aleatoric Uncertainty for 3D Hand Pose Estimation
by: Chae-Yeon, Lee, et al.
Published: (2025)

3DPhysVideo: Consistency-Guided Flow SDE for Video Generation via 3D Scene Reconstruction and Physical Simulation
by: Kim, Hwidong, et al.
Published: (2026)

WWW: A Unified Framework for Explaining What, Where and Why of Neural Networks by Interpretation of Neuron Concepts
by: Ahn, Yong Hyun, et al.
Published: (2024)

Mask-Free Neuron Concept Annotation for Interpreting Neural Networks in Medical Domain
by: Kim, Hyeon Bae, et al.
Published: (2024)

ProDepth: Boosting Self-Supervised Multi-Frame Monocular Depth with Probabilistic Fusion
by: Woo, Sungmin, et al.
Published: (2024)

Frame Guidance: Training-Free Guidance for Frame-Level Control in Video Diffusion Models
by: Jang, Sangwon, et al.
Published: (2025)

Revisiting Learning-based Video Motion Magnification for Real-time Processing
by: Ha, Hyunwoo, et al.
Published: (2024)

Temporal Grounding as a Learning Signal for Referring Video Object Segmentation
by: Lee, Seunghun, et al.
Published: (2025)

Bi-MCQ: Reformulating Vision-Language Alignment for Negation Understanding
by: Kim, Tae Hun, et al.
Published: (2026)

Video Parallel Scaling: Aggregating Diverse Frame Subsets for VideoLLMs
by: Chung, Hyungjin, et al.
Published: (2025)

Beyond the Frame: Generating 360 Panoramic Videos from Perspective Videos
by: Luo, Rundong, et al.
Published: (2025)

Flow4D: Leveraging 4D Voxel Network for LiDAR Scene Flow Estimation
by: Kim, Jaeyeul, et al.
Published: (2024)

FPRF: Feed-Forward Photorealistic Style Transfer of Large-Scale 3D Neural Radiance Fields
by: Kim, GeonU, et al.
Published: (2024)

Balancing Efficiency and Quality: MoEISR for Arbitrary-Scale Image Super-Resolution
by: Oh, Young Jae, et al.
Published: (2023)