Saved in:
| Main Authors: | Ali, Muhammad Kashif, Im, Eun Woo, Kim, Dongjin, Kim, Tae Hyun, Gupta, Vivek, Luo, Haonan, Li, Tianrui |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2508.18859 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Harnessing Meta-Learning for Improving Full-Frame Video Stabilization
by: Ali, Muhammad Kashif, et al.
Published: (2024)
by: Ali, Muhammad Kashif, et al.
Published: (2024)
Self-Aug: Query and Entropy Adaptive Decoding for Large Vision-Language Models
by: Im, Eun Woo, et al.
Published: (2025)
by: Im, Eun Woo, et al.
Published: (2025)
IDF: Iterative Dynamic Filtering Networks for Generalizable Image Denoising
by: Kim, Dongjin, et al.
Published: (2025)
by: Kim, Dongjin, et al.
Published: (2025)
Deep Variational Bayesian Modeling of Haze Degradation Process
by: Im, Eun Woo, et al.
Published: (2024)
by: Im, Eun Woo, et al.
Published: (2024)
Continuous Degradation Modeling via Latent Flow Matching for Real-World Super-Resolution
by: Kim, Hyeonjae, et al.
Published: (2026)
by: Kim, Hyeonjae, et al.
Published: (2026)
Diffusion-Based sRGB Real Noise Generation via Prompt-Driven Noise Representation Learning
by: Ko, Jaekyun, et al.
Published: (2026)
by: Ko, Jaekyun, et al.
Published: (2026)
VidHalluc: Evaluating Temporal Hallucinations in Multimodal Large Language Models for Video Understanding
by: Li, Chaoyu, et al.
Published: (2024)
by: Li, Chaoyu, et al.
Published: (2024)
Rethinking LiDAR Domain Generalization: Single Source as Multiple Density Domains
by: Kim, Jaeyeul, et al.
Published: (2023)
by: Kim, Jaeyeul, et al.
Published: (2023)
REPrune: Channel Pruning via Kernel Representative Selection
by: Park, Mincheol, et al.
Published: (2024)
by: Park, Mincheol, et al.
Published: (2024)
Dynamic Full-body Motion Agent with Object Interaction via Blending Pre-trained Modular Controllers
by: Nam, Sanghyeok, et al.
Published: (2026)
by: Nam, Sanghyeok, et al.
Published: (2026)
LAN: Learning to Adapt Noise for Image Denoising
by: Kim, Changjin, et al.
Published: (2024)
by: Kim, Changjin, et al.
Published: (2024)
TV-LiVE: Training-Free, Text-Guided Video Editing via Layer Informed Vitality Exploitation
by: Kim, Min-Jung, et al.
Published: (2025)
by: Kim, Min-Jung, et al.
Published: (2025)
Sali4Vid: Saliency-Aware Video Reweighting and Adaptive Caption Retrieval for Dense Video Captioning
by: Jeon, MinJu, et al.
Published: (2025)
by: Jeon, MinJu, et al.
Published: (2025)
FrameMind: Frame-Interleaved Video Reasoning via Reinforcement Learning
by: Ge, Haonan, et al.
Published: (2025)
by: Ge, Haonan, et al.
Published: (2025)
Dense Hand-Object(HO) GraspNet with Full Grasping Taxonomy and Dynamics
by: Cho, Woojin, et al.
Published: (2024)
by: Cho, Woojin, et al.
Published: (2024)
Learning-based Axial Video Motion Magnification
by: Byung-Ki, Kwon, et al.
Published: (2023)
by: Byung-Ki, Kwon, et al.
Published: (2023)
Event-Based Video Frame Interpolation With Cross-Modal Asymmetric Bidirectional Motion Fields
by: Kim, Taewoo, et al.
Published: (2025)
by: Kim, Taewoo, et al.
Published: (2025)
DiffuseSlide: Training-Free High Frame Rate Video Generation Diffusion
by: Hwang, Geunmin, et al.
Published: (2025)
by: Hwang, Geunmin, et al.
Published: (2025)
Watch Video, Catch Keyword: Context-aware Keyword Attention for Moment Retrieval and Highlight Detection
by: Um, Sung Jin, et al.
Published: (2025)
by: Um, Sung Jin, et al.
Published: (2025)
Contrast-Guided Cross-Modal Distillation for Thermal Object Detection
by: Kim, SiWoo, et al.
Published: (2025)
by: Kim, SiWoo, et al.
Published: (2025)
Adversarial Training with OCR Modality Perturbation for Scene-Text Visual Question Answering
by: Shen, Zhixuan, et al.
Published: (2024)
by: Shen, Zhixuan, et al.
Published: (2024)
ELITE: Efficient Gaussian Head Avatar from a Monocular Video via Learned Initialization and TEst-time Generative Adaptation
by: Youwang, Kim, et al.
Published: (2026)
by: Youwang, Kim, et al.
Published: (2026)
Learning Event-guided Exposure-agnostic Video Frame Interpolation via Adaptive Feature Blending
by: Jung, Junsik, et al.
Published: (2025)
by: Jung, Junsik, et al.
Published: (2025)
UCMNet: Uncertainty-Aware Context Memory Network for Under-Display Camera Image Restoration
by: Kim, Daehyun, et al.
Published: (2026)
by: Kim, Daehyun, et al.
Published: (2026)
Cascaded Diffusion Framework for Probabilistic Coarse-to-Fine Hand Pose Estimation
by: Woo, Taeyun, et al.
Published: (2025)
by: Woo, Taeyun, et al.
Published: (2025)
mEOL: Training-Free Instruction-Guided Multimodal Embedder for Vector Graphics and Image Retrieval
by: Kim, Kyeong Seon, et al.
Published: (2026)
by: Kim, Kyeong Seon, et al.
Published: (2026)
Learning Correlation-aware Aleatoric Uncertainty for 3D Hand Pose Estimation
by: Chae-Yeon, Lee, et al.
Published: (2025)
by: Chae-Yeon, Lee, et al.
Published: (2025)
3DPhysVideo: Consistency-Guided Flow SDE for Video Generation via 3D Scene Reconstruction and Physical Simulation
by: Kim, Hwidong, et al.
Published: (2026)
by: Kim, Hwidong, et al.
Published: (2026)
WWW: A Unified Framework for Explaining What, Where and Why of Neural Networks by Interpretation of Neuron Concepts
by: Ahn, Yong Hyun, et al.
Published: (2024)
by: Ahn, Yong Hyun, et al.
Published: (2024)
Mask-Free Neuron Concept Annotation for Interpreting Neural Networks in Medical Domain
by: Kim, Hyeon Bae, et al.
Published: (2024)
by: Kim, Hyeon Bae, et al.
Published: (2024)
ProDepth: Boosting Self-Supervised Multi-Frame Monocular Depth with Probabilistic Fusion
by: Woo, Sungmin, et al.
Published: (2024)
by: Woo, Sungmin, et al.
Published: (2024)
Frame Guidance: Training-Free Guidance for Frame-Level Control in Video Diffusion Models
by: Jang, Sangwon, et al.
Published: (2025)
by: Jang, Sangwon, et al.
Published: (2025)
Revisiting Learning-based Video Motion Magnification for Real-time Processing
by: Ha, Hyunwoo, et al.
Published: (2024)
by: Ha, Hyunwoo, et al.
Published: (2024)
Temporal Grounding as a Learning Signal for Referring Video Object Segmentation
by: Lee, Seunghun, et al.
Published: (2025)
by: Lee, Seunghun, et al.
Published: (2025)
Bi-MCQ: Reformulating Vision-Language Alignment for Negation Understanding
by: Kim, Tae Hun, et al.
Published: (2026)
by: Kim, Tae Hun, et al.
Published: (2026)
Video Parallel Scaling: Aggregating Diverse Frame Subsets for VideoLLMs
by: Chung, Hyungjin, et al.
Published: (2025)
by: Chung, Hyungjin, et al.
Published: (2025)
Beyond the Frame: Generating 360 Panoramic Videos from Perspective Videos
by: Luo, Rundong, et al.
Published: (2025)
by: Luo, Rundong, et al.
Published: (2025)
Flow4D: Leveraging 4D Voxel Network for LiDAR Scene Flow Estimation
by: Kim, Jaeyeul, et al.
Published: (2024)
by: Kim, Jaeyeul, et al.
Published: (2024)
FPRF: Feed-Forward Photorealistic Style Transfer of Large-Scale 3D Neural Radiance Fields
by: Kim, GeonU, et al.
Published: (2024)
by: Kim, GeonU, et al.
Published: (2024)
Balancing Efficiency and Quality: MoEISR for Arbitrary-Scale Image Super-Resolution
by: Oh, Young Jae, et al.
Published: (2023)
by: Oh, Young Jae, et al.
Published: (2023)
Similar Items
-
Harnessing Meta-Learning for Improving Full-Frame Video Stabilization
by: Ali, Muhammad Kashif, et al.
Published: (2024) -
Self-Aug: Query and Entropy Adaptive Decoding for Large Vision-Language Models
by: Im, Eun Woo, et al.
Published: (2025) -
IDF: Iterative Dynamic Filtering Networks for Generalizable Image Denoising
by: Kim, Dongjin, et al.
Published: (2025) -
Deep Variational Bayesian Modeling of Haze Degradation Process
by: Im, Eun Woo, et al.
Published: (2024) -
Continuous Degradation Modeling via Latent Flow Matching for Real-World Super-Resolution
by: Kim, Hyeonjae, et al.
Published: (2026)