Saved in:
| Main Authors: | Tang, Zhiyang, Zhu, Yiming, Huang, Ruimin, Yang, Meng, Ma, Yong, Huang, Jun, Fan, Fan |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2603.21192 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Evaluating the Effect of Compression on Video Temporal Consistency Using Objective Quality Metrics
by: Zsoldos, Peter
Published: (2026)
by: Zsoldos, Peter
Published: (2026)
Digital analysis of early color photographs taken using regular color screen processes
by: Hubička, Jan, et al.
Published: (2023)
by: Hubička, Jan, et al.
Published: (2023)
Person detection and re-identification in open-world settings of retail stores and public spaces
by: Brkljač, Branko, et al.
Published: (2025)
by: Brkljač, Branko, et al.
Published: (2025)
Enhancing rice leaf images: An overview of image denoising techniques
by: Chutia, Rupjyoti, et al.
Published: (2025)
by: Chutia, Rupjyoti, et al.
Published: (2025)
Scene Detection Policies and Keyframe Extraction Strategies for Large-Scale Video Analysis
by: Korolkov, Vasilii
Published: (2025)
by: Korolkov, Vasilii
Published: (2025)
Visual Style Prompt Learning Using Diffusion Models for Blind Face Restoration
by: Lu, Wanglong, et al.
Published: (2024)
by: Lu, Wanglong, et al.
Published: (2024)
FACEMUG: A Multimodal Generative and Fusion Framework for Local Facial Editing
by: Lu, Wanglong, et al.
Published: (2024)
by: Lu, Wanglong, et al.
Published: (2024)
Leum-VL Technical Report
by: He, Yuxuan, et al.
Published: (2026)
by: He, Yuxuan, et al.
Published: (2026)
Transforming faces into video stories -- VideoFace2.0
by: Brkljač, Branko, et al.
Published: (2025)
by: Brkljač, Branko, et al.
Published: (2025)
FeedbackSTS-Det: Sparse Frames-Based Spatio-Temporal Semantic Feedback Network for Moving Infrared Small Target Detection
by: Huang, Yian, et al.
Published: (2026)
by: Huang, Yian, et al.
Published: (2026)
Improving Visual Object Tracking through Visual Prompting
by: Chen, Shih-Fang, et al.
Published: (2024)
by: Chen, Shih-Fang, et al.
Published: (2024)
Enhancing Maritime Object Detection in Real-Time with RT-DETR and Data Augmentation
by: Nemati, Nader
Published: (2025)
by: Nemati, Nader
Published: (2025)
Noisier2Inverse: Self-Supervised Learning for Image Reconstruction with Correlated Noise
by: Gruber, Nadja, et al.
Published: (2025)
by: Gruber, Nadja, et al.
Published: (2025)
Bridging Knowledge Gap Between Image Inpainting and Large-Area Visible Watermark Removal
by: Leng, Yicheng, et al.
Published: (2025)
by: Leng, Yicheng, et al.
Published: (2025)
Two-step Authentication: Multi-biometric System Using Voice and Facial Recognition
by: Chen, Kuan Wei, et al.
Published: (2026)
by: Chen, Kuan Wei, et al.
Published: (2026)
CLIP-Joint-Detect: End-to-End Joint Training of Object Detectors with Contrastive Vision-Language Supervision
by: Raoufi, Behnam, et al.
Published: (2025)
by: Raoufi, Behnam, et al.
Published: (2025)
Optimized $k$-means color quantization of digital images in machine-based and human perception-based colorspaces
by: Maitra, Ranjan
Published: (2026)
by: Maitra, Ranjan
Published: (2026)
Compressive sensing inspired self-supervised single-pixel imaging
by: Lu, Jijun, et al.
Published: (2026)
by: Lu, Jijun, et al.
Published: (2026)
A Roadmap for Multilingual, Multimodal Domain Independent Deception Detection
by: Boumber, Dainis, et al.
Published: (2024)
by: Boumber, Dainis, et al.
Published: (2024)
Understanding Identity Continuity in Thermal Video through Scene-Level Consistency
by: Sun, Wei-Chieh, et al.
Published: (2026)
by: Sun, Wei-Chieh, et al.
Published: (2026)
Automatic Detection of Intro and Credits in Video using CLIP and Multihead Attention
by: Korolkov, Vasilii, et al.
Published: (2025)
by: Korolkov, Vasilii, et al.
Published: (2025)
Image and Video Compression using Generative Sparse Representation with Fidelity Controls
by: Jiang, Wei, et al.
Published: (2024)
by: Jiang, Wei, et al.
Published: (2024)
Seeing The Words: Evaluating AI-generated Biblical Art
by: Makimei, Hidde, et al.
Published: (2025)
by: Makimei, Hidde, et al.
Published: (2025)
Deep Learning-Based Multi-Object Tracking: A Comprehensive Survey from Foundations to State-of-the-Art
by: Adžemović, Momir
Published: (2025)
by: Adžemović, Momir
Published: (2025)
DeepInverse: A Python package for solving imaging inverse problems with deep learning
by: Tachella, Julián, et al.
Published: (2025)
by: Tachella, Julián, et al.
Published: (2025)
IF-D: A High-Frequency, General-Purpose Inertial Foundation Dataset for Self-Supervised Learning
by: Ferreira, Patrick, et al.
Published: (2025)
by: Ferreira, Patrick, et al.
Published: (2025)
Geo2Sound: A Scalable Geo-Aligned Framework for Soundscape Generation from Satellite Imagery
by: Wu, Kunlin, et al.
Published: (2026)
by: Wu, Kunlin, et al.
Published: (2026)
FLD+: Data-efficient Evaluation Metric for Generative Models
by: Jeevan, Pranav, et al.
Published: (2024)
by: Jeevan, Pranav, et al.
Published: (2024)
WaveMixSR-V2: Enhancing Super-resolution with Higher Efficiency
by: Jeevan, Pranav, et al.
Published: (2024)
by: Jeevan, Pranav, et al.
Published: (2024)
Normalizing Flow-Based Metric for Image Generation
by: Jeevan, Pranav, et al.
Published: (2024)
by: Jeevan, Pranav, et al.
Published: (2024)
Fairness Without Labels: Pseudo-Balancing for Bias Mitigation in Face Gender Classification
by: Dong, Haohua, et al.
Published: (2025)
by: Dong, Haohua, et al.
Published: (2025)
BG-YOLO: A Bidirectional-Guided Method for Underwater Object Detection
by: Zhang, Jian, et al.
Published: (2024)
by: Zhang, Jian, et al.
Published: (2024)
Neural Image Compression Using Masked Sparse Visual Representation
by: Jiang, Wei, et al.
Published: (2023)
by: Jiang, Wei, et al.
Published: (2023)
ADD for Multi-Bit Image Watermarking
by: Luo, An, et al.
Published: (2026)
by: Luo, An, et al.
Published: (2026)
Traffic Scene Small Target Detection Method Based on YOLOv8n-SPTS Model for Autonomous Driving
by: Wu, Songhan
Published: (2025)
by: Wu, Songhan
Published: (2025)
The Determinant Ratio Matrix Approach to Solving 3D Matching and 2D Orthographic Projection Alignment Tasks
by: Hanson, Andrew J., et al.
Published: (2025)
by: Hanson, Andrew J., et al.
Published: (2025)
Generative Model-Driven Synthetic Training Image Generation: An Approach to Cognition in Rail Defect Detection
by: Ferdousi, Rahatara, et al.
Published: (2023)
by: Ferdousi, Rahatara, et al.
Published: (2023)
Light Future: Multimodal Action Frame Prediction via InstructPix2Pix
by: Zhong, Zesen, et al.
Published: (2025)
by: Zhong, Zesen, et al.
Published: (2025)
Goal-Oriented Source Coding using LDPC Codes for Compressed-Domain Image Classification
by: Aliouat, Ahcen, et al.
Published: (2025)
by: Aliouat, Ahcen, et al.
Published: (2025)
The Impact of Image Resolution on Face Detection: A Comparative Analysis of MTCNN, YOLOv XI and YOLOv XII models
by: Ömercikoğlu, Ahmet Can, et al.
Published: (2025)
by: Ömercikoğlu, Ahmet Can, et al.
Published: (2025)
Similar Items
-
Evaluating the Effect of Compression on Video Temporal Consistency Using Objective Quality Metrics
by: Zsoldos, Peter
Published: (2026) -
Digital analysis of early color photographs taken using regular color screen processes
by: Hubička, Jan, et al.
Published: (2023) -
Person detection and re-identification in open-world settings of retail stores and public spaces
by: Brkljač, Branko, et al.
Published: (2025) -
Enhancing rice leaf images: An overview of image denoising techniques
by: Chutia, Rupjyoti, et al.
Published: (2025) -
Scene Detection Policies and Keyframe Extraction Strategies for Large-Scale Video Analysis
by: Korolkov, Vasilii
Published: (2025)