:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Tang, Zhiyang, Zhu, Yiming, Huang, Ruimin, Yang, Meng, Ma, Yong, Huang, Jun, Fan, Fan
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition Multimedia 94A08, 65K10, 90C25, 68T07 I.4.8; I.4.4; I.2.6
Online Access:	https://arxiv.org/abs/2603.21192
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Evaluating the Effect of Compression on Video Temporal Consistency Using Objective Quality Metrics
by: Zsoldos, Peter
Published: (2026)

Digital analysis of early color photographs taken using regular color screen processes
by: Hubička, Jan, et al.
Published: (2023)

Person detection and re-identification in open-world settings of retail stores and public spaces
by: Brkljač, Branko, et al.
Published: (2025)

Enhancing rice leaf images: An overview of image denoising techniques
by: Chutia, Rupjyoti, et al.
Published: (2025)

Scene Detection Policies and Keyframe Extraction Strategies for Large-Scale Video Analysis
by: Korolkov, Vasilii
Published: (2025)

Visual Style Prompt Learning Using Diffusion Models for Blind Face Restoration
by: Lu, Wanglong, et al.
Published: (2024)

FACEMUG: A Multimodal Generative and Fusion Framework for Local Facial Editing
by: Lu, Wanglong, et al.
Published: (2024)

Leum-VL Technical Report
by: He, Yuxuan, et al.
Published: (2026)

Transforming faces into video stories -- VideoFace2.0
by: Brkljač, Branko, et al.
Published: (2025)

FeedbackSTS-Det: Sparse Frames-Based Spatio-Temporal Semantic Feedback Network for Moving Infrared Small Target Detection
by: Huang, Yian, et al.
Published: (2026)

Improving Visual Object Tracking through Visual Prompting
by: Chen, Shih-Fang, et al.
Published: (2024)

Enhancing Maritime Object Detection in Real-Time with RT-DETR and Data Augmentation
by: Nemati, Nader
Published: (2025)

Noisier2Inverse: Self-Supervised Learning for Image Reconstruction with Correlated Noise
by: Gruber, Nadja, et al.
Published: (2025)

Bridging Knowledge Gap Between Image Inpainting and Large-Area Visible Watermark Removal
by: Leng, Yicheng, et al.
Published: (2025)

Two-step Authentication: Multi-biometric System Using Voice and Facial Recognition
by: Chen, Kuan Wei, et al.
Published: (2026)

CLIP-Joint-Detect: End-to-End Joint Training of Object Detectors with Contrastive Vision-Language Supervision
by: Raoufi, Behnam, et al.
Published: (2025)

Optimized $k$-means color quantization of digital images in machine-based and human perception-based colorspaces
by: Maitra, Ranjan
Published: (2026)

Compressive sensing inspired self-supervised single-pixel imaging
by: Lu, Jijun, et al.
Published: (2026)

A Roadmap for Multilingual, Multimodal Domain Independent Deception Detection
by: Boumber, Dainis, et al.
Published: (2024)

Understanding Identity Continuity in Thermal Video through Scene-Level Consistency
by: Sun, Wei-Chieh, et al.
Published: (2026)

Automatic Detection of Intro and Credits in Video using CLIP and Multihead Attention
by: Korolkov, Vasilii, et al.
Published: (2025)

Image and Video Compression using Generative Sparse Representation with Fidelity Controls
by: Jiang, Wei, et al.
Published: (2024)

Seeing The Words: Evaluating AI-generated Biblical Art
by: Makimei, Hidde, et al.
Published: (2025)

Deep Learning-Based Multi-Object Tracking: A Comprehensive Survey from Foundations to State-of-the-Art
by: Adžemović, Momir
Published: (2025)

DeepInverse: A Python package for solving imaging inverse problems with deep learning
by: Tachella, Julián, et al.
Published: (2025)

IF-D: A High-Frequency, General-Purpose Inertial Foundation Dataset for Self-Supervised Learning
by: Ferreira, Patrick, et al.
Published: (2025)

Geo2Sound: A Scalable Geo-Aligned Framework for Soundscape Generation from Satellite Imagery
by: Wu, Kunlin, et al.
Published: (2026)

FLD+: Data-efficient Evaluation Metric for Generative Models
by: Jeevan, Pranav, et al.
Published: (2024)

WaveMixSR-V2: Enhancing Super-resolution with Higher Efficiency
by: Jeevan, Pranav, et al.
Published: (2024)

Normalizing Flow-Based Metric for Image Generation
by: Jeevan, Pranav, et al.
Published: (2024)

Fairness Without Labels: Pseudo-Balancing for Bias Mitigation in Face Gender Classification
by: Dong, Haohua, et al.
Published: (2025)

BG-YOLO: A Bidirectional-Guided Method for Underwater Object Detection
by: Zhang, Jian, et al.
Published: (2024)

Neural Image Compression Using Masked Sparse Visual Representation
by: Jiang, Wei, et al.
Published: (2023)

ADD for Multi-Bit Image Watermarking
by: Luo, An, et al.
Published: (2026)

Traffic Scene Small Target Detection Method Based on YOLOv8n-SPTS Model for Autonomous Driving
by: Wu, Songhan
Published: (2025)

The Determinant Ratio Matrix Approach to Solving 3D Matching and 2D Orthographic Projection Alignment Tasks
by: Hanson, Andrew J., et al.
Published: (2025)

Generative Model-Driven Synthetic Training Image Generation: An Approach to Cognition in Rail Defect Detection
by: Ferdousi, Rahatara, et al.
Published: (2023)

Light Future: Multimodal Action Frame Prediction via InstructPix2Pix
by: Zhong, Zesen, et al.
Published: (2025)

Goal-Oriented Source Coding using LDPC Codes for Compressed-Domain Image Classification
by: Aliouat, Ahcen, et al.
Published: (2025)

The Impact of Image Resolution on Face Detection: A Comparative Analysis of MTCNN, YOLOv XI and YOLOv XII models
by: Ömercikoğlu, Ahmet Can, et al.
Published: (2025)