:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Zhang, Feng, Deng, Haoyou, Li, Zhiqiang, Li, Lida, Xu, Bin, Lu, Qingbo, Cao, Zisheng, Wei, Minchen, Gao, Changxin, Sang, Nong, Bai, Xiang
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2510.11613
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Learning Unpaired Image Dehazing with Physics-based Rehazy Generation
by: Deng, Haoyou, et al.
Published: (2025)

Lookup Table meets Local Laplacian Filter: Pyramid Reconstruction Network for Tone Mapping
by: Zhang, Feng, et al.
Published: (2023)

DenseGRPO: From Sparse to Dense Reward for Flow Matching Model Alignment
by: Deng, Haoyou, et al.
Published: (2026)

REPAIR: Rank Correlation and Noisy Pair Half-replacing with Memory for Noisy Correspondence
by: Zheng, Ruochen, et al.
Published: (2024)

DFIMat: Decoupled Flexible Interactive Matting in Multi-Person Scenarios
by: Jiao, Siyi, et al.
Published: (2024)

Structural Pruning via Spatial-aware Information Redundancy for Semantic Segmentation
by: Wu, Dongyue, et al.
Published: (2024)

SCTNet: Single-Branch CNN with Transformer Semantic Information for Real-Time Segmentation
by: Xu, Zhengze, et al.
Published: (2023)

Is Nano Banana Pro a Low-Level Vision All-Rounder? A Comprehensive Evaluation on 14 Tasks and 40 Datasets
by: Zuo, Jialong, et al.
Published: (2025)

Learning Inverse Laplacian Pyramid for Progressive Depth Completion
by: Wang, Kun, et al.
Published: (2025)

HR-Pro: Point-supervised Temporal Action Localization via Hierarchical Reliability Propagation
by: Zhang, Huaxin, et al.
Published: (2023)

Partial Forward Blocking: A Novel Data Pruning Paradigm for Lossless Training Acceleration
by: Wu, Dongyue, et al.
Published: (2025)

Object-Aware Video Matting with Cross-Frame Guidance
by: Zhang, Huayu, et al.
Published: (2025)

CLIP-guided Prototype Modulating for Few-shot Action Recognition
by: Wang, Xiang, et al.
Published: (2023)

UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformer
by: Wang, Xiang, et al.
Published: (2025)

Open-Vocabulary Semantic Segmentation with Image Embedding Balancing
by: Shan, Xiangheng, et al.
Published: (2024)

Adaptive Prototype Replay for Class Incremental Semantic Segmentation
by: Zhu, Guilin, et al.
Published: (2024)

Learning to Tell Apart: Weakly Supervised Video Anomaly Detection via Disentangled Semantic Alignment
by: Yin, Wenti, et al.
Published: (2025)

Holmes-VAU: Towards Long-term Video Anomaly Understanding at Any Granularity
by: Zhang, Huaxin, et al.
Published: (2024)

ReID5o: Achieving Omni Multi-modal Person Re-identification in a Single Model
by: Zuo, Jialong, et al.
Published: (2025)

MP-Mat: A 3D-and-Instance-Aware Human Matting and Editing Framework with Multiplane Representation
by: Jiao, Siyi, et al.
Published: (2025)

UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation
by: Wang, Xiang, et al.
Published: (2024)

Taming Consistency Distillation for Accelerated Human Image Animation
by: Wang, Xiang, et al.
Published: (2025)

PLIP: Language-Image Pre-training for Person Representation Learning
by: Zuo, Jialong, et al.
Published: (2023)

UFineBench: Towards Text-based Person Retrieval with Ultra-fine Granularity
by: Zuo, Jialong, et al.
Published: (2023)

Spatial Cascaded Clustering and Weighted Memory for Unsupervised Person Re-identification
by: Hong, Jiahao, et al.
Published: (2024)

Small Object Detection Model with Spatial Laplacian Pyramid Attention and Multi-Scale Features Enhancement in Aerial Images
by: Ji, Zhangjian, et al.
Published: (2026)

Replace Anyone in Videos
by: Wang, Xiang, et al.
Published: (2024)

Holmes-VAD: Towards Unbiased and Explainable Video Anomaly Detection via Multi-modal LLM
by: Zhang, Huaxin, et al.
Published: (2024)

GlanceVAD: Exploring Glance Supervision for Label-efficient Video Anomaly Detection
by: Zhang, Huaxin, et al.
Published: (2024)

Cross-video Identity Correlating for Person Re-identification Pre-training
by: Zuo, Jialong, et al.
Published: (2024)

VideoLucy: Deep Memory Backtracking for Long Video Understanding
by: Zuo, Jialong, et al.
Published: (2025)

PULPo: Probabilistic Unsupervised Laplacian Pyramid Registration
by: Siegert, Leonard, et al.
Published: (2024)

Towards Reliable and Holistic Visual In-Context Learning Prompt Selection
by: Wu, Wenxiao, et al.
Published: (2025)

RPBA-Net: An Interpretable Residual Pyramid Bilateral Affine Network for RAW-Domain ISP Enhancement
by: Xin, Yucheng, et al.
Published: (2026)

Adaptive Semantic Consistency for Cross-domain Few-shot Classification
by: Lu, Hengchu, et al.
Published: (2023)

Tunnel Try-on: Excavating Spatial-temporal Tunnels for High-quality Virtual Try-on in Videos
by: Xu, Zhengze, et al.
Published: (2024)

Full-quantum variational dynamics simulation for time-dependent Hamiltonians with global spectral discretization
by: Qiao, Minchen, et al.
Published: (2026)

LookingGlass: Generative Anamorphoses via Laplacian Pyramid Warping
by: Chang, Pascal, et al.
Published: (2025)

EDCSSM: Edge Detection with Convolutional State Space Model
by: Hong, Qinghui, et al.
Published: (2024)

Real analyticity of the modified Laplacian coflow
by: Li, Chuanhuan, et al.
Published: (2024)