Saved in:
| Main Authors: | Zhang, Feng, Deng, Haoyou, Li, Zhiqiang, Li, Lida, Xu, Bin, Lu, Qingbo, Cao, Zisheng, Wei, Minchen, Gao, Changxin, Sang, Nong, Bai, Xiang |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.11613 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Learning Unpaired Image Dehazing with Physics-based Rehazy Generation
by: Deng, Haoyou, et al.
Published: (2025)
by: Deng, Haoyou, et al.
Published: (2025)
Lookup Table meets Local Laplacian Filter: Pyramid Reconstruction Network for Tone Mapping
by: Zhang, Feng, et al.
Published: (2023)
by: Zhang, Feng, et al.
Published: (2023)
DenseGRPO: From Sparse to Dense Reward for Flow Matching Model Alignment
by: Deng, Haoyou, et al.
Published: (2026)
by: Deng, Haoyou, et al.
Published: (2026)
REPAIR: Rank Correlation and Noisy Pair Half-replacing with Memory for Noisy Correspondence
by: Zheng, Ruochen, et al.
Published: (2024)
by: Zheng, Ruochen, et al.
Published: (2024)
DFIMat: Decoupled Flexible Interactive Matting in Multi-Person Scenarios
by: Jiao, Siyi, et al.
Published: (2024)
by: Jiao, Siyi, et al.
Published: (2024)
Structural Pruning via Spatial-aware Information Redundancy for Semantic Segmentation
by: Wu, Dongyue, et al.
Published: (2024)
by: Wu, Dongyue, et al.
Published: (2024)
SCTNet: Single-Branch CNN with Transformer Semantic Information for Real-Time Segmentation
by: Xu, Zhengze, et al.
Published: (2023)
by: Xu, Zhengze, et al.
Published: (2023)
Is Nano Banana Pro a Low-Level Vision All-Rounder? A Comprehensive Evaluation on 14 Tasks and 40 Datasets
by: Zuo, Jialong, et al.
Published: (2025)
by: Zuo, Jialong, et al.
Published: (2025)
Learning Inverse Laplacian Pyramid for Progressive Depth Completion
by: Wang, Kun, et al.
Published: (2025)
by: Wang, Kun, et al.
Published: (2025)
HR-Pro: Point-supervised Temporal Action Localization via Hierarchical Reliability Propagation
by: Zhang, Huaxin, et al.
Published: (2023)
by: Zhang, Huaxin, et al.
Published: (2023)
Partial Forward Blocking: A Novel Data Pruning Paradigm for Lossless Training Acceleration
by: Wu, Dongyue, et al.
Published: (2025)
by: Wu, Dongyue, et al.
Published: (2025)
Object-Aware Video Matting with Cross-Frame Guidance
by: Zhang, Huayu, et al.
Published: (2025)
by: Zhang, Huayu, et al.
Published: (2025)
CLIP-guided Prototype Modulating for Few-shot Action Recognition
by: Wang, Xiang, et al.
Published: (2023)
by: Wang, Xiang, et al.
Published: (2023)
UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformer
by: Wang, Xiang, et al.
Published: (2025)
by: Wang, Xiang, et al.
Published: (2025)
Open-Vocabulary Semantic Segmentation with Image Embedding Balancing
by: Shan, Xiangheng, et al.
Published: (2024)
by: Shan, Xiangheng, et al.
Published: (2024)
Adaptive Prototype Replay for Class Incremental Semantic Segmentation
by: Zhu, Guilin, et al.
Published: (2024)
by: Zhu, Guilin, et al.
Published: (2024)
Learning to Tell Apart: Weakly Supervised Video Anomaly Detection via Disentangled Semantic Alignment
by: Yin, Wenti, et al.
Published: (2025)
by: Yin, Wenti, et al.
Published: (2025)
Holmes-VAU: Towards Long-term Video Anomaly Understanding at Any Granularity
by: Zhang, Huaxin, et al.
Published: (2024)
by: Zhang, Huaxin, et al.
Published: (2024)
ReID5o: Achieving Omni Multi-modal Person Re-identification in a Single Model
by: Zuo, Jialong, et al.
Published: (2025)
by: Zuo, Jialong, et al.
Published: (2025)
MP-Mat: A 3D-and-Instance-Aware Human Matting and Editing Framework with Multiplane Representation
by: Jiao, Siyi, et al.
Published: (2025)
by: Jiao, Siyi, et al.
Published: (2025)
UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation
by: Wang, Xiang, et al.
Published: (2024)
by: Wang, Xiang, et al.
Published: (2024)
Taming Consistency Distillation for Accelerated Human Image Animation
by: Wang, Xiang, et al.
Published: (2025)
by: Wang, Xiang, et al.
Published: (2025)
PLIP: Language-Image Pre-training for Person Representation Learning
by: Zuo, Jialong, et al.
Published: (2023)
by: Zuo, Jialong, et al.
Published: (2023)
UFineBench: Towards Text-based Person Retrieval with Ultra-fine Granularity
by: Zuo, Jialong, et al.
Published: (2023)
by: Zuo, Jialong, et al.
Published: (2023)
Spatial Cascaded Clustering and Weighted Memory for Unsupervised Person Re-identification
by: Hong, Jiahao, et al.
Published: (2024)
by: Hong, Jiahao, et al.
Published: (2024)
Small Object Detection Model with Spatial Laplacian Pyramid Attention and Multi-Scale Features Enhancement in Aerial Images
by: Ji, Zhangjian, et al.
Published: (2026)
by: Ji, Zhangjian, et al.
Published: (2026)
Replace Anyone in Videos
by: Wang, Xiang, et al.
Published: (2024)
by: Wang, Xiang, et al.
Published: (2024)
Holmes-VAD: Towards Unbiased and Explainable Video Anomaly Detection via Multi-modal LLM
by: Zhang, Huaxin, et al.
Published: (2024)
by: Zhang, Huaxin, et al.
Published: (2024)
GlanceVAD: Exploring Glance Supervision for Label-efficient Video Anomaly Detection
by: Zhang, Huaxin, et al.
Published: (2024)
by: Zhang, Huaxin, et al.
Published: (2024)
Cross-video Identity Correlating for Person Re-identification Pre-training
by: Zuo, Jialong, et al.
Published: (2024)
by: Zuo, Jialong, et al.
Published: (2024)
VideoLucy: Deep Memory Backtracking for Long Video Understanding
by: Zuo, Jialong, et al.
Published: (2025)
by: Zuo, Jialong, et al.
Published: (2025)
PULPo: Probabilistic Unsupervised Laplacian Pyramid Registration
by: Siegert, Leonard, et al.
Published: (2024)
by: Siegert, Leonard, et al.
Published: (2024)
Towards Reliable and Holistic Visual In-Context Learning Prompt Selection
by: Wu, Wenxiao, et al.
Published: (2025)
by: Wu, Wenxiao, et al.
Published: (2025)
RPBA-Net: An Interpretable Residual Pyramid Bilateral Affine Network for RAW-Domain ISP Enhancement
by: Xin, Yucheng, et al.
Published: (2026)
by: Xin, Yucheng, et al.
Published: (2026)
Adaptive Semantic Consistency for Cross-domain Few-shot Classification
by: Lu, Hengchu, et al.
Published: (2023)
by: Lu, Hengchu, et al.
Published: (2023)
Tunnel Try-on: Excavating Spatial-temporal Tunnels for High-quality Virtual Try-on in Videos
by: Xu, Zhengze, et al.
Published: (2024)
by: Xu, Zhengze, et al.
Published: (2024)
Full-quantum variational dynamics simulation for time-dependent Hamiltonians with global spectral discretization
by: Qiao, Minchen, et al.
Published: (2026)
by: Qiao, Minchen, et al.
Published: (2026)
LookingGlass: Generative Anamorphoses via Laplacian Pyramid Warping
by: Chang, Pascal, et al.
Published: (2025)
by: Chang, Pascal, et al.
Published: (2025)
EDCSSM: Edge Detection with Convolutional State Space Model
by: Hong, Qinghui, et al.
Published: (2024)
by: Hong, Qinghui, et al.
Published: (2024)
Real analyticity of the modified Laplacian coflow
by: Li, Chuanhuan, et al.
Published: (2024)
by: Li, Chuanhuan, et al.
Published: (2024)
Similar Items
-
Learning Unpaired Image Dehazing with Physics-based Rehazy Generation
by: Deng, Haoyou, et al.
Published: (2025) -
Lookup Table meets Local Laplacian Filter: Pyramid Reconstruction Network for Tone Mapping
by: Zhang, Feng, et al.
Published: (2023) -
DenseGRPO: From Sparse to Dense Reward for Flow Matching Model Alignment
by: Deng, Haoyou, et al.
Published: (2026) -
REPAIR: Rank Correlation and Noisy Pair Half-replacing with Memory for Noisy Correspondence
by: Zheng, Ruochen, et al.
Published: (2024) -
DFIMat: Decoupled Flexible Interactive Matting in Multi-Person Scenarios
by: Jiao, Siyi, et al.
Published: (2024)