:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Ren, Wenqi, Wang, Weijie, Zheng, Meng, Wu, Ziyan, Tang, Yang, Zhong, Zhun, Sebe, Nicu
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2601.01439
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Prototypical Hash Encoding for On-the-Fly Fine-Grained Category Discovery
by: Zheng, Haiyang, et al.
Published: (2024)

Generalized Fine-Grained Category Discovery with Multi-Granularity Conceptual Experts
by: Zheng, Haiyang, et al.
Published: (2025)

Textual Knowledge Matters: Cross-Modality Co-Teaching for Generalized Visual Class Discovery
by: Zheng, Haiyang, et al.
Published: (2024)

Generate, Refine, and Encode: Leveraging Synthesized Novel Samples for On-the-Fly Fine-Grained Category Discovery
by: Liu, Xiao, et al.
Published: (2025)

Open-World Deepfake Attribution via Confidence-Aware Asymmetric Learning
by: Zheng, Haiyang, et al.
Published: (2025)

Large-scale Pre-trained Models are Surprisingly Strong in Incremental Novel Class Discovery
by: Liu, Mingxuan, et al.
Published: (2023)

Hierarchical Cross-Attention Network for Virtual Try-On
by: Tang, Hao, et al.
Published: (2024)

Stable Neighbor Denoising for Source-free Domain Adaptive Segmentation
by: Zhao, Dong, et al.
Published: (2024)

Democratizing Fine-grained Visual Recognition with Large Language Models
by: Liu, Mingxuan, et al.
Published: (2024)

Open-Vocabulary Domain Generalization in Urban-Scene Segmentation
by: Zhao, Dong, et al.
Published: (2026)

ZeroReg: Zero-Shot Point Cloud Registration with Foundation Models
by: Wang, Weijie, et al.
Published: (2023)

FisherTune: Fisher-Guided Robust Tuning of Vision Foundation Models for Domain Generalized Segmentation
by: Zhao, Dong, et al.
Published: (2025)

Rethinking the Learning Paradigm for Facial Expression Recognition
by: Wang, Weijie, et al.
Published: (2022)

Asymmetric GANs for Image-to-Image Translation
by: Tang, Hao, et al.
Published: (2019)

Pseudolabel guided pixels contrast for domain adaptive semantic segmentation
by: Xiang, Jianzi, et al.
Published: (2025)

Generalizable Knowledge Distillation from Vision Foundation Models for Semantic Segmentation
by: Lv, Chonghua, et al.
Published: (2026)

SSR: SAM is a Strong Regularizer for domain adaptive semantic segmentation
by: Ge, Yanqi, et al.
Published: (2024)

Test-time adaptation for geospatial point cloud semantic segmentation with distinct domain shifts
by: Wang, Puzuo, et al.
Published: (2024)

PoInit-of-View: Poisoning Initialization of Views Transfers Across Multiple 3D Reconstruction Systems
by: Wang, Weijie, et al.
Published: (2026)

FreeInsert: Disentangled Text-Guided Object Insertion in 3D Gaussian Scene without Spatial Priors
by: Li, Chenxi, et al.
Published: (2025)

Fully-Geometric Cross-Attention for Point Cloud Registration
by: Wang, Weijie, et al.
Published: (2025)

Large Language Models for Multimodal Deformable Image Registration
by: Ma, Mingrui, et al.
Published: (2024)

Finetune Like You Pretrain: Boosting Zero-shot Adversarial Robustness in Vision-language Models
by: Xing, Songlong, et al.
Published: (2026)

Causal Disentanglement for Robust Long-tail Medical Image Generation
by: Nie, Weizhi, et al.
Published: (2025)

CLIP is Strong Enough to Fight Back: Test-time Counterattacks towards Zero-shot Adversarial Robustness of CLIP
by: Xing, Songlong, et al.
Published: (2025)

Vision+X: A Survey on Multimodal Learning in the Light of Data
by: Zhu, Ye, et al.
Published: (2022)

Enhanced Multi-Scale Cross-Attention for Person Image Generation
by: Tang, Hao, et al.
Published: (2025)

Graph Transformer GANs with Graph Masked Modeling for Architectural Layout Generation
by: Tang, Hao, et al.
Published: (2024)

Robust image segmentation model based on binary level set
by: Zhao, Wenqi
Published: (2024)

Cues3D: Unleashing the Power of Sole NeRF for Consistent and Unique Instances in Open-Vocabulary 3D Panoptic Segmentation
by: Xue, Feng, et al.
Published: (2025)

POCI-Diff: Position Objects Consistently and Interactively with 3D-Layout Guided Diffusion
by: Rigo, Andrea, et al.
Published: (2026)

RankFeat&RankWeight: Rank-1 Feature/Weight Removal for Out-of-distribution Detection
by: Song, Yue, et al.
Published: (2023)

Reverse Personalization
by: Kung, Han-Wei, et al.
Published: (2025)

Hyperbolic Busemann Neural Networks
by: Chen, Ziheng, et al.
Published: (2026)

Cross-Modal and Uncertainty-Aware Agglomeration for Open-Vocabulary 3D Scene Understanding
by: Li, Jinlong, et al.
Published: (2025)

Token Reduction via Local and Global Contexts Optimization for Efficient Video Large Language Models
by: Li, Jinlong, et al.
Published: (2026)

Multi-focal Conditioned Latent Diffusion for Person Image Synthesis
by: Liu, Jiaqi, et al.
Published: (2025)

SAMST: A Transformer framework based on SAM pseudo label filtering for remote sensing semi-supervised semantic segmentation
by: Yin, Jun, et al.
Published: (2025)

Spatial-Temporal Graph Mamba for Music-Guided Dance Video Synthesis
by: Tang, Hao, et al.
Published: (2025)

Loomis Painter: Reconstructing the Painting Process
by: Pobitzer, Markus, et al.
Published: (2025)