| Main Authors: | Ren, Wenqi, Wang, Weijie, Zheng, Meng, Wu, Ziyan, Tang, Yang, Zhong, Zhun, Sebe, Nicu |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Online Access: | https://arxiv.org/abs/2601.01439 |
Similar Items
Prototypical Hash Encoding for On-the-Fly Fine-Grained Category Discovery
by: Zheng, Haiyang, et al.
Published: (2024)
Generalized Fine-Grained Category Discovery with Multi-Granularity Conceptual Experts
by: Zheng, Haiyang, et al.
Published: (2025)
Textual Knowledge Matters: Cross-Modality Co-Teaching for Generalized Visual Class Discovery
by: Zheng, Haiyang, et al.
Published: (2024)
Generate, Refine, and Encode: Leveraging Synthesized Novel Samples for On-the-Fly Fine-Grained Category Discovery
by: Liu, Xiao, et al.
Published: (2025)
Open-World Deepfake Attribution via Confidence-Aware Asymmetric Learning
by: Zheng, Haiyang, et al.
Published: (2025)
Large-scale Pre-trained Models are Surprisingly Strong in Incremental Novel Class Discovery
by: Liu, Mingxuan, et al.
Published: (2023)
Hierarchical Cross-Attention Network for Virtual Try-On
by: Tang, Hao, et al.
Published: (2024)
Stable Neighbor Denoising for Source-free Domain Adaptive Segmentation
by: Zhao, Dong, et al.
Published: (2024)
Democratizing Fine-grained Visual Recognition with Large Language Models
by: Liu, Mingxuan, et al.
Published: (2024)
Open-Vocabulary Domain Generalization in Urban-Scene Segmentation
by: Zhao, Dong, et al.
Published: (2026)
ZeroReg: Zero-Shot Point Cloud Registration with Foundation Models
by: Wang, Weijie, et al.
Published: (2023)
FisherTune: Fisher-Guided Robust Tuning of Vision Foundation Models for Domain Generalized Segmentation
by: Zhao, Dong, et al.
Published: (2025)
Rethinking the Learning Paradigm for Facial Expression Recognition
by: Wang, Weijie, et al.
Published: (2022)
Asymmetric GANs for Image-to-Image Translation
by: Tang, Hao, et al.
Published: (2019)
Pseudolabel guided pixels contrast for domain adaptive semantic segmentation
by: Xiang, Jianzi, et al.
Published: (2025)
Generalizable Knowledge Distillation from Vision Foundation Models for Semantic Segmentation
by: Lv, Chonghua, et al.
Published: (2026)
SSR: SAM is a Strong Regularizer for domain adaptive semantic segmentation
by: Ge, Yanqi, et al.
Published: (2024)
Test-time adaptation for geospatial point cloud semantic segmentation with distinct domain shifts
by: Wang, Puzuo, et al.
Published: (2024)
PoInit-of-View: Poisoning Initialization of Views Transfers Across Multiple 3D Reconstruction Systems
by: Wang, Weijie, et al.
Published: (2026)
FreeInsert: Disentangled Text-Guided Object Insertion in 3D Gaussian Scene without Spatial Priors
by: Li, Chenxi, et al.
Published: (2025)
Fully-Geometric Cross-Attention for Point Cloud Registration
by: Wang, Weijie, et al.
Published: (2025)
Large Language Models for Multimodal Deformable Image Registration
by: Ma, Mingrui, et al.
Published: (2024)
Finetune Like You Pretrain: Boosting Zero-shot Adversarial Robustness in Vision-language Models
by: Xing, Songlong, et al.
Published: (2026)
Causal Disentanglement for Robust Long-tail Medical Image Generation
by: Nie, Weizhi, et al.
Published: (2025)
CLIP is Strong Enough to Fight Back: Test-time Counterattacks towards Zero-shot Adversarial Robustness of CLIP
by: Xing, Songlong, et al.
Published: (2025)
Vision+X: A Survey on Multimodal Learning in the Light of Data
by: Zhu, Ye, et al.
Published: (2022)
Enhanced Multi-Scale Cross-Attention for Person Image Generation
by: Tang, Hao, et al.
Published: (2025)
Graph Transformer GANs with Graph Masked Modeling for Architectural Layout Generation
by: Tang, Hao, et al.
Published: (2024)
Robust image segmentation model based on binary level set
by: Zhao, Wenqi
Published: (2024)
Cues3D: Unleashing the Power of Sole NeRF for Consistent and Unique Instances in Open-Vocabulary 3D Panoptic Segmentation
by: Xue, Feng, et al.
Published: (2025)
POCI-Diff: Position Objects Consistently and Interactively with 3D-Layout Guided Diffusion
by: Rigo, Andrea, et al.
Published: (2026)
RankFeat&RankWeight: Rank-1 Feature/Weight Removal for Out-of-distribution Detection
by: Song, Yue, et al.
Published: (2023)
Reverse Personalization
by: Kung, Han-Wei, et al.
Published: (2025)
Hyperbolic Busemann Neural Networks
by: Chen, Ziheng, et al.
Published: (2026)
Cross-Modal and Uncertainty-Aware Agglomeration for Open-Vocabulary 3D Scene Understanding
by: Li, Jinlong, et al.
Published: (2025)
Token Reduction via Local and Global Contexts Optimization for Efficient Video Large Language Models
by: Li, Jinlong, et al.
Published: (2026)
Multi-focal Conditioned Latent Diffusion for Person Image Synthesis
by: Liu, Jiaqi, et al.
Published: (2025)
SAMST: A Transformer framework based on SAM pseudo label filtering for remote sensing semi-supervised semantic segmentation
by: Yin, Jun, et al.
Published: (2025)
Spatial-Temporal Graph Mamba for Music-Guided Dance Video Synthesis
by: Tang, Hao, et al.
Published: (2025)
Loomis Painter: Reconstructing the Painting Process
by: Pobitzer, Markus, et al.
Published: (2025)