:: Library Catalog

Bibliographic Details
Main Authors:	Jin, Sheng; Yao, Ruijie; Xu, Lumin; Liu, Wentao; Qian, Chen; Wu, Ji; Luo, Ping
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2404.19401

Similar Items

GKGNet: Group K-Nearest Neighbor based Graph Convolutional Network for Multi-Label Image Recognition
by: Yao, Ruijie, et al.
Published: (2023)

UniFS: Unified Multi-Contrast MRI Reconstruction via Frequency-Spatial Fusion
by: Li, Jialin, et al.
Published: (2025)

TCFormer: Visual Recognition via Token Clustering Transformer
by: Zeng, Wang, et al.
Published: (2024)

Harmonizing Visual Representations for Unified Multimodal Understanding and Generation
by: Wu, Size, et al.
Published: (2025)

KptLLM: Unveiling the Power of Large Language Model for Keypoint Comprehension
by: Yang, Jie, et al.
Published: (2024)

You Only Learn One Query: Learning Unified Human Query for Single-Stage Multi-Person Multi-Task Human-Centric Perception
by: Jin, Sheng, et al.
Published: (2023)

KptLLM++: Towards Generic Keypoint Comprehension with Large Language Model
by: Yang, Jie, et al.
Published: (2025)

CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction
by: Wu, Size, et al.
Published: (2023)

F-LMM: Grounding Frozen Large Multimodal Models
by: Wu, Size, et al.
Published: (2024)

Hierarchical Compositional Representations for Few-shot Action Recognition
by: Li, Changzhen, et al.
Published: (2022)

Joint Image-Instance Spatial-Temporal Attention for Few-shot Action Recognition
by: Qian, Zefeng, et al.
Published: (2025)

Ultra-High Resolution Segmentation via Boundary-Enhanced Patch-Merging Transformer
by: Sun, Haopeng, et al.
Published: (2024)

Decoupling Classifier for Boosting Few-shot Object Detection and Instance Segmentation
by: Gao, Bin-Bin, et al.
Published: (2025)

NADER: Neural Architecture Design via Multi-Agent Collaboration
by: Yang, Zekang, et al.
Published: (2024)

When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model and Benchmark Dataset
by: Zhang, Yi, et al.
Published: (2024)

IQE-CLIP: Instance-aware Query Embedding for Zero-/Few-shot Anomaly Detection in Medical Domain
by: Huang, Hong, et al.
Published: (2025)

UniGoal: Towards Universal Zero-shot Goal-oriented Navigation
by: Yin, Hang, et al.
Published: (2025)

Uni-DAD: Unified Distillation and Adaptation of Diffusion Models for Few-step Few-shot Image Generation
by: Bahram, Yara, et al.
Published: (2025)

Rethinking Few-shot 3D Point Cloud Semantic Segmentation
by: An, Zhaochong, et al.
Published: (2024)

Enhancing Environmental Robustness in Few-shot Learning via Conditional Representation Learning
by: Guo, Qianyu, et al.
Published: (2025)

UniVAD: A Training-free Unified Model for Few-shot Visual Anomaly Detection
by: Gu, Zhaopeng, et al.
Published: (2024)

TSAL: Few-shot Text Segmentation Based on Attribute Learning
by: Li, Chenming, et al.
Published: (2025)

InstanceGaussian: Appearance-Semantic Joint Gaussian Representation for 3D Instance-Level Perception
by: Li, Haijie, et al.
Published: (2024)

AutoMMLab: Automatically Generating Deployable Models from Language Instructions for Computer Vision Tasks
by: Yang, Zekang, et al.
Published: (2024)

Benchmarking Few-shot Transferability of Pre-trained Models with Improved Evaluation Protocols
by: Luo, Xu, et al.
Published: (2026)

Multimodality Helps Few-shot 3D Point Cloud Semantic Segmentation
by: An, Zhaochong, et al.
Published: (2024)

UniRiT: Towards Few-Shot Non-Rigid Point Cloud Registration
by: Li, Geng, et al.
Published: (2024)

Task-oriented Learnable Diffusion Timesteps for Universal Few-shot Learning of Dense Tasks
by: Oh, Changgyoon, et al.
Published: (2025)

Customize Your Own Paired Data via Few-shot Way
by: Chen, Jinshu, et al.
Published: (2024)

PointRFT: Explicit Reinforcement Fine-tuning for Point Cloud Few-shot Learning
by: Wang, Yankai, et al.
Published: (2026)

Anomaly Multi-classification in Industrial Scenarios: Transferring Few-shot Learning to a New Task
by: Liu, Jie, et al.
Published: (2024)

Beyond Label Semantics: Language-Guided Action Anatomy for Few-shot Action Recognition
by: Qian, Zefeng, et al.
Published: (2025)

Uni-Hand: Universal Hand Motion Forecasting in Egocentric Views
by: Ma, Junyi, et al.
Published: (2025)

Bridge the Points: Graph-based Few-shot Segment Anything Semantically
by: Zhang, Anqi, et al.
Published: (2024)

Dynamic Prototype Adaptation with Distillation for Few-shot Point Cloud Segmentation
by: Liu, Jie, et al.
Published: (2024)

LERENet: Eliminating Intra-class Differences for Metal Surface Defect Few-shot Semantic Segmentation
by: Ding, Hanze, et al.
Published: (2024)

UINO-FSS: Unifying Representation Learning and Few-shot Segmentation via Hierarchical Distillation and Mamba-HyperCorrelation
by: Zhuo, Wei, et al.
Published: (2025)

FLIER: Few-shot Language Image Models Embedded with Latent Representations
by: Zhou, Zhinuo, et al.
Published: (2024)

Towards Few-shot Out-of-Distribution Detection
by: Dong, Jiuqing, et al.
Published: (2023)

Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation
by: Zhu, Muzhi, et al.
Published: (2024)