| Main Authors: | Jin, Sheng; Yao, Ruijie; Xu, Lumin; Liu, Wentao; Qian, Chen; Wu, Ji; Luo, Ping |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2404.19401 |
Similar Items
GKGNet: Group K-Nearest Neighbor based Graph Convolutional Network for Multi-Label Image Recognition
by: Yao, Ruijie, et al.
Published: (2023)
UniFS: Unified Multi-Contrast MRI Reconstruction via Frequency-Spatial Fusion
by: Li, Jialin, et al.
Published: (2025)
TCFormer: Visual Recognition via Token Clustering Transformer
by: Zeng, Wang, et al.
Published: (2024)
Harmonizing Visual Representations for Unified Multimodal Understanding and Generation
by: Wu, Size, et al.
Published: (2025)
KptLLM: Unveiling the Power of Large Language Model for Keypoint Comprehension
by: Yang, Jie, et al.
Published: (2024)
You Only Learn One Query: Learning Unified Human Query for Single-Stage Multi-Person Multi-Task Human-Centric Perception
by: Jin, Sheng, et al.
Published: (2023)
KptLLM++: Towards Generic Keypoint Comprehension with Large Language Model
by: Yang, Jie, et al.
Published: (2025)
CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction
by: Wu, Size, et al.
Published: (2023)
F-LMM: Grounding Frozen Large Multimodal Models
by: Wu, Size, et al.
Published: (2024)
Hierarchical Compositional Representations for Few-shot Action Recognition
by: Li, Changzhen, et al.
Published: (2022)
Joint Image-Instance Spatial-Temporal Attention for Few-shot Action Recognition
by: Qian, Zefeng, et al.
Published: (2025)
Ultra-High Resolution Segmentation via Boundary-Enhanced Patch-Merging Transformer
by: Sun, Haopeng, et al.
Published: (2024)
Decoupling Classifier for Boosting Few-shot Object Detection and Instance Segmentation
by: Gao, Bin-Bin, et al.
Published: (2025)
NADER: Neural Architecture Design via Multi-Agent Collaboration
by: Yang, Zekang, et al.
Published: (2024)
When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model and Benchmark Dataset
by: Zhang, Yi, et al.
Published: (2024)
IQE-CLIP: Instance-aware Query Embedding for Zero-/Few-shot Anomaly Detection in Medical Domain
by: Huang, Hong, et al.
Published: (2025)
UniGoal: Towards Universal Zero-shot Goal-oriented Navigation
by: Yin, Hang, et al.
Published: (2025)
Uni-DAD: Unified Distillation and Adaptation of Diffusion Models for Few-step Few-shot Image Generation
by: Bahram, Yara, et al.
Published: (2025)
Rethinking Few-shot 3D Point Cloud Semantic Segmentation
by: An, Zhaochong, et al.
Published: (2024)
Enhancing Environmental Robustness in Few-shot Learning via Conditional Representation Learning
by: Guo, Qianyu, et al.
Published: (2025)
UniVAD: A Training-free Unified Model for Few-shot Visual Anomaly Detection
by: Gu, Zhaopeng, et al.
Published: (2024)
TSAL: Few-shot Text Segmentation Based on Attribute Learning
by: Li, Chenming, et al.
Published: (2025)
InstanceGaussian: Appearance-Semantic Joint Gaussian Representation for 3D Instance-Level Perception
by: Li, Haijie, et al.
Published: (2024)
AutoMMLab: Automatically Generating Deployable Models from Language Instructions for Computer Vision Tasks
by: Yang, Zekang, et al.
Published: (2024)
Benchmarking Few-shot Transferability of Pre-trained Models with Improved Evaluation Protocols
by: Luo, Xu, et al.
Published: (2026)
Multimodality Helps Few-shot 3D Point Cloud Semantic Segmentation
by: An, Zhaochong, et al.
Published: (2024)
UniRiT: Towards Few-Shot Non-Rigid Point Cloud Registration
by: Li, Geng, et al.
Published: (2024)
Task-oriented Learnable Diffusion Timesteps for Universal Few-shot Learning of Dense Tasks
by: Oh, Changgyoon, et al.
Published: (2025)
Customize Your Own Paired Data via Few-shot Way
by: Chen, Jinshu, et al.
Published: (2024)
PointRFT: Explicit Reinforcement Fine-tuning for Point Cloud Few-shot Learning
by: Wang, Yankai, et al.
Published: (2026)
Anomaly Multi-classification in Industrial Scenarios: Transferring Few-shot Learning to a New Task
by: Liu, Jie, et al.
Published: (2024)
Beyond Label Semantics: Language-Guided Action Anatomy for Few-shot Action Recognition
by: Qian, Zefeng, et al.
Published: (2025)
Uni-Hand: Universal Hand Motion Forecasting in Egocentric Views
by: Ma, Junyi, et al.
Published: (2025)
Bridge the Points: Graph-based Few-shot Segment Anything Semantically
by: Zhang, Anqi, et al.
Published: (2024)
Dynamic Prototype Adaptation with Distillation for Few-shot Point Cloud Segmentation
by: Liu, Jie, et al.
Published: (2024)
LERENet: Eliminating Intra-class Differences for Metal Surface Defect Few-shot Semantic Segmentation
by: Ding, Hanze, et al.
Published: (2024)
UINO-FSS: Unifying Representation Learning and Few-shot Segmentation via Hierarchical Distillation and Mamba-HyperCorrelation
by: Zhuo, Wei, et al.
Published: (2025)
FLIER: Few-shot Language Image Models Embedded with Latent Representations
by: Zhou, Zhinuo, et al.
Published: (2024)
Towards Few-shot Out-of-Distribution Detection
by: Dong, Jiuqing, et al.
Published: (2023)
Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation
by: Zhu, Muzhi, et al.
Published: (2024)