| Main Authors: | Deng, Jieren, Zhang, Haojian, Ding, Kun, Hu, Jianhua, Zhang, Xingxuan, Wang, Yunkuan |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2403.01680 |
Similar Items
Weak Distribution Detectors Lead to Stronger Generalizability of Vision-Language Prompt Tuning
by: Ding, Kun, et al.
Published: (2024)
Calibrated Cache Model for Few-Shot Vision-Language Model Adaptation
by: Ding, Kun, et al.
Published: (2024)
Compositional Kronecker Context Optimization for Vision-Language Models
by: Ding, Kun, et al.
Published: (2024)
Smooth and Stepwise Self-Distillation for Object Detection
by: Deng, Jieren, et al.
Published: (2023)
Towards Zero-shot Human-Object Interaction Detection via Vision-Language Integration
by: Xue, Weiying, et al.
Published: (2024)
Generalizable Semantic Vision Query Generation for Zero-shot Panoptic and Semantic Segmentation
by: Chen, Jialei, et al.
Published: (2024)
Prompt Tuning with Soft Context Sharing for Vision-Language Models
by: Ding, Kun, et al.
Published: (2022)
RATE-Nav: Region-Aware Termination Enhancement for Zero-shot Object Navigation with Vision-Language Models
by: Li, Junjie, et al.
Published: (2025)
Learning Causal Features for Incremental Object Detection
by: He, Zhenwei, et al.
Published: (2024)
A Survey of Low-shot Vision-Language Model Adaptation via Representer Theorem
by: Ding, Kun, et al.
Published: (2024)
Gate-and-Merge: Zero-shot Compositional Personalization of Vision Language Models
by: Ding, Guodong, et al.
Published: (2026)
Chain of Visual Perception: Harnessing Multimodal Large Language Models for Zero-shot Camouflaged Object Detection
by: Tang, Lv, et al.
Published: (2023)
ZING-3D: Zero-shot Incremental 3D Scene Graphs via Vision-Language Models
by: Saxena, Pranav, et al.
Published: (2025)
VisTa: Visual-contextual and Text-augmented Zero-shot Object-level OOD Detection
by: Zhang, Bin, et al.
Published: (2025)
AnomalyCLIP: Object-agnostic Prompt Learning for Zero-shot Anomaly Detection
by: Zhou, Qihang, et al.
Published: (2023)
Task-Specific Zero-shot Quantization-Aware Training for Object Detection
by: Li, Changhao, et al.
Published: (2025)
Incremental Object Detection with CLIP
by: Huang, Ziyue, et al.
Published: (2023)
Zero-shot Object Counting with Good Exemplars
by: Zhu, Huilin, et al.
Published: (2024)
CASA: Class-Agnostic Shared Attributes in Vision-Language Models for Efficient Incremental Object Detection
by: Guo, Mingyi, et al.
Published: (2024)
Generalizable Prompt Tuning for Vision-Language Models
by: Zhang, Qian
Published: (2024)
ArtiBench and ArtiBrain: Benchmarking Generalizable Vision-Language Articulated Object Manipulation
by: Wu, Yuhan, et al.
Published: (2025)
Exploring Fine-grained Retail Product Discrimination with Zero-shot Object Classification Using Vision-Language Models
by: Tur, Anil Osman, et al.
Published: (2024)
GS-CLIP: Zero-shot 3D Anomaly Detection by Geometry-Aware Prompt and Synergistic View Representation Learning
by: Deng, Zehao, et al.
Published: (2026)
ZeroComp: Zero-shot Object Compositing from Image Intrinsics via Diffusion
by: Zhang, Zitian, et al.
Published: (2024)
GlocalCLIP: Object-agnostic Global-Local Prompt Learning for Zero-shot Anomaly Detection
by: Ham, Jiyul, et al.
Published: (2024)
Zero-shot Action Localization via the Confidence of Large Vision-Language Models
by: Aklilu, Josiah, et al.
Published: (2024)
CLIP-AD: A Language-Guided Staged Dual-Path Model for Zero-shot Anomaly Detection
by: Chen, Xuhai, et al.
Published: (2023)
Language-Inspired Relation Transfer for Few-shot Class-Incremental Learning
by: Zhao, Yifan, et al.
Published: (2025)
InteractAnything: Zero-shot Human Object Interaction Synthesis via LLM Feedback and Object Affordance Parsing
by: Zhang, Jinlu, et al.
Published: (2025)
Agent3D-Zero: An Agent for Zero-shot 3D Understanding
by: Zhang, Sha, et al.
Published: (2024)
Parameterized Prompt for Incremental Object Detection
by: An, Zijia, et al.
Published: (2025)
BackdoorIDS: Zero-shot Backdoor Detection for Pretrained Vision Encoder
by: Huang, Siquan, et al.
Published: (2026)
Generalizable Two-Branch Framework for Image Class-Incremental Learning
by: Wu, Chao, et al.
Published: (2024)
Dynamic Object Queries for Transformer-based Incremental Object Detection
by: Zhang, Jichuan, et al.
Published: (2024)
MAKE: Multi-Aspect Knowledge-Enhanced Vision-Language Pretraining for Zero-shot Dermatological Assessment
by: Yan, Siyuan, et al.
Published: (2025)
VoroNav: Voronoi-based Zero-shot Object Navigation with Large Language Model
by: Wu, Pengying, et al.
Published: (2024)
YOLO-IOD: Towards Real Time Incremental Object Detection
by: Zhang, Shizhou, et al.
Published: (2025)
Adaptive Multi-source Predictor for Zero-shot Video Object Segmentation
by: Zhao, Xiaoqi, et al.
Published: (2023)
DCA: Dividing and Conquering Amnesia in Incremental Object Detection
by: Zhang, Aoting, et al.
Published: (2025)
GRSDet: Learning to Generate Local Reverse Samples for Few-shot Object Detection
by: Mei, Hefei, et al.
Published: (2023)