| Main Authors: | Luo, Zengli, Zhang, Canlong, Lu, Xiaochun, Li, Zhixin |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2509.16674 |
Similar Items
Uncertainty-Aware Prototype Semantic Decoupling for Text-Based Person Search in Full Images
by: Luo, Zengli, et al.
Published: (2025)
Adaptive Detector-Verifier Framework for Zero-Shot Polyp Detection in Open-World Settings
by: Xu, Shengkai, et al.
Published: (2025)
Complementary Text-Guided Attention for Zero-Shot Adversarial Robustness
by: Yu, Lu, et al.
Published: (2026)
Zero-Shot Co-salient Object Detection Framework
by: Xiao, Haoke, et al.
Published: (2023)
Text-guided Zero-Shot Object Localization
by: Wang, Jingjing, et al.
Published: (2024)
A Simple Framework for Open-Vocabulary Zero-Shot Segmentation
by: Stegmüller, Thomas, et al.
Published: (2024)
SinSEMI: A One-Shot Image Generation Model and Data-Efficient Evaluation Framework for Semiconductor Inspection Equipment
by: Wu, ChunLiang, et al.
Published: (2025)
Zero Shot Composed Image Retrieval
by: Kakarla, Santhosh, et al.
Published: (2025)
ProTA: Probabilistic Token Aggregation for Text-Video Retrieval
by: Fang, Han, et al.
Published: (2024)
STiTch: Semantic Transition and Transportation in Collaboration for Training-Free Zero-Shot Composed Image Retrieval
by: Li, Miaoge, et al.
Published: (2026)
Zero-Shot Skeleton-based Action Recognition with Dual Visual-Text Alignment
by: Kuang, Jidong, et al.
Published: (2024)
MRAD: Zero-Shot Anomaly Detection with Memory-Driven Retrieval
by: Xu, Chaoran, et al.
Published: (2026)
Text as Any-Modality for Zero-Shot Classification by Consistent Prompt Tuning
by: Wu, Xiangyu, et al.
Published: (2025)
Pseudo-label Based Domain Adaptation for Zero-Shot Text Steganalysis
by: Luo, Yufei, et al.
Published: (2024)
Unified Framework for Open-World Compositional Zero-shot Learning
by: Jayasekara, Hirunima, et al.
Published: (2024)
An Information Compensation Framework for Zero-Shot Skeleton-based Action Recognition
by: Xu, Haojun, et al.
Published: (2024)
Zero-shot Composed Text-Image Retrieval
by: Liu, Yikun, et al.
Published: (2023)
Zero-Shot Chinese Character Recognition with Hierarchical Multi-Granularity Image-Text Aligning
by: Zhu, Yinglian, et al.
Published: (2025)
GranAlign: Granularity-Aware Alignment Framework for Zero-Shot Video Moment Retrieval
by: Jeon, Mingyu, et al.
Published: (2026)
Q2E: Query-to-Event Decomposition for Zero-Shot Multilingual Text-to-Video Retrieval
by: Dipta, Shubhashis Roy, et al.
Published: (2025)
Missing Target-Relevant Information Prediction with World Model for Accurate Zero-Shot Composed Image Retrieval
by: Tang, Yuanmin, et al.
Published: (2025)
Fine-Grained Zero-Shot Composed Image Retrieval with Complementary Visual-Semantic Integration
by: Ye, Yongcong, et al.
Published: (2026)
Attention Based Simple Primitives for Open World Compositional Zero-Shot Learning
by: Munir, Ans, et al.
Published: (2024)
Generative Editing in the Joint Vision-Language Space for Zero-Shot Composed Image Retrieval
by: Wang, Xin, et al.
Published: (2025)
TINA: Think, Interaction, and Action Framework for Zero-Shot Vision Language Navigation
by: Li, Dingbang, et al.
Published: (2024)
Synthetic Captions for Open-Vocabulary Zero-Shot Segmentation
by: Lebailly, Tim, et al.
Published: (2025)
TrajRAG: Retrieving Geometric-Semantic Experience for Zero-Shot Object Navigation
by: Wang, Yiyao, et al.
Published: (2026)
RSVG-ZeroOV: Exploring a Training-Free Framework for Zero-Shot Open-Vocabulary Visual Grounding in Remote Sensing Images
by: Li, Ke, et al.
Published: (2025)
Zero-Shot Dual-Path Integration Framework for Open-Vocabulary 3D Instance Segmentation
by: Ton, Tri, et al.
Published: (2024)
Context-aware TFL: A Universal Context-aware Contrastive Learning Framework for Temporal Forgery Localization
by: Yin, Qilin, et al.
Published: (2025)
Zero-Shot Head Swapping in Real-World Scenarios
by: Kang, Taewoong, et al.
Published: (2025)
EZSR: Event-based Zero-Shot Recognition
by: Yang, Yan, et al.
Published: (2024)
EagleNet: Energy-Aware Fine-Grained Relationship Learning Network for Text-Video Retrieval
by: Chen, Yuhan, et al.
Published: (2026)
A Pedestrian-Vehicle Interaction Benchmark and Annotation Framework for Unstructured Scenes via Uncalibrated Cameras
by: Peng, Haoyang, et al.
Published: (2026)
TikZero: Zero-Shot Text-Guided Graphics Program Synthesis
by: Belouadi, Jonas, et al.
Published: (2025)
Open-Pose 3D Zero-Shot Learning: Benchmark and Challenges
by: Zhao, Weiguang, et al.
Published: (2023)
ZeroHSI: Zero-Shot 4D Human-Scene Interaction by Video Generation
by: Li, Hongjie, et al.
Published: (2024)
Text-Guided Attention is All You Need for Zero-Shot Robustness in Vision-Language Models
by: Yu, Lu, et al.
Published: (2024)
Fine-Grained Zero-Shot Object Detection
by: Ma, Hongxu, et al.
Published: (2025)
Enhancing Zero-Shot Pedestrian Attribute Recognition with Synthetic Data Generation: A Comparative Study with Image-To-Image Diffusion Models
by: Ayuso-Albizu, Pablo, et al.
Published: (2025)