:: Library Catalog

Bibliographic Details
Main Authors:	Luo, Zengli; Zhang, Canlong; Lu, Xiaochun; Li, Zhixin
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2509.16674

Similar Items

Uncertainty-Aware Prototype Semantic Decoupling for Text-Based Person Search in Full Images
by: Luo, Zengli, et al.
Published: (2025)

Adaptive Detector-Verifier Framework for Zero-Shot Polyp Detection in Open-World Settings
by: Xu, Shengkai, et al.
Published: (2025)

Complementary Text-Guided Attention for Zero-Shot Adversarial Robustness
by: Yu, Lu, et al.
Published: (2026)

Zero-Shot Co-salient Object Detection Framework
by: Xiao, Haoke, et al.
Published: (2023)

Text-guided Zero-Shot Object Localization
by: Wang, Jingjing, et al.
Published: (2024)

A Simple Framework for Open-Vocabulary Zero-Shot Segmentation
by: Stegmüller, Thomas, et al.
Published: (2024)

SinSEMI: A One-Shot Image Generation Model and Data-Efficient Evaluation Framework for Semiconductor Inspection Equipment
by: Wu, ChunLiang, et al.
Published: (2025)

Zero Shot Composed Image Retrieval
by: Kakarla, Santhosh, et al.
Published: (2025)

ProTA: Probabilistic Token Aggregation for Text-Video Retrieval
by: Fang, Han, et al.
Published: (2024)

STiTch: Semantic Transition and Transportation in Collaboration for Training-Free Zero-Shot Composed Image Retrieval
by: Li, Miaoge, et al.
Published: (2026)

Zero-Shot Skeleton-based Action Recognition with Dual Visual-Text Alignment
by: Kuang, Jidong, et al.
Published: (2024)

MRAD: Zero-Shot Anomaly Detection with Memory-Driven Retrieval
by: Xu, Chaoran, et al.
Published: (2026)

Text as Any-Modality for Zero-Shot Classification by Consistent Prompt Tuning
by: Wu, Xiangyu, et al.
Published: (2025)

Pseudo-label Based Domain Adaptation for Zero-Shot Text Steganalysis
by: Luo, Yufei, et al.
Published: (2024)

Unified Framework for Open-World Compositional Zero-shot Learning
by: Jayasekara, Hirunima, et al.
Published: (2024)

An Information Compensation Framework for Zero-Shot Skeleton-based Action Recognition
by: Xu, Haojun, et al.
Published: (2024)

Zero-shot Composed Text-Image Retrieval
by: Liu, Yikun, et al.
Published: (2023)

Zero-Shot Chinese Character Recognition with Hierarchical Multi-Granularity Image-Text Aligning
by: Zhu, Yinglian, et al.
Published: (2025)

GranAlign: Granularity-Aware Alignment Framework for Zero-Shot Video Moment Retrieval
by: Jeon, Mingyu, et al.
Published: (2026)

Q2E: Query-to-Event Decomposition for Zero-Shot Multilingual Text-to-Video Retrieval
by: Dipta, Shubhashis Roy, et al.
Published: (2025)

Missing Target-Relevant Information Prediction with World Model for Accurate Zero-Shot Composed Image Retrieval
by: Tang, Yuanmin, et al.
Published: (2025)

Fine-Grained Zero-Shot Composed Image Retrieval with Complementary Visual-Semantic Integration
by: Ye, Yongcong, et al.
Published: (2026)

Attention Based Simple Primitives for Open World Compositional Zero-Shot Learning
by: Munir, Ans, et al.
Published: (2024)

Generative Editing in the Joint Vision-Language Space for Zero-Shot Composed Image Retrieval
by: Wang, Xin, et al.
Published: (2025)

TINA: Think, Interaction, and Action Framework for Zero-Shot Vision Language Navigation
by: Li, Dingbang, et al.
Published: (2024)

Synthetic Captions for Open-Vocabulary Zero-Shot Segmentation
by: Lebailly, Tim, et al.
Published: (2025)

TrajRAG: Retrieving Geometric-Semantic Experience for Zero-Shot Object Navigation
by: Wang, Yiyao, et al.
Published: (2026)

RSVG-ZeroOV: Exploring a Training-Free Framework for Zero-Shot Open-Vocabulary Visual Grounding in Remote Sensing Images
by: Li, Ke, et al.
Published: (2025)

Zero-Shot Dual-Path Integration Framework for Open-Vocabulary 3D Instance Segmentation
by: Ton, Tri, et al.
Published: (2024)

Context-aware TFL: A Universal Context-aware Contrastive Learning Framework for Temporal Forgery Localization
by: Yin, Qilin, et al.
Published: (2025)

Zero-Shot Head Swapping in Real-World Scenarios
by: Kang, Taewoong, et al.
Published: (2025)

EZSR: Event-based Zero-Shot Recognition
by: Yang, Yan, et al.
Published: (2024)

EagleNet: Energy-Aware Fine-Grained Relationship Learning Network for Text-Video Retrieval
by: Chen, Yuhan, et al.
Published: (2026)

A Pedestrian-Vehicle Interaction Benchmark and Annotation Framework for Unstructured Scenes via Uncalibrated Cameras
by: Peng, Haoyang, et al.
Published: (2026)

TikZero: Zero-Shot Text-Guided Graphics Program Synthesis
by: Belouadi, Jonas, et al.
Published: (2025)

Open-Pose 3D Zero-Shot Learning: Benchmark and Challenges
by: Zhao, Weiguang, et al.
Published: (2023)

ZeroHSI: Zero-Shot 4D Human-Scene Interaction by Video Generation
by: Li, Hongjie, et al.
Published: (2024)

Text-Guided Attention is All You Need for Zero-Shot Robustness in Vision-Language Models
by: Yu, Lu, et al.
Published: (2024)

Fine-Grained Zero-Shot Object Detection
by: Ma, Hongxu, et al.
Published: (2025)

Enhancing Zero-Shot Pedestrian Attribute Recognition with Synthetic Data Generation: A Comparative Study with Image-To-Image Diffusion Models
by: Ayuso-Albizu, Pablo, et al.
Published: (2025)