:: Library Catalog

Bibliographic Details
Main Authors:	Li, Zheng; Song, Yibing; Cheng, Ming-Ming; Li, Xiang; Yang, Jian
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2412.09442

Similar Items

AnchorOPT: Towards Optimizing Dynamic Anchors for Adaptive Prompt Learning
by: Li, Zheng, et al.
Published: (2025)

Generating Attribute-Aware Human Motions from Textual Prompt
by: Wang, Xinghan, et al.
Published: (2025)

RSAR: Restricted State Angle Resolver and Rotated SAR Benchmark
by: Zhang, Xin, et al.
Published: (2025)

Unifying Heterogeneous Multi-Modal Remote Sensing Detection Via Language-Pivoted Pretraining
by: Li, Yuxuan, et al.
Published: (2026)

From Prompt to Progression: Taming Video Diffusion Models for Seamless Attribute Transition
by: Lo, Ling, et al.
Published: (2025)

ViTA-PAR: Visual and Textual Attribute Alignment with Attribute Prompting for Pedestrian Attribute Recognition
by: Park, Minjeong, et al.
Published: (2025)

Cascade Prompt Learning for Vision-Language Model Adaptation
by: Wu, Ge, et al.
Published: (2024)

Visual Instruction Pretraining for Domain-Specific Foundation Models
by: Li, Yuxuan, et al.
Published: (2025)

PromptKD: Unsupervised Prompt Distillation for Vision-Language Models
by: Li, Zheng, et al.
Published: (2024)

StyleDiffusion: Prompt-Embedding Inversion for Text-Based Editing
by: Li, Senmao, et al.
Published: (2023)

DISTA-Net: Dynamic Closely-Spaced Infrared Small Target Unmixing
by: Han, Shengdong, et al.
Published: (2025)

CLIP Model for Images to Textual Prompts Based on Top-k Neighbors
by: Zhang, Xin, et al.
Published: (2024)

Step-wise Distribution Alignment Guided Style Prompt Tuning for Source-free Cross-domain Few-shot Learning
by: Xu, Huali, et al.
Published: (2024)

Revisiting Data Challenges of Computational Pathology: A Pack-based Multiple Instance Learning Training Framework
by: Tang, Wenhao, et al.
Published: (2025)

Prompt-driven Transferable Adversarial Attack on Person Re-Identification with Attribute-aware Textual Inversion
by: Bian, Yuan, et al.
Published: (2025)

SM3Det: A Unified Model for Multi-Modal Remote Sensing Object Detection
by: Li, Yuxuan, et al.
Published: (2024)

Multi-Token Enhancing for Vision Representation Learning
by: Li, Zhong-Yu, et al.
Published: (2024)

Zone Evaluation: Revealing Spatial Bias in Object Detection
by: Zheng, Zhaohui, et al.
Published: (2023)

CrossKD: Cross-Head Knowledge Distillation for Object Detection
by: Wang, Jiabao, et al.
Published: (2023)

Towards Universal Video MLLMs with Attribute-Structured and Quality-Verified Instructions
by: Li, Yunheng, et al.
Published: (2026)

HazyDet: Open-Source Benchmark for Drone-View Object Detection with Depth-Cues in Hazy Scenes
by: Feng, Changfeng, et al.
Published: (2024)

Re-Aligning Language to Visual Objects with an Agentic Workflow
by: Chen, Yuming, et al.
Published: (2025)

Visual Textualization for Image Prompted Object Detection
by: Wu, Yongjian, et al.
Published: (2025)

PRISM: A Framework Harnessing Unsupervised Visual Representations and Textual Prompts for Explainable MACE Survival Prediction from Cardiac Cine MRI
by: Su, Haoyang, et al.
Published: (2025)

Medal S: Spatio-Textual Prompt Model for Medical Segmentation
by: Shi, Pengcheng, et al.
Published: (2025)

Strip R-CNN: Large Strip Convolution for Remote Sensing Object Detection
by: Yuan, Xinbin, et al.
Published: (2025)

LSKNet: A Foundation Lightweight Backbone for Remote Sensing
by: Li, Yuxuan, et al.
Published: (2024)

E2MPL: An Enduring and Efficient Meta Prompt Learning Framework for Few-shot Unsupervised Domain Adaptation
by: Yang, Wanqi, et al.
Published: (2024)

Anchor Token Matching: Implicit Structure Locking for Training-free AR Image Editing
by: Hu, Taihang, et al.
Published: (2025)

WOW-Seg: A Word-free Open World Segmentation Model
by: Li, Danyang, et al.
Published: (2026)

Revisiting End-to-End Learning with Slide-level Supervision in Computational Pathology
by: Tang, Wenhao, et al.
Published: (2025)

YOLO-MS: Rethinking Multi-Scale Representation Learning for Real-time Object Detection
by: Chen, Yuming, et al.
Published: (2023)

ProEdit: Inversion-based Editing From Prompts Done Right
by: Ouyang, Zhi, et al.
Published: (2025)

A Literature Review of Literature Reviews in Pattern Analysis and Machine Intelligence
by: Zhao, Penghai, et al.
Published: (2024)

VideoAVE: A Multi-Attribute Video-to-Text Attribute Value Extraction Dataset and Benchmark Models
by: Cheng, Ming, et al.
Published: (2025)

Bringing Textual Prompt to AI-Generated Image Quality Assessment
by: Qu, Bowen, et al.
Published: (2024)

One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt
by: Liu, Tao, et al.
Published: (2025)

DFormer: Rethinking RGBD Representation Learning for Semantic Segmentation
by: Yin, Bowen, et al.
Published: (2023)

Enhancing Representations through Heterogeneous Self-Supervised Learning
by: Li, Zhong-Yu, et al.
Published: (2023)

A Simple Detector with Frame Dynamics is a Strong Tracker
by: Peng, Chenxu, et al.
Published: (2025)