| Main Authors: | Li, Zheng; Song, Yibing; Cheng, Ming-Ming; Li, Xiang; Yang, Jian |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Online Access: | https://arxiv.org/abs/2412.09442 |
Similar Items
AnchorOPT: Towards Optimizing Dynamic Anchors for Adaptive Prompt Learning
by: Li, Zheng, et al.
Published: (2025)
Generating Attribute-Aware Human Motions from Textual Prompt
by: Wang, Xinghan, et al.
Published: (2025)
RSAR: Restricted State Angle Resolver and Rotated SAR Benchmark
by: Zhang, Xin, et al.
Published: (2025)
Unifying Heterogeneous Multi-Modal Remote Sensing Detection Via Language-Pivoted Pretraining
by: Li, Yuxuan, et al.
Published: (2026)
From Prompt to Progression: Taming Video Diffusion Models for Seamless Attribute Transition
by: Lo, Ling, et al.
Published: (2025)
ViTA-PAR: Visual and Textual Attribute Alignment with Attribute Prompting for Pedestrian Attribute Recognition
by: Park, Minjeong, et al.
Published: (2025)
Cascade Prompt Learning for Vision-Language Model Adaptation
by: Wu, Ge, et al.
Published: (2024)
Visual Instruction Pretraining for Domain-Specific Foundation Models
by: Li, Yuxuan, et al.
Published: (2025)
PromptKD: Unsupervised Prompt Distillation for Vision-Language Models
by: Li, Zheng, et al.
Published: (2024)
StyleDiffusion: Prompt-Embedding Inversion for Text-Based Editing
by: Li, Senmao, et al.
Published: (2023)
DISTA-Net: Dynamic Closely-Spaced Infrared Small Target Unmixing
by: Han, Shengdong, et al.
Published: (2025)
CLIP Model for Images to Textual Prompts Based on Top-k Neighbors
by: Zhang, Xin, et al.
Published: (2024)
Step-wise Distribution Alignment Guided Style Prompt Tuning for Source-free Cross-domain Few-shot Learning
by: Xu, Huali, et al.
Published: (2024)
Revisiting Data Challenges of Computational Pathology: A Pack-based Multiple Instance Learning Training Framework
by: Tang, Wenhao, et al.
Published: (2025)
Prompt-driven Transferable Adversarial Attack on Person Re-Identification with Attribute-aware Textual Inversion
by: Bian, Yuan, et al.
Published: (2025)
SM3Det: A Unified Model for Multi-Modal Remote Sensing Object Detection
by: Li, Yuxuan, et al.
Published: (2024)
Multi-Token Enhancing for Vision Representation Learning
by: Li, Zhong-Yu, et al.
Published: (2024)
Zone Evaluation: Revealing Spatial Bias in Object Detection
by: Zheng, Zhaohui, et al.
Published: (2023)
CrossKD: Cross-Head Knowledge Distillation for Object Detection
by: Wang, Jiabao, et al.
Published: (2023)
Towards Universal Video MLLMs with Attribute-Structured and Quality-Verified Instructions
by: Li, Yunheng, et al.
Published: (2026)
HazyDet: Open-Source Benchmark for Drone-View Object Detection with Depth-Cues in Hazy Scenes
by: Feng, Changfeng, et al.
Published: (2024)
Re-Aligning Language to Visual Objects with an Agentic Workflow
by: Chen, Yuming, et al.
Published: (2025)
Visual Textualization for Image Prompted Object Detection
by: Wu, Yongjian, et al.
Published: (2025)
PRISM: A Framework Harnessing Unsupervised Visual Representations and Textual Prompts for Explainable MACE Survival Prediction from Cardiac Cine MRI
by: Su, Haoyang, et al.
Published: (2025)
Medal S: Spatio-Textual Prompt Model for Medical Segmentation
by: Shi, Pengcheng, et al.
Published: (2025)
Strip R-CNN: Large Strip Convolution for Remote Sensing Object Detection
by: Yuan, Xinbin, et al.
Published: (2025)
LSKNet: A Foundation Lightweight Backbone for Remote Sensing
by: Li, Yuxuan, et al.
Published: (2024)
E2MPL: An Enduring and Efficient Meta Prompt Learning Framework for Few-shot Unsupervised Domain Adaptation
by: Yang, Wanqi, et al.
Published: (2024)
Anchor Token Matching: Implicit Structure Locking for Training-free AR Image Editing
by: Hu, Taihang, et al.
Published: (2025)
WOW-Seg: A Word-free Open World Segmentation Model
by: Li, Danyang, et al.
Published: (2026)
Revisiting End-to-End Learning with Slide-level Supervision in Computational Pathology
by: Tang, Wenhao, et al.
Published: (2025)
YOLO-MS: Rethinking Multi-Scale Representation Learning for Real-time Object Detection
by: Chen, Yuming, et al.
Published: (2023)
ProEdit: Inversion-based Editing From Prompts Done Right
by: Ouyang, Zhi, et al.
Published: (2025)
A Literature Review of Literature Reviews in Pattern Analysis and Machine Intelligence
by: Zhao, Penghai, et al.
Published: (2024)
VideoAVE: A Multi-Attribute Video-to-Text Attribute Value Extraction Dataset and Benchmark Models
by: Cheng, Ming, et al.
Published: (2025)
Bringing Textual Prompt to AI-Generated Image Quality Assessment
by: Qu, Bowen, et al.
Published: (2024)
One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt
by: Liu, Tao, et al.
Published: (2025)
DFormer: Rethinking RGBD Representation Learning for Semantic Segmentation
by: Yin, Bowen, et al.
Published: (2023)
Enhancing Representations through Heterogeneous Self-Supervised Learning
by: Li, Zhong-Yu, et al.
Published: (2023)
A Simple Detector with Frame Dynamics is a Strong Tracker
by: Peng, Chenxu, et al.
Published: (2025)