Guardado en:
| Autores principales: | Tsao, Hsi-Ai, Hsiung, Lei, Chen, Pin-Yu, Ho, Tsung-Yi |
|---|---|
| Formato: | Preprint |
| Publicado: |
2024
|
| Materias: | |
| Acceso en línea: | https://arxiv.org/abs/2409.01821 |
| Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
Ejemplares similares
AutoVP: An Automated Visual Prompting Framework and Benchmark
por: Tsao, Hsi-Ai, et al.
Publicado: (2023)
por: Tsao, Hsi-Ai, et al.
Publicado: (2023)
RIGID: A Training-free and Model-Agnostic Framework for Robust AI-Generated Image Detection
por: He, Zhiyuan, et al.
Publicado: (2024)
por: He, Zhiyuan, et al.
Publicado: (2024)
NeuralFuse: Learning to Recover the Accuracy of Access-Limited Neural Network Inference in Low-Voltage Regimes
por: Sun, Hao-Lun, et al.
Publicado: (2023)
por: Sun, Hao-Lun, et al.
Publicado: (2023)
Modular Prompt Learning Improves Vision-Language Models
por: Huang, Zhenhan, et al.
Publicado: (2025)
por: Huang, Zhenhan, et al.
Publicado: (2025)
VillanDiffusion: A Unified Backdoor Attack Framework for Diffusion Models
por: Chou, Sheng-Yen, et al.
Publicado: (2023)
por: Chou, Sheng-Yen, et al.
Publicado: (2023)
Differentiable Prompt Learning for Vision Language Models
por: Huang, Zhenhan, et al.
Publicado: (2024)
por: Huang, Zhenhan, et al.
Publicado: (2024)
Visual Prompt Engineering for Vision Language Models in Radiology
por: Denner, Stefan, et al.
Publicado: (2024)
por: Denner, Stefan, et al.
Publicado: (2024)
Concept-Guided Prompt Learning for Generalization in Vision-Language Models
por: Zhang, Yi, et al.
Publicado: (2024)
por: Zhang, Yi, et al.
Publicado: (2024)
When Visual Evidence is Ambiguous: Pareidolia as a Diagnostic Probe for Vision Models
por: Chen, Qianpu, et al.
Publicado: (2026)
por: Chen, Qianpu, et al.
Publicado: (2026)
GAPrompt: Geometry-Aware Point Cloud Prompt for 3D Vision Model
por: Ai, Zixiang, et al.
Publicado: (2025)
por: Ai, Zixiang, et al.
Publicado: (2025)
Structural Graph Probing of Vision-Language Models
por: He, Haoyu, et al.
Publicado: (2026)
por: He, Haoyu, et al.
Publicado: (2026)
Mixture of Prompt Learning for Vision Language Models
por: Du, Yu, et al.
Publicado: (2024)
por: Du, Yu, et al.
Publicado: (2024)
When Does Pruning Benefit Vision Representations?
por: Cassano, Enrico, et al.
Publicado: (2025)
por: Cassano, Enrico, et al.
Publicado: (2025)
Reasoning or Pattern Matching? Probing Large Vision-Language Models with Visual Puzzles
por: Lymperaiou, Maria, et al.
Publicado: (2026)
por: Lymperaiou, Maria, et al.
Publicado: (2026)
Guiding Medical Vision-Language Models with Explicit Visual Prompts: Framework Design and Comprehensive Exploration of Prompt Variations
por: Zhu, Kangyu, et al.
Publicado: (2025)
por: Zhu, Kangyu, et al.
Publicado: (2025)
FedMVP: Federated Multimodal Visual Prompt Tuning for Vision-Language Models
por: Singha, Mainak, et al.
Publicado: (2025)
por: Singha, Mainak, et al.
Publicado: (2025)
VIP: Visual-guided Prompt Evolution for Efficient Dense Vision-Language Inference
por: Zhu, Hao, et al.
Publicado: (2026)
por: Zhu, Hao, et al.
Publicado: (2026)
AntifakePrompt: Prompt-Tuned Vision-Language Models are Fake Image Detectors
por: Chang, You-Ming, et al.
Publicado: (2023)
por: Chang, You-Ming, et al.
Publicado: (2023)
When the Prompt Becomes Visual: Vision-Centric Jailbreak Attacks for Large Image Editing Models
por: Hou, Jiacheng, et al.
Publicado: (2026)
por: Hou, Jiacheng, et al.
Publicado: (2026)
VisCoP: Visual Probing for Video Domain Adaptation of Vision Language Models
por: Reilly, Dominick, et al.
Publicado: (2025)
por: Reilly, Dominick, et al.
Publicado: (2025)
VPTracker: Global Vision-Language Tracking via Visual Prompt
por: Wang, Jingchao, et al.
Publicado: (2025)
por: Wang, Jingchao, et al.
Publicado: (2025)
When Visuals Aren't the Problem: Evaluating Vision-Language Models on Misleading Data Visualizations
por: Lalai, Harsh Nishant, et al.
Publicado: (2026)
por: Lalai, Harsh Nishant, et al.
Publicado: (2026)
Probing Visual Concepts in Lightweight Vision-Language Models for Automated Driving
por: Theodoridis, Nikos, et al.
Publicado: (2026)
por: Theodoridis, Nikos, et al.
Publicado: (2026)
Attention Prompting on Image for Large Vision-Language Models
por: Yu, Runpeng, et al.
Publicado: (2024)
por: Yu, Runpeng, et al.
Publicado: (2024)
Vision Graph Prompting via Semantic Low-Rank Decomposition
por: Ai, Zixiang, et al.
Publicado: (2025)
por: Ai, Zixiang, et al.
Publicado: (2025)
Modeling Variants of Prompts for Vision-Language Models
por: Li, Ao, et al.
Publicado: (2025)
por: Li, Ao, et al.
Publicado: (2025)
MetaTPT: Meta Test-time Prompt Tuning for Vision-Language Models
por: Lei, Yuqing, et al.
Publicado: (2025)
por: Lei, Yuqing, et al.
Publicado: (2025)
PromptKD: Unsupervised Prompt Distillation for Vision-Language Models
por: Li, Zheng, et al.
Publicado: (2024)
por: Li, Zheng, et al.
Publicado: (2024)
Unsupervised Out-of-Distribution Detection in Medical Imaging Using Multi-Exit Class Activation Maps and Feature Masking
por: Chen, Yu-Jen, et al.
Publicado: (2025)
por: Chen, Yu-Jen, et al.
Publicado: (2025)
VP-NTK: Exploring the Benefits of Visual Prompting in Differentially Private Data Synthesis
por: Hsu, Chia-Yi, et al.
Publicado: (2025)
por: Hsu, Chia-Yi, et al.
Publicado: (2025)
SEP: Self-Enhanced Prompt Tuning for Visual-Language Model
por: Yao, Hantao, et al.
Publicado: (2024)
por: Yao, Hantao, et al.
Publicado: (2024)
Contextualized Visual Personalization in Vision-Language Models
por: Oh, Yeongtak, et al.
Publicado: (2026)
por: Oh, Yeongtak, et al.
Publicado: (2026)
Prompting4Debugging: Red-Teaming Text-to-Image Diffusion Models by Finding Problematic Prompts
por: Chin, Zhi-Yi, et al.
Publicado: (2023)
por: Chin, Zhi-Yi, et al.
Publicado: (2023)
Where Does Vision Meet Language? Understanding and Refining Visual Fusion in MLLMs via Contrastive Attention
por: Song, Shezheng, et al.
Publicado: (2026)
por: Song, Shezheng, et al.
Publicado: (2026)
Adversarial Prompt Distillation for Vision-Language Models
por: Luo, Lin, et al.
Publicado: (2024)
por: Luo, Lin, et al.
Publicado: (2024)
Active Prompt Learning in Vision Language Models
por: Bang, Jihwan, et al.
Publicado: (2023)
por: Bang, Jihwan, et al.
Publicado: (2023)
In the Era of Prompt Learning with Vision-Language Models
por: Jha, Ankit
Publicado: (2024)
por: Jha, Ankit
Publicado: (2024)
Revisiting Prompt Pretraining of Vision-Language Models
por: Chen, Zhenyuan, et al.
Publicado: (2024)
por: Chen, Zhenyuan, et al.
Publicado: (2024)
Generalizable Prompt Tuning for Vision-Language Models
por: Zhang, Qian
Publicado: (2024)
por: Zhang, Qian
Publicado: (2024)
Puzzled by Puzzles: When Vision-Language Models Can't Take a Hint
por: Lee, Heekyung, et al.
Publicado: (2025)
por: Lee, Heekyung, et al.
Publicado: (2025)
Ejemplares similares
-
AutoVP: An Automated Visual Prompting Framework and Benchmark
por: Tsao, Hsi-Ai, et al.
Publicado: (2023) -
RIGID: A Training-free and Model-Agnostic Framework for Robust AI-Generated Image Detection
por: He, Zhiyuan, et al.
Publicado: (2024) -
NeuralFuse: Learning to Recover the Accuracy of Access-Limited Neural Network Inference in Low-Voltage Regimes
por: Sun, Hao-Lun, et al.
Publicado: (2023) -
Modular Prompt Learning Improves Vision-Language Models
por: Huang, Zhenhan, et al.
Publicado: (2025) -
VillanDiffusion: A Unified Backdoor Attack Framework for Diffusion Models
por: Chou, Sheng-Yen, et al.
Publicado: (2023)