Saved in:
| Main Authors: | Shi, Senyuan, Tan, Hao, Tan, Zichang, Feng, Shuhan, Liu, Ajian, Escalera, Sergio, Wan, Jun |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.26421 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Veritas: Generalizable Deepfake Detection via Pattern-Aware Reasoning
by: Tan, Hao, et al.
Published: (2025)
by: Tan, Hao, et al.
Published: (2025)
Recover and Match: Open-Vocabulary Multi-Label Recognition through Knowledge-Constrained Optimal Transport
by: Tan, Hao, et al.
Published: (2025)
by: Tan, Hao, et al.
Published: (2025)
VideoVeritas: AI-Generated Video Detection via Perception Pretext Reinforcement Learning
by: Tan, Hao, et al.
Published: (2026)
by: Tan, Hao, et al.
Published: (2026)
PVLR: Prompt-driven Visual-Linguistic Representation Learning for Multi-Label Image Recognition
by: Tan, Hao, et al.
Published: (2024)
by: Tan, Hao, et al.
Published: (2024)
SSPA: Split-and-Synthesize Prompting with Gated Alignments for Multi-Label Image Recognition
by: Tan, Hao, et al.
Published: (2024)
by: Tan, Hao, et al.
Published: (2024)
CFPL-FAS: Class Free Prompt Learning for Generalizable Face Anti-spoofing
by: Liu, Ajian, et al.
Published: (2024)
by: Liu, Ajian, et al.
Published: (2024)
Mixture-of-Attack-Experts with Class Regularization for Unified Physical-Digital Face Attack Detection
by: Chen, Shunxin, et al.
Published: (2025)
by: Chen, Shunxin, et al.
Published: (2025)
Towards Generalizable AI-Generated Image Detection via Image-Adaptive Prompt Learning
by: Li, Yiheng, et al.
Published: (2025)
by: Li, Yiheng, et al.
Published: (2025)
Training-Free Unsupervised Prompt for Vision-Language Models
by: Long, Sifan, et al.
Published: (2024)
by: Long, Sifan, et al.
Published: (2024)
RVLF: A Reinforcing Vision-Language Framework for Gloss-Free Sign Language Translation
by: Rao, Zhi, et al.
Published: (2025)
by: Rao, Zhi, et al.
Published: (2025)
Unified Physical-Digital Attack Detection Challenge
by: Yuan, Haocheng, et al.
Published: (2024)
by: Yuan, Haocheng, et al.
Published: (2024)
Unified Physical-Digital Face Attack Detection
by: Fang, Hao, et al.
Published: (2024)
by: Fang, Hao, et al.
Published: (2024)
A Transformer Model for Boundary Detection in Continuous Sign Language
by: Rastgoo, Razieh, et al.
Published: (2024)
by: Rastgoo, Razieh, et al.
Published: (2024)
Interactive Post-Training for Vision-Language-Action Models
by: Tan, Shuhan, et al.
Published: (2025)
by: Tan, Shuhan, et al.
Published: (2025)
AGC: Adaptive Geodesic Correction for Adversarial Robustness on Vision-Language Models
by: Li, Zhiwei, et al.
Published: (2026)
by: Li, Zhiwei, et al.
Published: (2026)
NCL++: Nested Collaborative Learning for Long-Tailed Visual Recognition
by: Tan, Zichang, et al.
Published: (2023)
by: Tan, Zichang, et al.
Published: (2023)
Reduce the Artifacts Bias for More Generalizable AI-Generated Image Detection
by: Li, Yiheng, et al.
Published: (2026)
by: Li, Yiheng, et al.
Published: (2026)
CTForensics: A Comprehensive Dataset and Method for AI-Generated CT Image Detection
by: Li, Yiheng, et al.
Published: (2026)
by: Li, Yiheng, et al.
Published: (2026)
PrismVAU: Prompt-Refined Inference System for Multimodal Video Anomaly Understanding
by: Erregue, Iñaki, et al.
Published: (2026)
by: Erregue, Iñaki, et al.
Published: (2026)
L-SWAG: Layer-Sample Wise Activation with Gradients information for Zero-Shot NAS on Vision Transformers
by: Casarin, Sofia, et al.
Published: (2025)
by: Casarin, Sofia, et al.
Published: (2025)
FA^{3}-CLIP: Frequency-Aware Cues Fusion and Attack-Agnostic Prompt Learning for Unified Face Attack Detection
by: Li, Yongze, et al.
Published: (2025)
by: Li, Yongze, et al.
Published: (2025)
Multi-Turn Adaptive Prompting Attack on Large Vision-Language Models
by: Choi, In Chong, et al.
Published: (2026)
by: Choi, In Chong, et al.
Published: (2026)
Cross-Image Contrastive Decoding: Precise, Lossless Suppression of Language Priors in Large Vision-Language Models
by: Zhao, Jianfei, et al.
Published: (2025)
by: Zhao, Jianfei, et al.
Published: (2025)
MLLM-Enhanced Face Forgery Detection: A Vision-Language Fusion Solution
by: Peng, Siran, et al.
Published: (2025)
by: Peng, Siran, et al.
Published: (2025)
La-SoftMoE CLIP for Unified Physical-Digital Face Attack Detection
by: Zou, Hang, et al.
Published: (2024)
by: Zou, Hang, et al.
Published: (2024)
SOVABench: A Vehicle Surveillance Action Retrieval Benchmark for Multimodal Large Language Models
by: Rabasseda, Oriol, et al.
Published: (2026)
by: Rabasseda, Oriol, et al.
Published: (2026)
Quantized Prompt for Efficient Generalization of Vision-Language Models
by: Hao, Tianxiang, et al.
Published: (2024)
by: Hao, Tianxiang, et al.
Published: (2024)
Benchmarking Unified Face Attack Detection via Hierarchical Prompt Tuning
by: Liu, Ajian, et al.
Published: (2025)
by: Liu, Ajian, et al.
Published: (2025)
Harnessing the Power of Large Vision Language Models for Synthetic Image Detection
by: Keita, Mamadou, et al.
Published: (2024)
by: Keita, Mamadou, et al.
Published: (2024)
Human-Free Automated Prompting for Vision-Language Anomaly Detection: Prompt Optimization with Meta-guiding Prompt Scheme
by: Chen, Pi-Wei, et al.
Published: (2024)
by: Chen, Pi-Wei, et al.
Published: (2024)
BEVSpread: Spread Voxel Pooling for Bird's-Eye-View Representation in Vision-based Roadside 3D Object Detection
by: Wang, Wenjie, et al.
Published: (2024)
by: Wang, Wenjie, et al.
Published: (2024)
Enhancing Zero-Shot Vision Models by Label-Free Prompt Distribution Learning and Bias Correcting
by: Zhu, Xingyu, et al.
Published: (2024)
by: Zhu, Xingyu, et al.
Published: (2024)
FA: Forced Prompt Learning of Vision-Language Models for Out-of-Distribution Detection
by: Lu, Xinhua, et al.
Published: (2025)
by: Lu, Xinhua, et al.
Published: (2025)
Hydra: Computer Vision for Data Quality Monitoring
by: Britton, Thomas, et al.
Published: (2024)
by: Britton, Thomas, et al.
Published: (2024)
What Matters in Virtual Try-Off? Dual-UNet Diffusion Model For Garment Reconstruction
by: Truong, Loc-Phat, et al.
Published: (2026)
by: Truong, Loc-Phat, et al.
Published: (2026)
Mamba as a Bridge: Where Vision Foundation Models Meet Vision Language Models for Domain-Generalized Semantic Segmentation
by: Zhang, Xin, et al.
Published: (2025)
by: Zhang, Xin, et al.
Published: (2025)
Data Adaptive Traceback for Vision-Language Foundation Models in Image Classification
by: Peng, Wenshuo, et al.
Published: (2024)
by: Peng, Wenshuo, et al.
Published: (2024)
Fake-HR1: Rethinking Reasoning of Vision Language Model for Synthetic Image Detection
by: Jiang, Changjiang, et al.
Published: (2026)
by: Jiang, Changjiang, et al.
Published: (2026)
SEEC: Segmentation-Assisted Multi-Entropy Models for Learned Lossless Image Compression
by: Zheng, Chunhang, et al.
Published: (2025)
by: Zheng, Chunhang, et al.
Published: (2025)
Long-term Traffic Simulation with Interleaved Autoregressive Motion and Scenario Generation
by: Yang, Xiuyu, et al.
Published: (2025)
by: Yang, Xiuyu, et al.
Published: (2025)
Similar Items
-
Veritas: Generalizable Deepfake Detection via Pattern-Aware Reasoning
by: Tan, Hao, et al.
Published: (2025) -
Recover and Match: Open-Vocabulary Multi-Label Recognition through Knowledge-Constrained Optimal Transport
by: Tan, Hao, et al.
Published: (2025) -
VideoVeritas: AI-Generated Video Detection via Perception Pretext Reinforcement Learning
by: Tan, Hao, et al.
Published: (2026) -
PVLR: Prompt-driven Visual-Linguistic Representation Learning for Multi-Label Image Recognition
by: Tan, Hao, et al.
Published: (2024) -
SSPA: Split-and-Synthesize Prompting with Gated Alignments for Multi-Label Image Recognition
by: Tan, Hao, et al.
Published: (2024)