Library Catalog

Bibliographic Details
Main Authors:	Shi, Senyuan; Tan, Hao; Tan, Zichang; Feng, Shuhan; Liu, Ajian; Escalera, Sergio; Wan, Jun
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2605.26421

Similar Items

Veritas: Generalizable Deepfake Detection via Pattern-Aware Reasoning
by: Tan, Hao, et al.
Published: (2025)

Recover and Match: Open-Vocabulary Multi-Label Recognition through Knowledge-Constrained Optimal Transport
by: Tan, Hao, et al.
Published: (2025)

VideoVeritas: AI-Generated Video Detection via Perception Pretext Reinforcement Learning
by: Tan, Hao, et al.
Published: (2026)

PVLR: Prompt-driven Visual-Linguistic Representation Learning for Multi-Label Image Recognition
by: Tan, Hao, et al.
Published: (2024)

SSPA: Split-and-Synthesize Prompting with Gated Alignments for Multi-Label Image Recognition
by: Tan, Hao, et al.
Published: (2024)

CFPL-FAS: Class Free Prompt Learning for Generalizable Face Anti-spoofing
by: Liu, Ajian, et al.
Published: (2024)

Mixture-of-Attack-Experts with Class Regularization for Unified Physical-Digital Face Attack Detection
by: Chen, Shunxin, et al.
Published: (2025)

Towards Generalizable AI-Generated Image Detection via Image-Adaptive Prompt Learning
by: Li, Yiheng, et al.
Published: (2025)

Training-Free Unsupervised Prompt for Vision-Language Models
by: Long, Sifan, et al.
Published: (2024)

RVLF: A Reinforcing Vision-Language Framework for Gloss-Free Sign Language Translation
by: Rao, Zhi, et al.
Published: (2025)

Unified Physical-Digital Attack Detection Challenge
by: Yuan, Haocheng, et al.
Published: (2024)

Unified Physical-Digital Face Attack Detection
by: Fang, Hao, et al.
Published: (2024)

A Transformer Model for Boundary Detection in Continuous Sign Language
by: Rastgoo, Razieh, et al.
Published: (2024)

Interactive Post-Training for Vision-Language-Action Models
by: Tan, Shuhan, et al.
Published: (2025)

AGC: Adaptive Geodesic Correction for Adversarial Robustness on Vision-Language Models
by: Li, Zhiwei, et al.
Published: (2026)

NCL++: Nested Collaborative Learning for Long-Tailed Visual Recognition
by: Tan, Zichang, et al.
Published: (2023)

Reduce the Artifacts Bias for More Generalizable AI-Generated Image Detection
by: Li, Yiheng, et al.
Published: (2026)

CTForensics: A Comprehensive Dataset and Method for AI-Generated CT Image Detection
by: Li, Yiheng, et al.
Published: (2026)

PrismVAU: Prompt-Refined Inference System for Multimodal Video Anomaly Understanding
by: Erregue, Iñaki, et al.
Published: (2026)

L-SWAG: Layer-Sample Wise Activation with Gradients information for Zero-Shot NAS on Vision Transformers
by: Casarin, Sofia, et al.
Published: (2025)

FA³-CLIP: Frequency-Aware Cues Fusion and Attack-Agnostic Prompt Learning for Unified Face Attack Detection
by: Li, Yongze, et al.
Published: (2025)

Multi-Turn Adaptive Prompting Attack on Large Vision-Language Models
by: Choi, In Chong, et al.
Published: (2026)

Cross-Image Contrastive Decoding: Precise, Lossless Suppression of Language Priors in Large Vision-Language Models
by: Zhao, Jianfei, et al.
Published: (2025)

MLLM-Enhanced Face Forgery Detection: A Vision-Language Fusion Solution
by: Peng, Siran, et al.
Published: (2025)

La-SoftMoE CLIP for Unified Physical-Digital Face Attack Detection
by: Zou, Hang, et al.
Published: (2024)

SOVABench: A Vehicle Surveillance Action Retrieval Benchmark for Multimodal Large Language Models
by: Rabasseda, Oriol, et al.
Published: (2026)

Quantized Prompt for Efficient Generalization of Vision-Language Models
by: Hao, Tianxiang, et al.
Published: (2024)

Benchmarking Unified Face Attack Detection via Hierarchical Prompt Tuning
by: Liu, Ajian, et al.
Published: (2025)

Harnessing the Power of Large Vision Language Models for Synthetic Image Detection
by: Keita, Mamadou, et al.
Published: (2024)

Human-Free Automated Prompting for Vision-Language Anomaly Detection: Prompt Optimization with Meta-guiding Prompt Scheme
by: Chen, Pi-Wei, et al.
Published: (2024)

BEVSpread: Spread Voxel Pooling for Bird's-Eye-View Representation in Vision-based Roadside 3D Object Detection
by: Wang, Wenjie, et al.
Published: (2024)

Enhancing Zero-Shot Vision Models by Label-Free Prompt Distribution Learning and Bias Correcting
by: Zhu, Xingyu, et al.
Published: (2024)

FA: Forced Prompt Learning of Vision-Language Models for Out-of-Distribution Detection
by: Lu, Xinhua, et al.
Published: (2025)

Hydra: Computer Vision for Data Quality Monitoring
by: Britton, Thomas, et al.
Published: (2024)

What Matters in Virtual Try-Off? Dual-UNet Diffusion Model For Garment Reconstruction
by: Truong, Loc-Phat, et al.
Published: (2026)

Mamba as a Bridge: Where Vision Foundation Models Meet Vision Language Models for Domain-Generalized Semantic Segmentation
by: Zhang, Xin, et al.
Published: (2025)

Data Adaptive Traceback for Vision-Language Foundation Models in Image Classification
by: Peng, Wenshuo, et al.
Published: (2024)

Fake-HR1: Rethinking Reasoning of Vision Language Model for Synthetic Image Detection
by: Jiang, Changjiang, et al.
Published: (2026)

SEEC: Segmentation-Assisted Multi-Entropy Models for Learned Lossless Image Compression
by: Zheng, Chunhang, et al.
Published: (2025)

Long-term Traffic Simulation with Interleaved Autoregressive Motion and Scenario Generation
by: Yang, Xiuyu, et al.
Published: (2025)