:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Zuo, Rui, Tong, Qinyue, Lu, Zhe-Ming, Lu, Ziqian
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition Artificial Intelligence
Online Access:	https://arxiv.org/abs/2511.13442
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

MediRound: Multi-Round Entity-Level Reasoning Segmentation in Medical Images
by: Tong, Qinyue, et al.
Published: (2025)

MediSee: Reasoning-based Pixel-level Perception in Medical Images
by: Tong, Qinyue, et al.
Published: (2025)

Improving Skeleton-based Action Recognition with Interactive Object Information
by: Wen, Hao, et al.
Published: (2025)

MedVeriSeg: Teaching MLLM-Based Medical Segmentation Models to Verify Query Validity Without Extra Training
by: Lu, Ziqian, et al.
Published: (2026)

From Training-Free to Adaptive: Empirical Insights into MLLMs' Understanding of Detection Information
by: Jiao, Qirui, et al.
Published: (2024)

Bee: A High-Quality Corpus and Full-Stack Suite to Unlock Advanced Fully Open MLLMs
by: Zhang, Yi, et al.
Published: (2025)

Enhancing Frequency Forgery Clues for Diffusion-Generated Image Detection
by: Zhang, Daichi, et al.
Published: (2025)

Unlocking the Potential of MLLMs in Referring Expression Segmentation via a Light-weight Mask Decoder
by: Wang, Jingchao, et al.
Published: (2025)

ForgeryGPT: A Multimodal LLM for Interpretable Image Forgery Detection and Localization
by: Zhang, Fanrui, et al.
Published: (2024)

Diffusion Facial Forgery Detection
by: Cheng, Harry, et al.
Published: (2024)

Enhancing Self-Supervised Talking Head Forgery Detection via a Training-Free Dual-System Framework
by: Liu, Ke, et al.
Published: (2026)

360° Image Perception with MLLMs: A Comprehensive Benchmark and a Training-Free Method
by: Tran, Huyen T. T., et al.
Published: (2026)

Video Forgery Detection for Surveillance Cameras: A Review
by: Tayfor, Noor B., et al.
Published: (2025)

STEVE: A Step Verification Pipeline for Computer-use Agent Training
by: Lu, Fanbin, et al.
Published: (2025)

Med-Scout: Curing MLLMs' Geometric Blindness in Medical Perception via Geometry-Aware RL Post-Training
by: Liu, Anglin, et al.
Published: (2026)

Taming Hallucinations: Boosting MLLMs' Video Understanding via Counterfactual Video Generation
by: Huang, Zhe, et al.
Published: (2025)

Beyond Attention or Similarity: Maximizing Conditional Diversity for Token Pruning in MLLMs
by: Zhang, Qizhe, et al.
Published: (2025)

Turning Generators into Retrievers: Unlocking MLLMs for Natural Language-Guided Geo-Localization
by: Chen, Yuqi, et al.
Published: (2026)

Lifting the Veil on Visual Information Flow in MLLMs: Unlocking Pathways to Faster Inference
by: Yin, Hao, et al.
Published: (2025)

Weakly-Supervised Image Forgery Localization via Vision-Language Collaborative Reasoning Framework
by: Sheng, Ziqi, et al.
Published: (2025)

Loupe: A Generalizable and Adaptive Framework for Image Forgery Detection
by: Jiang, Yuchu, et al.
Published: (2025)

Digital Image Forgery Detection Using Transfer Learning
by: Buyuk, Fatma Betul, et al.
Published: (2026)

Signature Forgery Detection: Improving Cross-Dataset Generalization
by: Parracho, Matheus Ramos
Published: (2025)

Field-Localized Forgery Detection for Digital Identity Documents
by: Kumar, Abhishek, et al.
Published: (2026)

Suppressing Forgery-Specific Shortcuts for Generalizable Deepfake Detection
by: Wang, Yihui, et al.
Published: (2026)

SUMI-IFL: An Information-Theoretic Framework for Image Forgery Localization with Sufficiency and Minimality Constraints
by: Sheng, Ziqi, et al.
Published: (2024)

GTR-Turbo: Merged Checkpoint is Secretly a Free Teacher for Agentic VLM Training
by: Wei, Tong, et al.
Published: (2025)

A Large-scale Universal Evaluation Benchmark For Face Forgery Detection
by: Bei, Yijun, et al.
Published: (2024)

Training-Free Consistency Pipeline for Fashion Repose
by: Aghilar, Potito, et al.
Published: (2025)

CAM-VFD: Cross-Attention Multimodal Video Forgery Detection
by: Elkhodary, Hoda Osama, et al.
Published: (2026)

Preserving Forgery Artifacts: AI-Generated Video Detection at Native Scale
by: Li, Zhengcen, et al.
Published: (2026)

FakeRadar: Probing Forgery Outliers to Detect Unknown Deepfake Videos
by: Li, Zhaolun, et al.
Published: (2025)

MME-Reasoning: A Comprehensive Benchmark for Logical Reasoning in MLLMs
by: Yuan, Jiakang, et al.
Published: (2025)

CorrDetail: Visual Detail Enhanced Self-Correction for Face Forgery Detection
by: Zhou, Binjia, et al.
Published: (2025)

Band-Attention Modulated RetNet for Face Forgery Detection
by: Zhang, Zhida, et al.
Published: (2024)

Order within Chaos: Capturing Intrinsic Energy Anomalies for AI-Manipulated Image Forgery Localization
by: Wang, Yiming, et al.
Published: (2026)

UniShield: An Adaptive Multi-Agent Framework for Unified Forgery Image Detection and Localization
by: Huang, Qing, et al.
Published: (2025)

AIFIND: Artifact-Aware Interpreting Fine-Grained Alignment for Incremental Face Forgery Detection
by: Wang, Hao, et al.
Published: (2026)

FINER: MLLMs Hallucinate under Fine-grained Negative Queries
by: Xiao, Rui, et al.
Published: (2026)

DMGD: Train-Free Dataset Distillation with Semantic-Distribution Matching in Diffusion Models
by: Wang, Qichao, et al.
Published: (2026)