| Main Authors: | Zuo, Rui, Tong, Qinyue, Lu, Zhe-Ming, Lu, Ziqian |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2511.13442 |
Similar Items
MediRound: Multi-Round Entity-Level Reasoning Segmentation in Medical Images
by: Tong, Qinyue, et al.
Published: (2025)
MediSee: Reasoning-based Pixel-level Perception in Medical Images
by: Tong, Qinyue, et al.
Published: (2025)
Improving Skeleton-based Action Recognition with Interactive Object Information
by: Wen, Hao, et al.
Published: (2025)
MedVeriSeg: Teaching MLLM-Based Medical Segmentation Models to Verify Query Validity Without Extra Training
by: Lu, Ziqian, et al.
Published: (2026)
From Training-Free to Adaptive: Empirical Insights into MLLMs' Understanding of Detection Information
by: Jiao, Qirui, et al.
Published: (2024)
Bee: A High-Quality Corpus and Full-Stack Suite to Unlock Advanced Fully Open MLLMs
by: Zhang, Yi, et al.
Published: (2025)
Enhancing Frequency Forgery Clues for Diffusion-Generated Image Detection
by: Zhang, Daichi, et al.
Published: (2025)
Unlocking the Potential of MLLMs in Referring Expression Segmentation via a Light-weight Mask Decoder
by: Wang, Jingchao, et al.
Published: (2025)
ForgeryGPT: A Multimodal LLM for Interpretable Image Forgery Detection and Localization
by: Zhang, Fanrui, et al.
Published: (2024)
Diffusion Facial Forgery Detection
by: Cheng, Harry, et al.
Published: (2024)
Enhancing Self-Supervised Talking Head Forgery Detection via a Training-Free Dual-System Framework
by: Liu, Ke, et al.
Published: (2026)
360° Image Perception with MLLMs: A Comprehensive Benchmark and a Training-Free Method
by: Tran, Huyen T. T., et al.
Published: (2026)
Video Forgery Detection for Surveillance Cameras: A Review
by: Tayfor, Noor B., et al.
Published: (2025)
STEVE: A Step Verification Pipeline for Computer-use Agent Training
by: Lu, Fanbin, et al.
Published: (2025)
Med-Scout: Curing MLLMs' Geometric Blindness in Medical Perception via Geometry-Aware RL Post-Training
by: Liu, Anglin, et al.
Published: (2026)
Taming Hallucinations: Boosting MLLMs' Video Understanding via Counterfactual Video Generation
by: Huang, Zhe, et al.
Published: (2025)
Beyond Attention or Similarity: Maximizing Conditional Diversity for Token Pruning in MLLMs
by: Zhang, Qizhe, et al.
Published: (2025)
Turning Generators into Retrievers: Unlocking MLLMs for Natural Language-Guided Geo-Localization
by: Chen, Yuqi, et al.
Published: (2026)
Lifting the Veil on Visual Information Flow in MLLMs: Unlocking Pathways to Faster Inference
by: Yin, Hao, et al.
Published: (2025)
Weakly-Supervised Image Forgery Localization via Vision-Language Collaborative Reasoning Framework
by: Sheng, Ziqi, et al.
Published: (2025)
Loupe: A Generalizable and Adaptive Framework for Image Forgery Detection
by: Jiang, Yuchu, et al.
Published: (2025)
Digital Image Forgery Detection Using Transfer Learning
by: Buyuk, Fatma Betul, et al.
Published: (2026)
Signature Forgery Detection: Improving Cross-Dataset Generalization
by: Parracho, Matheus Ramos
Published: (2025)
Field-Localized Forgery Detection for Digital Identity Documents
by: Kumar, Abhishek, et al.
Published: (2026)
Suppressing Forgery-Specific Shortcuts for Generalizable Deepfake Detection
by: Wang, Yihui, et al.
Published: (2026)
SUMI-IFL: An Information-Theoretic Framework for Image Forgery Localization with Sufficiency and Minimality Constraints
by: Sheng, Ziqi, et al.
Published: (2024)
GTR-Turbo: Merged Checkpoint is Secretly a Free Teacher for Agentic VLM Training
by: Wei, Tong, et al.
Published: (2025)
A Large-scale Universal Evaluation Benchmark For Face Forgery Detection
by: Bei, Yijun, et al.
Published: (2024)
Training-Free Consistency Pipeline for Fashion Repose
by: Aghilar, Potito, et al.
Published: (2025)
CAM-VFD: Cross-Attention Multimodal Video Forgery Detection
by: Elkhodary, Hoda Osama, et al.
Published: (2026)
Preserving Forgery Artifacts: AI-Generated Video Detection at Native Scale
by: Li, Zhengcen, et al.
Published: (2026)
FakeRadar: Probing Forgery Outliers to Detect Unknown Deepfake Videos
by: Li, Zhaolun, et al.
Published: (2025)
MME-Reasoning: A Comprehensive Benchmark for Logical Reasoning in MLLMs
by: Yuan, Jiakang, et al.
Published: (2025)
CorrDetail: Visual Detail Enhanced Self-Correction for Face Forgery Detection
by: Zhou, Binjia, et al.
Published: (2025)
Band-Attention Modulated RetNet for Face Forgery Detection
by: Zhang, Zhida, et al.
Published: (2024)
Order within Chaos: Capturing Intrinsic Energy Anomalies for AI-Manipulated Image Forgery Localization
by: Wang, Yiming, et al.
Published: (2026)
UniShield: An Adaptive Multi-Agent Framework for Unified Forgery Image Detection and Localization
by: Huang, Qing, et al.
Published: (2025)
AIFIND: Artifact-Aware Interpreting Fine-Grained Alignment for Incremental Face Forgery Detection
by: Wang, Hao, et al.
Published: (2026)
FINER: MLLMs Hallucinate under Fine-grained Negative Queries
by: Xiao, Rui, et al.
Published: (2026)
DMGD: Train-Free Dataset Distillation with Semantic-Distribution Matching in Diffusion Models
by: Wang, Qichao, et al.
Published: (2026)