:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Qiu, Mei, Zhao, Jianqiang, Qu, Yanyun
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2604.04608
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

CLIP-FSAC++: Few-Shot Anomaly Classification with Anomaly Descriptor Based on CLIP
by: Zuo, Zuo, et al.
Published: (2024)

Beyond Accuracy: Metrics that Uncover What Makes a 'Good' Visual Descriptor
by: Lin, Ethan, et al.
Published: (2025)

FakeVLM-R1: Internalizing Physical Laws via CoT for Synthetic Image Detection
by: Zhu, Leqi, et al.
Published: (2026)

Fusion-then-Distillation: Toward Cross-modal Positive Distillation for Domain Adaptive 3D Semantic Segmentation
by: Wu, Yao, et al.
Published: (2024)

PC-CrossDiff: Point-Cluster Dual-Level Cross-Modal Differential Attention for Unified 3D Referring and Segmentation
by: Tan, Wenbin, et al.
Published: (2026)

Detect Fake with Fake: Leveraging Synthetic Data-driven Representation for Synthetic Image Detection
by: Otake, Hina, et al.
Published: (2024)

Multi-Channel Cross Modal Detection of Synthetic Face Images
by: Ibsen, M., et al.
Published: (2023)

Target Refocusing via Attention Redistribution for Open-Vocabulary Semantic Segmentation: An Explainability Perspective
by: Li, Jiahao, et al.
Published: (2025)

Beyond the Label Itself: Latent Labels Enhance Semi-supervised Point Cloud Panoptic Segmentation
by: Chen, Yujun, et al.
Published: (2023)

Training-Free Anomaly Generation via Dual-Attention Enhancement in Diffusion Model
by: Zuo, Zuo, et al.
Published: (2025)

SpatiaLoc: Leveraging Multi-Level Spatial Enhanced Descriptors for Cross-Modal Localization
by: Shang, Tianyi, et al.
Published: (2026)

Cross-Domain Semantic Segmentation with Large Language Model-Assisted Descriptor Generation
by: Hughes, Philip, et al.
Published: (2025)

Novel Category Discovery with X-Agent Attention for Open-Vocabulary Semantic Segmentation
by: Li, Jiahao, et al.
Published: (2025)

DPL: Cross-quality DeepFake Detection via Dual Progressive Learning
by: Zhang, Dongliang, et al.
Published: (2024)

Direct Segmentation without Logits Optimization for Training-Free Open-Vocabulary Semantic Segmentation
by: Li, Jiahao, et al.
Published: (2026)

SP3D: Boosting Sparsely-Supervised 3D Object Detection via Accurate Cross-Modal Semantic Prompts
by: Zhao, Shijia, et al.
Published: (2025)

CLIP3D-AD: Extending CLIP for 3D Few-Shot Anomaly Detection with Multi-View Images Generation
by: Zuo, Zuo, et al.
Published: (2024)

Federated Cross-Modal Retrieval with Missing Modalities via Semantic Routing and Adapter Personalization
by: Zhou, Hefeng, et al.
Published: (2026)

Building a Strong Pre-Training Baseline for Universal 3D Large-Scale Perception
by: Chen, Haoming, et al.
Published: (2024)

RASR: Retrieval-Augmented Semantic Reasoning for Fake News Video Detection
by: Li, Hui, et al.
Published: (2026)

Spot the Fake: Large Multimodal Model-Based Synthetic Image Detection with Artifact Explanation
by: Wen, Siwei, et al.
Published: (2025)

Head-wise Modality Specialization within MLLMs for Robust Fake News Detection under Missing Modality
by: Qian, Kai, et al.
Published: (2026)

Cross-Modal Scene Semantic Alignment for Image Complexity Assessment
by: Luo, Yuqing, et al.
Published: (2025)

Text Modality Oriented Image Feature Extraction for Detecting Diffusion-based DeepFake
by: Yang, Di, et al.
Published: (2024)

Exploring Image Representation with Decoupled Classical Visual Descriptors
by: Qu, Chenyuan, et al.
Published: (2025)

Beyond Accuracy: Uncovering the Role of Similarity Perception and its Alignment with Semantics in Supervised Learning
by: Filus, Katarzyna, et al.
Published: (2025)

Data-free Distillation with Degradation-prompt Diffusion for Multi-weather Image Restoration
by: Wang, Pei, et al.
Published: (2024)

Learning Semantic Facial Descriptors for Accurate Face Animation
by: Zhu, Lei, et al.
Published: (2025)

CrossWeaver: Cross-modal Weaving for Arbitrary-Modality Semantic Segmentation
by: Zhang, Zelin, et al.
Published: (2026)

Asymmetric Cross-Modal Knowledge Distillation: Bridging Modalities with Weak Semantic Consistency
by: Wei, Riling, et al.
Published: (2025)

AlignGen: Boosting Personalized Image Generation with Cross-Modality Prior Alignment
by: Lin, Yiheng, et al.
Published: (2025)

Semantics-Oriented Multitask Learning for DeepFake Detection: A Joint Embedding Approach
by: Zou, Mian, et al.
Published: (2024)

FakeBench: Probing Explainable Fake Image Detection via Large Multimodal Models
by: Li, Yixuan, et al.
Published: (2024)

Multimodal Cancer Survival Analysis via Hypergraph Learning with Cross-Modality Rebalance
by: Qu, Mingcheng, et al.
Published: (2025)

SGAD: Semantic and Geometric-aware Descriptor for Local Feature Matching
by: Liu, Xiangzeng, et al.
Published: (2025)

PromptAD: Learning Prompts with only Normal Samples for Few-Shot Anomaly Detection
by: Li, Xiaofan, et al.
Published: (2024)

PhysGame: Uncovering Physical Commonsense Violations in Gameplay Videos
by: Cao, Meng, et al.
Published: (2024)

TalkingHeadBench: A Multi-Modal Benchmark & Analysis of Talking-Head DeepFake Detection
by: Xiong, Xinqi, et al.
Published: (2025)

CroMe: Multimodal Fake News Detection using Cross-Modal Tri-Transformer and Metric Learning
by: Choi, Eunjee, et al.
Published: (2025)

DeepFake-Adapter: Dual-Level Adapter for DeepFake Detection
by: Shao, Rui, et al.
Published: (2023)