| Main Authors: | Qiu, Mei, Zhao, Jianqiang, Qu, Yanyun |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.04608 |
Similar Items
CLIP-FSAC++: Few-Shot Anomaly Classification with Anomaly Descriptor Based on CLIP
by: Zuo, Zuo, et al.
Published: (2024)
Beyond Accuracy: Metrics that Uncover What Makes a 'Good' Visual Descriptor
by: Lin, Ethan, et al.
Published: (2025)
FakeVLM-R1: Internalizing Physical Laws via CoT for Synthetic Image Detection
by: Zhu, Leqi, et al.
Published: (2026)
Fusion-then-Distillation: Toward Cross-modal Positive Distillation for Domain Adaptive 3D Semantic Segmentation
by: Wu, Yao, et al.
Published: (2024)
PC-CrossDiff: Point-Cluster Dual-Level Cross-Modal Differential Attention for Unified 3D Referring and Segmentation
by: Tan, Wenbin, et al.
Published: (2026)
Detect Fake with Fake: Leveraging Synthetic Data-driven Representation for Synthetic Image Detection
by: Otake, Hina, et al.
Published: (2024)
Multi-Channel Cross Modal Detection of Synthetic Face Images
by: Ibsen, M., et al.
Published: (2023)
Target Refocusing via Attention Redistribution for Open-Vocabulary Semantic Segmentation: An Explainability Perspective
by: Li, Jiahao, et al.
Published: (2025)
Beyond the Label Itself: Latent Labels Enhance Semi-supervised Point Cloud Panoptic Segmentation
by: Chen, Yujun, et al.
Published: (2023)
Training-Free Anomaly Generation via Dual-Attention Enhancement in Diffusion Model
by: Zuo, Zuo, et al.
Published: (2025)
SpatiaLoc: Leveraging Multi-Level Spatial Enhanced Descriptors for Cross-Modal Localization
by: Shang, Tianyi, et al.
Published: (2026)
Cross-Domain Semantic Segmentation with Large Language Model-Assisted Descriptor Generation
by: Hughes, Philip, et al.
Published: (2025)
Novel Category Discovery with X-Agent Attention for Open-Vocabulary Semantic Segmentation
by: Li, Jiahao, et al.
Published: (2025)
DPL: Cross-quality DeepFake Detection via Dual Progressive Learning
by: Zhang, Dongliang, et al.
Published: (2024)
Direct Segmentation without Logits Optimization for Training-Free Open-Vocabulary Semantic Segmentation
by: Li, Jiahao, et al.
Published: (2026)
SP3D: Boosting Sparsely-Supervised 3D Object Detection via Accurate Cross-Modal Semantic Prompts
by: Zhao, Shijia, et al.
Published: (2025)
CLIP3D-AD: Extending CLIP for 3D Few-Shot Anomaly Detection with Multi-View Images Generation
by: Zuo, Zuo, et al.
Published: (2024)
Federated Cross-Modal Retrieval with Missing Modalities via Semantic Routing and Adapter Personalization
by: Zhou, Hefeng, et al.
Published: (2026)
Building a Strong Pre-Training Baseline for Universal 3D Large-Scale Perception
by: Chen, Haoming, et al.
Published: (2024)
RASR: Retrieval-Augmented Semantic Reasoning for Fake News Video Detection
by: Li, Hui, et al.
Published: (2026)
Spot the Fake: Large Multimodal Model-Based Synthetic Image Detection with Artifact Explanation
by: Wen, Siwei, et al.
Published: (2025)
Head-wise Modality Specialization within MLLMs for Robust Fake News Detection under Missing Modality
by: Qian, Kai, et al.
Published: (2026)
Cross-Modal Scene Semantic Alignment for Image Complexity Assessment
by: Luo, Yuqing, et al.
Published: (2025)
Text Modality Oriented Image Feature Extraction for Detecting Diffusion-based DeepFake
by: Yang, Di, et al.
Published: (2024)
Exploring Image Representation with Decoupled Classical Visual Descriptors
by: Qu, Chenyuan, et al.
Published: (2025)
Beyond Accuracy: Uncovering the Role of Similarity Perception and its Alignment with Semantics in Supervised Learning
by: Filus, Katarzyna, et al.
Published: (2025)
Data-free Distillation with Degradation-prompt Diffusion for Multi-weather Image Restoration
by: Wang, Pei, et al.
Published: (2024)
Learning Semantic Facial Descriptors for Accurate Face Animation
by: Zhu, Lei, et al.
Published: (2025)
CrossWeaver: Cross-modal Weaving for Arbitrary-Modality Semantic Segmentation
by: Zhang, Zelin, et al.
Published: (2026)
Asymmetric Cross-Modal Knowledge Distillation: Bridging Modalities with Weak Semantic Consistency
by: Wei, Riling, et al.
Published: (2025)
AlignGen: Boosting Personalized Image Generation with Cross-Modality Prior Alignment
by: Lin, Yiheng, et al.
Published: (2025)
Semantics-Oriented Multitask Learning for DeepFake Detection: A Joint Embedding Approach
by: Zou, Mian, et al.
Published: (2024)
FakeBench: Probing Explainable Fake Image Detection via Large Multimodal Models
by: Li, Yixuan, et al.
Published: (2024)
Multimodal Cancer Survival Analysis via Hypergraph Learning with Cross-Modality Rebalance
by: Qu, Mingcheng, et al.
Published: (2025)
SGAD: Semantic and Geometric-aware Descriptor for Local Feature Matching
by: Liu, Xiangzeng, et al.
Published: (2025)
PromptAD: Learning Prompts with only Normal Samples for Few-Shot Anomaly Detection
by: Li, Xiaofan, et al.
Published: (2024)
PhysGame: Uncovering Physical Commonsense Violations in Gameplay Videos
by: Cao, Meng, et al.
Published: (2024)
TalkingHeadBench: A Multi-Modal Benchmark & Analysis of Talking-Head DeepFake Detection
by: Xiong, Xinqi, et al.
Published: (2025)
CroMe: Multimodal Fake News Detection using Cross-Modal Tri-Transformer and Metric Learning
by: Choi, Eunjee, et al.
Published: (2025)
DeepFake-Adapter: Dual-Level Adapter for DeepFake Detection
by: Shao, Rui, et al.
Published: (2023)