:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Huang, Zizheng, Chen, Haoxing, Li, Jiaqi, Lan, Jun, Zhu, Huijia, Wang, Weiqiang, Wang, Limin
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2408.17081
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

DeMamba: AI-Generated Video Detection on Million-Scale GenVideo Benchmark
by: Chen, Haoxing, et al.
Published: (2024)

Boosting Audio-visual Zero-shot Learning with Large Language Models
by: Chen, Haoxing, et al.
Published: (2023)

Conditional Prototype Rectification Prompt Learning
by: Chen, Haoxing, et al.
Published: (2024)

Adaptive and Balanced Re-initialization for Long-timescale Continual Test-time Domain Adaptation
by: Wang, Yanshuo, et al.
Published: (2026)

Maintain Plasticity in Long-timescale Continual Test-time Adaptation
by: Wang, Yanshuo, et al.
Published: (2024)

Robustness in AI-Generated Detection: Enhancing Resistance to Adversarial Attacks
by: Haoxuan, Sun, et al.
Published: (2025)

Supervised Contrastive Learning for Snapshot Spectral Imaging Face Anti-Spoofing
by: Song, Chuanbiao, et al.
Published: (2024)

EMIT: Enhancing MLLMs for Industrial Anomaly Detection via Difficulty-Aware GRPO
by: Guan, Wei, et al.
Published: (2025)

Generalizable and Adaptive Continual Learning Framework for AI-generated Image Detection
by: Wang, Hanyi, et al.
Published: (2026)

Efficient Transfer Learning for Video-language Foundation Models
by: Chen, Haoxing, et al.
Published: (2024)

InterAnimate: Taming Region-aware Diffusion Model for Realistic Human Interaction Animation
by: Lin, Yukang, et al.
Published: (2025)

DS-VTON: An Enhanced Dual-Scale Coarse-to-Fine Framework for Virtual Try-On
by: Sun, Xianbing, et al.
Published: (2025)

VideoVeritas: AI-Generated Video Detection via Perception Pretext Reinforcement Learning
by: Tan, Hao, et al.
Published: (2026)

DomainGallery: Few-shot Domain-driven Image Generation by Attribute-centric Finetuning
by: Duan, Yuxuan, et al.
Published: (2024)

Interpretable and Reliable Detection of AI-Generated Images via Grounded Reasoning in MLLMs
by: Ji, Yikun, et al.
Published: (2025)

Towards Explainable Fake Image Detection with Multi-Modal Large Language Models
by: Ji, Yikun, et al.
Published: (2025)

Shuffle Mamba: State Space Models with Random Shuffle for Multi-Modal Image Fusion
by: Cao, Ke, et al.
Published: (2024)

Locate-Then-Examine: Grounded Region Reasoning Improves Detection of AI-Generated Images
by: Ji, Yikun, et al.
Published: (2025)

Veritas: Generalizable Deepfake Detection via Pattern-Aware Reasoning
by: Tan, Hao, et al.
Published: (2025)

Dual-Adapter: Training-free Dual Adaptation for Few-shot Out-of-Distribution Detection
by: Chen, Xinyi, et al.
Published: (2024)

Depth-Wise Convolutions in Vision Transformers for Efficient Training on Small Datasets
by: Zhang, Tianxiao, et al.
Published: (2024)

RhythmMamba: Fast, Lightweight, and Accurate Remote Physiological Measurement
by: Zou, Bochao, et al.
Published: (2024)

V-Shuffle: Zero-Shot Style Transfer via Value Shuffle
by: Tang, Haojun, et al.
Published: (2025)

TextSleuth: Towards Explainable Tampered Text Detection
by: Qu, Chenfan, et al.
Published: (2024)

Training-free Token Reduction for Vision Mamba
by: Ma, Qiankun, et al.
Published: (2025)

VideoMamba: State Space Model for Efficient Video Understanding
by: Li, Kunchang, et al.
Published: (2024)

GAMMA: Generalizable Alignment via Multi-task and Manipulation-Augmented Training for AI-Generated Image Detection
by: Yan, Haozhen, et al.
Published: (2025)

Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding
by: Chen, Guo, et al.
Published: (2024)

Zooming without Zooming: Region-to-Image Distillation for Fine-Grained Multimodal Perception
by: Wei, Lai, et al.
Published: (2026)

StreamOV: Streaming Omni-Video Understanding via Evidence-Guided Memory and Response Triggering
by: Xie, Ming, et al.
Published: (2026)

Autoregressive Pretraining with Mamba in Vision
by: Ren, Sucheng, et al.
Published: (2024)

LayerShuffle: Enhancing Robustness in Vision Transformers by Randomizing Layer Execution Order
by: Freiberger, Matthias, et al.
Published: (2024)

Mamba-R: Vision Mamba ALSO Needs Registers
by: Wang, Feng, et al.
Published: (2024)

GlobalMamba: Global Image Serialization for Vision Mamba
by: Wang, Chengkun, et al.
Published: (2024)

Demystify Mamba in Vision: A Linear Attention Perspective
by: Han, Dongchen, et al.
Published: (2024)

VideoMAP: Toward Scalable Mamba-based Video Autoregressive Pretraining
by: Liu, Yunze, et al.
Published: (2025)

MSV-Mamba: A Multiscale Vision Mamba Network for Echocardiography Segmentation
by: Yang, Xiaoxian, et al.
Published: (2025)

Vision Mamba Distillation for Low-resolution Fine-grained Image Classification
by: Chen, Yao, et al.
Published: (2024)

Dynamic Vision Mamba
by: Wu, Mengxuan, et al.
Published: (2025)

Layer-Wise Modality Decomposition for Interpretable Multimodal Sensor Fusion
by: Park, Jaehyun, et al.
Published: (2025)