Saved in:
| Main Authors: | Huang, Zizheng, Chen, Haoxing, Li, Jiaqi, Lan, Jun, Zhu, Huijia, Wang, Weiqiang, Wang, Limin |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2408.17081 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
DeMamba: AI-Generated Video Detection on Million-Scale GenVideo Benchmark
by: Chen, Haoxing, et al.
Published: (2024)
by: Chen, Haoxing, et al.
Published: (2024)
Boosting Audio-visual Zero-shot Learning with Large Language Models
by: Chen, Haoxing, et al.
Published: (2023)
by: Chen, Haoxing, et al.
Published: (2023)
Conditional Prototype Rectification Prompt Learning
by: Chen, Haoxing, et al.
Published: (2024)
by: Chen, Haoxing, et al.
Published: (2024)
Adaptive and Balanced Re-initialization for Long-timescale Continual Test-time Domain Adaptation
by: Wang, Yanshuo, et al.
Published: (2026)
by: Wang, Yanshuo, et al.
Published: (2026)
Maintain Plasticity in Long-timescale Continual Test-time Adaptation
by: Wang, Yanshuo, et al.
Published: (2024)
by: Wang, Yanshuo, et al.
Published: (2024)
Robustness in AI-Generated Detection: Enhancing Resistance to Adversarial Attacks
by: Haoxuan, Sun, et al.
Published: (2025)
by: Haoxuan, Sun, et al.
Published: (2025)
Supervised Contrastive Learning for Snapshot Spectral Imaging Face Anti-Spoofing
by: Song, Chuanbiao, et al.
Published: (2024)
by: Song, Chuanbiao, et al.
Published: (2024)
EMIT: Enhancing MLLMs for Industrial Anomaly Detection via Difficulty-Aware GRPO
by: Guan, Wei, et al.
Published: (2025)
by: Guan, Wei, et al.
Published: (2025)
Generalizable and Adaptive Continual Learning Framework for AI-generated Image Detection
by: Wang, Hanyi, et al.
Published: (2026)
by: Wang, Hanyi, et al.
Published: (2026)
Efficient Transfer Learning for Video-language Foundation Models
by: Chen, Haoxing, et al.
Published: (2024)
by: Chen, Haoxing, et al.
Published: (2024)
InterAnimate: Taming Region-aware Diffusion Model for Realistic Human Interaction Animation
by: Lin, Yukang, et al.
Published: (2025)
by: Lin, Yukang, et al.
Published: (2025)
DS-VTON: An Enhanced Dual-Scale Coarse-to-Fine Framework for Virtual Try-On
by: Sun, Xianbing, et al.
Published: (2025)
by: Sun, Xianbing, et al.
Published: (2025)
VideoVeritas: AI-Generated Video Detection via Perception Pretext Reinforcement Learning
by: Tan, Hao, et al.
Published: (2026)
by: Tan, Hao, et al.
Published: (2026)
DomainGallery: Few-shot Domain-driven Image Generation by Attribute-centric Finetuning
by: Duan, Yuxuan, et al.
Published: (2024)
by: Duan, Yuxuan, et al.
Published: (2024)
Interpretable and Reliable Detection of AI-Generated Images via Grounded Reasoning in MLLMs
by: Ji, Yikun, et al.
Published: (2025)
by: Ji, Yikun, et al.
Published: (2025)
Towards Explainable Fake Image Detection with Multi-Modal Large Language Models
by: Ji, Yikun, et al.
Published: (2025)
by: Ji, Yikun, et al.
Published: (2025)
Shuffle Mamba: State Space Models with Random Shuffle for Multi-Modal Image Fusion
by: Cao, Ke, et al.
Published: (2024)
by: Cao, Ke, et al.
Published: (2024)
Locate-Then-Examine: Grounded Region Reasoning Improves Detection of AI-Generated Images
by: Ji, Yikun, et al.
Published: (2025)
by: Ji, Yikun, et al.
Published: (2025)
Veritas: Generalizable Deepfake Detection via Pattern-Aware Reasoning
by: Tan, Hao, et al.
Published: (2025)
by: Tan, Hao, et al.
Published: (2025)
Dual-Adapter: Training-free Dual Adaptation for Few-shot Out-of-Distribution Detection
by: Chen, Xinyi, et al.
Published: (2024)
by: Chen, Xinyi, et al.
Published: (2024)
Depth-Wise Convolutions in Vision Transformers for Efficient Training on Small Datasets
by: Zhang, Tianxiao, et al.
Published: (2024)
by: Zhang, Tianxiao, et al.
Published: (2024)
RhythmMamba: Fast, Lightweight, and Accurate Remote Physiological Measurement
by: Zou, Bochao, et al.
Published: (2024)
by: Zou, Bochao, et al.
Published: (2024)
V-Shuffle: Zero-Shot Style Transfer via Value Shuffle
by: Tang, Haojun, et al.
Published: (2025)
by: Tang, Haojun, et al.
Published: (2025)
TextSleuth: Towards Explainable Tampered Text Detection
by: Qu, Chenfan, et al.
Published: (2024)
by: Qu, Chenfan, et al.
Published: (2024)
Training-free Token Reduction for Vision Mamba
by: Ma, Qiankun, et al.
Published: (2025)
by: Ma, Qiankun, et al.
Published: (2025)
VideoMamba: State Space Model for Efficient Video Understanding
by: Li, Kunchang, et al.
Published: (2024)
by: Li, Kunchang, et al.
Published: (2024)
GAMMA: Generalizable Alignment via Multi-task and Manipulation-Augmented Training for AI-Generated Image Detection
by: Yan, Haozhen, et al.
Published: (2025)
by: Yan, Haozhen, et al.
Published: (2025)
Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding
by: Chen, Guo, et al.
Published: (2024)
by: Chen, Guo, et al.
Published: (2024)
Zooming without Zooming: Region-to-Image Distillation for Fine-Grained Multimodal Perception
by: Wei, Lai, et al.
Published: (2026)
by: Wei, Lai, et al.
Published: (2026)
StreamOV: Streaming Omni-Video Understanding via Evidence-Guided Memory and Response Triggering
by: Xie, Ming, et al.
Published: (2026)
by: Xie, Ming, et al.
Published: (2026)
Autoregressive Pretraining with Mamba in Vision
by: Ren, Sucheng, et al.
Published: (2024)
by: Ren, Sucheng, et al.
Published: (2024)
LayerShuffle: Enhancing Robustness in Vision Transformers by Randomizing Layer Execution Order
by: Freiberger, Matthias, et al.
Published: (2024)
by: Freiberger, Matthias, et al.
Published: (2024)
Mamba-R: Vision Mamba ALSO Needs Registers
by: Wang, Feng, et al.
Published: (2024)
by: Wang, Feng, et al.
Published: (2024)
GlobalMamba: Global Image Serialization for Vision Mamba
by: Wang, Chengkun, et al.
Published: (2024)
by: Wang, Chengkun, et al.
Published: (2024)
Demystify Mamba in Vision: A Linear Attention Perspective
by: Han, Dongchen, et al.
Published: (2024)
by: Han, Dongchen, et al.
Published: (2024)
VideoMAP: Toward Scalable Mamba-based Video Autoregressive Pretraining
by: Liu, Yunze, et al.
Published: (2025)
by: Liu, Yunze, et al.
Published: (2025)
MSV-Mamba: A Multiscale Vision Mamba Network for Echocardiography Segmentation
by: Yang, Xiaoxian, et al.
Published: (2025)
by: Yang, Xiaoxian, et al.
Published: (2025)
Vision Mamba Distillation for Low-resolution Fine-grained Image Classification
by: Chen, Yao, et al.
Published: (2024)
by: Chen, Yao, et al.
Published: (2024)
Dynamic Vision Mamba
by: Wu, Mengxuan, et al.
Published: (2025)
by: Wu, Mengxuan, et al.
Published: (2025)
Layer-Wise Modality Decomposition for Interpretable Multimodal Sensor Fusion
by: Park, Jaehyun, et al.
Published: (2025)
by: Park, Jaehyun, et al.
Published: (2025)
Similar Items
-
DeMamba: AI-Generated Video Detection on Million-Scale GenVideo Benchmark
by: Chen, Haoxing, et al.
Published: (2024) -
Boosting Audio-visual Zero-shot Learning with Large Language Models
by: Chen, Haoxing, et al.
Published: (2023) -
Conditional Prototype Rectification Prompt Learning
by: Chen, Haoxing, et al.
Published: (2024) -
Adaptive and Balanced Re-initialization for Long-timescale Continual Test-time Domain Adaptation
by: Wang, Yanshuo, et al.
Published: (2026) -
Maintain Plasticity in Long-timescale Continual Test-time Adaptation
by: Wang, Yanshuo, et al.
Published: (2024)