Saved in:
| Main Authors: | Xu, Hui, Liu, Chi, Zhu, Congcong, Wang, Minghao, Qu, Youyang, Gao, Longxiang |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2509.15406 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Rethinking Bias in Generative Data Augmentation for Medical AI: a Frequency Recalibration Method
by: Liu, Chi, et al.
Published: (2025)
by: Liu, Chi, et al.
Published: (2025)
Learning to Look before Learning to Like: Incorporating Human Visual Cognition into Aesthetic Quality Assessment
by: Yu, Liwen, et al.
Published: (2026)
by: Yu, Liwen, et al.
Published: (2026)
Federated Balanced Learning
by: Li, Jiaze, et al.
Published: (2026)
by: Li, Jiaze, et al.
Published: (2026)
FSOD-VFM: Few-Shot Object Detection with Vision Foundation Models and Graph Diffusion
by: Feng, Chen-Bin, et al.
Published: (2026)
by: Feng, Chen-Bin, et al.
Published: (2026)
RA-Det: Towards Universal Detection of AI-Generated Images via Robustness Asymmetry
by: Wang, Xinchang, et al.
Published: (2026)
by: Wang, Xinchang, et al.
Published: (2026)
Towards Single-Source Domain Generalized Object Detection via Causal Visual Prompts
by: Li, Chen, et al.
Published: (2025)
by: Li, Chen, et al.
Published: (2025)
Millisecond-Response Tracking and Gazing System for UAVs: A Domestic Solution Based on "Phytium + Cambricon"
by: Zhu, Yuchen, et al.
Published: (2025)
by: Zhu, Yuchen, et al.
Published: (2025)
Remote Sensing Retrieval-Augmented Generation: Bridging Remote Sensing Imagery and Comprehensive Knowledge with a Multi-Modal Dataset and Retrieval-Augmented Generation Model
by: Wen, Congcong, et al.
Published: (2025)
by: Wen, Congcong, et al.
Published: (2025)
Neural Image Space Tessellation efect
by: Du, Youyang, et al.
Published: (2026)
by: Du, Youyang, et al.
Published: (2026)
Does Your Vision-Language Model Get Lost in the Long Video Sampling Dilemma?
by: Qu, Tianyuan, et al.
Published: (2025)
by: Qu, Tianyuan, et al.
Published: (2025)
Safe and Reliable Diffusion Models via Subspace Projection
by: Chen, Huiqiang, et al.
Published: (2025)
by: Chen, Huiqiang, et al.
Published: (2025)
FAST: A Dual-tier Few-Shot Learning Paradigm for Whole Slide Image Classification
by: Fu, Kexue, et al.
Published: (2024)
by: Fu, Kexue, et al.
Published: (2024)
Decoupling Semantics and Fingerprints: A Universal Representation for AI-Generated Image Detection
by: Wang, Zhiyuan, et al.
Published: (2026)
by: Wang, Zhiyuan, et al.
Published: (2026)
Comparison Drives Preference: Reference-Aware Modeling for AI-Generated Video Quality Assessment
by: Zou, Minghao, et al.
Published: (2026)
by: Zou, Minghao, et al.
Published: (2026)
Phase Matching for Out-of-Distribution Generalization
by: Hu, Chengming, et al.
Published: (2023)
by: Hu, Chengming, et al.
Published: (2023)
Draw Like an Artist: Complex Scene Generation with Diffusion Model via Composition, Painting, and Retouching
by: Liu, Minghao, et al.
Published: (2024)
by: Liu, Minghao, et al.
Published: (2024)
PRISM: Rethinking Scattered Atmosphere Reconstruction as a Unified Understanding and Generation Model for Real-world Dehazing
by: Fang, Chengyu, et al.
Published: (2026)
by: Fang, Chengyu, et al.
Published: (2026)
A Closer Look at Cross-Domain Few-Shot Object Detection: Fine-Tuning Matters and Parallel Decoder Helps
by: Yu, Xuanlong, et al.
Published: (2026)
by: Yu, Xuanlong, et al.
Published: (2026)
Causal Diffusion Transformers for Generative Modeling
by: Deng, Chaorui, et al.
Published: (2024)
by: Deng, Chaorui, et al.
Published: (2024)
Bringing Textual Prompt to AI-Generated Image Quality Assessment
by: Qu, Bowen, et al.
Published: (2024)
by: Qu, Bowen, et al.
Published: (2024)
CausalCine: Real-Time Autoregressive Generation for Multi-Shot Video Narratives
by: Meng, Yihao, et al.
Published: (2026)
by: Meng, Yihao, et al.
Published: (2026)
Vision Also You Need: Navigating Out-of-Distribution Detection with Multimodal Large Language Model
by: Xu, Haoran, et al.
Published: (2026)
by: Xu, Haoran, et al.
Published: (2026)
CleanPose: Category-Level Object Pose Estimation via Causal Learning and Knowledge Distillation
by: Lin, Xiao, et al.
Published: (2025)
by: Lin, Xiao, et al.
Published: (2025)
ELEV-VISION-SAM: Integrated Vision Language and Foundation Model for Automated Estimation of Building Lowest Floor Elevation
by: Ho, Yu-Hsuan, et al.
Published: (2024)
by: Ho, Yu-Hsuan, et al.
Published: (2024)
Brain Imaging-to-Graph Generation using Adversarial Hierarchical Diffusion Models for MCI Causality Analysis
by: Zuo, Qiankun, et al.
Published: (2023)
by: Zuo, Qiankun, et al.
Published: (2023)
TripleMixer: A 3D Point Cloud Denoising Model for Adverse Weather
by: Zhao, Xiongwei, et al.
Published: (2024)
by: Zhao, Xiongwei, et al.
Published: (2024)
UGAD: Universal Generative AI Detector utilizing Frequency Fingerprints
by: Alam, Inzamamul, et al.
Published: (2024)
by: Alam, Inzamamul, et al.
Published: (2024)
Cross-Modal Causal Intervention for Medical Report Generation
by: Chen, Weixing, et al.
Published: (2023)
by: Chen, Weixing, et al.
Published: (2023)
Region Matters: Efficient and Reliable Region-Aware Visual Place Recognition
by: Chen, Shunpeng, et al.
Published: (2026)
by: Chen, Shunpeng, et al.
Published: (2026)
Smudged Fingerprints: A Systematic Evaluation of the Robustness of AI Image Fingerprints
by: Yao, Kai, et al.
Published: (2025)
by: Yao, Kai, et al.
Published: (2025)
Multilevel Semantic-Aware Model for AI-Generated Video Quality Assessment
by: Li, Jiaze, et al.
Published: (2025)
by: Li, Jiaze, et al.
Published: (2025)
Print2Volume: Generating Synthetic OCT-based 3D Fingerprint Volume from 2D Fingerprint Image
by: Miao, Qingran, et al.
Published: (2025)
by: Miao, Qingran, et al.
Published: (2025)
Causal Context Adjustment Loss for Learned Image Compression
by: Han, Minghao, et al.
Published: (2024)
by: Han, Minghao, et al.
Published: (2024)
Fingerprint Presentation Attack Detector Using Global-Local Model
by: Liu, Haozhe, et al.
Published: (2024)
by: Liu, Haozhe, et al.
Published: (2024)
Rethinking Sparse Lexical Representations for Image Retrieval in the Age of Rising Multi-Modal Large Language Models
by: Nakata, Kengo, et al.
Published: (2024)
by: Nakata, Kengo, et al.
Published: (2024)
Fingerprints of Super Resolution Networks
by: Vonderfecht, Jeremy, et al.
Published: (2024)
by: Vonderfecht, Jeremy, et al.
Published: (2024)
Towards Sequence Modeling Alignment between Tokenizer and Autoregressive Model
by: Wu, Pingyu, et al.
Published: (2025)
by: Wu, Pingyu, et al.
Published: (2025)
Causal World Modeling for Robot Control
by: Li, Lin, et al.
Published: (2026)
by: Li, Lin, et al.
Published: (2026)
Conditional Synthetic Live and Spoof Fingerprint Generation
by: Abbas, Syed Konain, et al.
Published: (2025)
by: Abbas, Syed Konain, et al.
Published: (2025)
Stage-wise Adaptive Label Distribution for Facial Age Estimation
by: Wu, Bo, et al.
Published: (2025)
by: Wu, Bo, et al.
Published: (2025)
Similar Items
-
Rethinking Bias in Generative Data Augmentation for Medical AI: a Frequency Recalibration Method
by: Liu, Chi, et al.
Published: (2025) -
Learning to Look before Learning to Like: Incorporating Human Visual Cognition into Aesthetic Quality Assessment
by: Yu, Liwen, et al.
Published: (2026) -
Federated Balanced Learning
by: Li, Jiaze, et al.
Published: (2026) -
FSOD-VFM: Few-Shot Object Detection with Vision Foundation Models and Graph Diffusion
by: Feng, Chen-Bin, et al.
Published: (2026) -
RA-Det: Towards Universal Detection of AI-Generated Images via Robustness Asymmetry
by: Wang, Xinchang, et al.
Published: (2026)