Saved in:
| Main Authors: | Liu, Qiankun, Jiang, Yuqi, Tan, Zhentao, Chen, Dongdong, Fu, Ying, Chu, Qi, Hua, Gang, Yu, Nenghai |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2404.00513 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Towards More Unified In-context Visual Understanding
by: Sheng, Dianmo, et al.
Published: (2023)
by: Sheng, Dianmo, et al.
Published: (2023)
TCI-Former: Thermal Conduction-Inspired Transformer for Infrared Small Target Detection
by: Chen, Tianxiang, et al.
Published: (2024)
by: Chen, Tianxiang, et al.
Published: (2024)
Learning to Focus and Precise Cropping: A Reinforcement Learning Framework with Information Gaps and Grounding Loss for MLLMs
by: Zhao, Xuanpu, et al.
Published: (2026)
by: Zhao, Xuanpu, et al.
Published: (2026)
Pluralistic Salient Object Detection
by: Feng, Xuelu, et al.
Published: (2024)
by: Feng, Xuelu, et al.
Published: (2024)
Siamese-DETR for Generic Multi-Object Tracking
by: Liu, Qiankun, et al.
Published: (2023)
by: Liu, Qiankun, et al.
Published: (2023)
LAKAN: Landmark-assisted Adaptive Kolmogorov-Arnold Network for Face Forgery Detection
by: Jiang, Jiayao, et al.
Published: (2025)
by: Jiang, Jiayao, et al.
Published: (2025)
Mixture-of-Noises Enhanced Forgery-Aware Predictor for Multi-Face Manipulation Detection and Localization
by: Miao, Changtao, et al.
Published: (2024)
by: Miao, Changtao, et al.
Published: (2024)
MiM-ISTD: Mamba-in-Mamba for Efficient Infrared Small Target Detection
by: Chen, Tianxiang, et al.
Published: (2024)
by: Chen, Tianxiang, et al.
Published: (2024)
Multi-spectral Class Center Network for Face Manipulation Detection and Localization
by: Miao, Changtao, et al.
Published: (2023)
by: Miao, Changtao, et al.
Published: (2023)
Context-Aware Weakly Supervised Image Manipulation Localization with SAM Refinement
by: Wang, Xinghao, et al.
Published: (2025)
by: Wang, Xinghao, et al.
Published: (2025)
Bootstrapping Audio-Visual Segmentation by Strengthening Audio Cues
by: Chen, Tianxiang, et al.
Published: (2024)
by: Chen, Tianxiang, et al.
Published: (2024)
Complete Instances Mining for Weakly Supervised Instance Segmentation
by: Li, Zecheng, et al.
Published: (2024)
by: Li, Zecheng, et al.
Published: (2024)
Scale Your Instructions: Enhance the Instruction-Following Fidelity of Unified Image Generation Model by Self-Adaptive Attention Scaling
by: Zhou, Chao, et al.
Published: (2025)
by: Zhou, Chao, et al.
Published: (2025)
Reference-based Category Discovery: Unsupervised Object Detection with Category Awareness
by: Li, Yichen, et al.
Published: (2026)
by: Li, Yichen, et al.
Published: (2026)
Image Copy Detection for Diffusion Models
by: Wang, Wenhao, et al.
Published: (2024)
by: Wang, Wenhao, et al.
Published: (2024)
Advancing Aesthetic Image Generation via Composition Transfer
by: Zou, Kai, et al.
Published: (2026)
by: Zou, Kai, et al.
Published: (2026)
Improving Detail in Pluralistic Image Inpainting with Feature Dequantization
by: Park, Kyungri, et al.
Published: (2024)
by: Park, Kyungri, et al.
Published: (2024)
Exploiting Modality-Specific Features For Multi-Modal Manipulation Detection And Grounding
by: Wang, Jiazhen, et al.
Published: (2023)
by: Wang, Jiazhen, et al.
Published: (2023)
CMFDFormer: Transformer-based Copy-Move Forgery Detection with Continual Learning
by: Liu, Yaqi, et al.
Published: (2023)
by: Liu, Yaqi, et al.
Published: (2023)
Infrared Small Target Detection with Scale and Location Sensitivity
by: Liu, Qiankun, et al.
Published: (2024)
by: Liu, Qiankun, et al.
Published: (2024)
Multi-Object Tracking in the Dark
by: Wang, Xinzhe, et al.
Published: (2024)
by: Wang, Xinzhe, et al.
Published: (2024)
WMVLM: Evaluating Diffusion Model Image Watermarking via Vision-Language Models
by: Yang, Zijin, et al.
Published: (2026)
by: Yang, Zijin, et al.
Published: (2026)
Don't Look into the Dark: Latent Codes for Pluralistic Image Inpainting
by: Chen, Haiwei, et al.
Published: (2024)
by: Chen, Haiwei, et al.
Published: (2024)
Jigsaw++: Imagining Complete Shape Priors for Object Reassembly
by: Lu, Jiaxin, et al.
Published: (2024)
by: Lu, Jiaxin, et al.
Published: (2024)
Rethinking Information Loss in Medical Image Segmentation with Various-sized Targets
by: Liu, Tianyi, et al.
Published: (2024)
by: Liu, Tianyi, et al.
Published: (2024)
SAPL: Semantic-Agnostic Prompt Learning in CLIP for Weakly Supervised Image Manipulation Localization
by: Wang, Xinghao, et al.
Published: (2026)
by: Wang, Xinghao, et al.
Published: (2026)
Training-Free In-Context Forensic Chain for Image Manipulation Detection and Localization
by: Chen, Rui, et al.
Published: (2025)
by: Chen, Rui, et al.
Published: (2025)
An Enhanced Encoder-Decoder Network Architecture for Reducing Information Loss in Image Semantic Segmentation
by: Gao, Zijun, et al.
Published: (2024)
by: Gao, Zijun, et al.
Published: (2024)
Natias: Neuron Attribution based Transferable Image Adversarial Steganography
by: Fan, Zexin, et al.
Published: (2024)
by: Fan, Zexin, et al.
Published: (2024)
Origin Identification for Text-Guided Image-to-Image Diffusion Models
by: Wang, Wenhao, et al.
Published: (2025)
by: Wang, Wenhao, et al.
Published: (2025)
Scale Propagation Network for Generalizable Depth Completion
by: Wang, Haotian, et al.
Published: (2024)
by: Wang, Haotian, et al.
Published: (2024)
CompleteMe: Reference-based Human Image Completion
by: Tsai, Yu-Ju, et al.
Published: (2025)
by: Tsai, Yu-Ju, et al.
Published: (2025)
AnyPattern: Towards In-context Image Copy Detection
by: Wang, Wenhao, et al.
Published: (2024)
by: Wang, Wenhao, et al.
Published: (2024)
SAMST: A Transformer framework based on SAM pseudo label filtering for remote sensing semi-supervised semantic segmentation
by: Yin, Jun, et al.
Published: (2025)
by: Yin, Jun, et al.
Published: (2025)
RIRF: Reasoning Image Restoration Framework
by: Yan, Wending, et al.
Published: (2026)
by: Yan, Wending, et al.
Published: (2026)
Clean Image May be Dangerous: Data Poisoning Attacks Against Deep Hashing
by: Li, Shuai, et al.
Published: (2025)
by: Li, Shuai, et al.
Published: (2025)
From Pixels to Views: Learning Angular-Aware and Physics-Consistent Representations for Light Field Microscopy
by: He, Feng, et al.
Published: (2025)
by: He, Feng, et al.
Published: (2025)
Hyper-Transformer for Amodal Completion
by: Gao, Jianxiong, et al.
Published: (2024)
by: Gao, Jianxiong, et al.
Published: (2024)
MAFE R-CNN: Selecting More Samples to Learn Category-aware Features for Small Object Detection
by: Li, Yichen, et al.
Published: (2025)
by: Li, Yichen, et al.
Published: (2025)
Transformer for Multitemporal Hyperspectral Image Unmixing
by: Li, Hang, et al.
Published: (2024)
by: Li, Hang, et al.
Published: (2024)
Similar Items
-
Towards More Unified In-context Visual Understanding
by: Sheng, Dianmo, et al.
Published: (2023) -
TCI-Former: Thermal Conduction-Inspired Transformer for Infrared Small Target Detection
by: Chen, Tianxiang, et al.
Published: (2024) -
Learning to Focus and Precise Cropping: A Reinforcement Learning Framework with Information Gaps and Grounding Loss for MLLMs
by: Zhao, Xuanpu, et al.
Published: (2026) -
Pluralistic Salient Object Detection
by: Feng, Xuelu, et al.
Published: (2024) -
Siamese-DETR for Generic Multi-Object Tracking
by: Liu, Qiankun, et al.
Published: (2023)