:: Library Catalog

Buchumschlag

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Huang, Wen, Yang, Jiarui, Dai, Tao, Li, Jiawei, Zhan, Shaoxiong, Wang, Bin, Xia, Shu-Tao
Format:	Preprint
Veröffentlicht:	2025
Schlagworte:	Computer Vision and Pattern Recognition Artificial Intelligence
Online-Zugang:	https://arxiv.org/abs/2508.09459
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Ähnliche Einträge

Looking Back and Forth: Cross-Image Attention Calibration and Attentive Preference Learning for Multi-Image Hallucination Mitigation
von: Yang, Xiaochen, et al.
Veröffentlicht: (2026)

GMMFormer v2: An Uncertainty-aware Framework for Partially Relevant Video Retrieval
von: Wang, Yuting, et al.
Veröffentlicht: (2024)

Personalized Face Super-Resolution with Identity Decoupling and Fitting
von: Yang, Jiarui, et al.
Veröffentlicht: (2025)

Many-for-Many: Unify the Training of Multiple Video and Image Generation and Manipulation Tasks
von: Li, Ruibin, et al.
Veröffentlicht: (2025)

Context-Aware Weakly Supervised Image Manipulation Localization with SAM Refinement
von: Wang, Xinghao, et al.
Veröffentlicht: (2025)

DuoFormer: Leveraging Hierarchical Visual Representations by Local and Global Attention
von: Tang, Xiaoya, et al.
Veröffentlicht: (2024)

CLIP-Guided Generative Networks for Transferable Targeted Adversarial Attacks
von: Fang, Hao, et al.
Veröffentlicht: (2024)

DuoFormer: Leveraging Hierarchical Representations by Local and Global Attention Vision Transformer
von: Tang, Xiaoya, et al.
Veröffentlicht: (2025)

LCM: Locally Constrained Compact Point Cloud Model for Masked Point Modeling
von: Zha, Yaohua, et al.
Veröffentlicht: (2024)

Global2Local: A Joint-Hierarchical Attention for Video Captioning
von: Dai, Chengpeng, et al.
Veröffentlicht: (2022)

VIA: Unified Spatiotemporal Video Adaptation Framework for Global and Local Video Editing
von: Gu, Jing, et al.
Veröffentlicht: (2024)

ManipShield: A Unified Framework for Image Manipulation Detection, Localization and Explanation
von: Xu, Zitong, et al.
Veröffentlicht: (2025)

Training-Free In-Context Forensic Chain for Image Manipulation Detection and Localization
von: Chen, Rui, et al.
Veröffentlicht: (2025)

Improving Deepfake Detection with Reinforcement Learning-Based Adaptive Data Augmentation
von: Zhou, Yuxuan, et al.
Veröffentlicht: (2025)

UniVST: A Unified Framework for Training-free Localized Video Style Transfer
von: Song, Quanjian, et al.
Veröffentlicht: (2024)

Omni-IML: Towards Unified Image Manipulation Localization
von: Qu, Chenfan, et al.
Veröffentlicht: (2024)

PHPQ: Pyramid Hybrid Pooling Quantization for Efficient Fine-Grained Image Retrieval
von: Zeng, Ziyun, et al.
Veröffentlicht: (2021)

Parameter Efficient Adaptation for Image Restoration with Heterogeneous Mixture-of-Experts
von: Guo, Hang, et al.
Veröffentlicht: (2023)

Protecting Your Video Content: Disrupting Automated Video-based LLM Annotations
von: Liu, Haitong, et al.
Veröffentlicht: (2025)

SpatialFormer: Semantic and Target Aware Attentions for Few-Shot Learning
von: Lai, Jinxiang, et al.
Veröffentlicht: (2023)

Suite-IN++: A FlexiWear BodyNet Integrating Global and Local Motion Features from Apple Suite for Robust Inertial Navigation
von: Sun, Lan, et al.
Veröffentlicht: (2025)

Unsupervised Deformable Image Registration with Local-Global Attention and Image Decomposition
von: Huang, Zhengyong, et al.
Veröffentlicht: (2026)

LoFormer: Local Frequency Transformer for Image Deblurring
von: Mao, Xintian, et al.
Veröffentlicht: (2024)

GLGait: A Global-Local Temporal Receptive Field Network for Gait Recognition in the Wild
von: Peng, Guozhen, et al.
Veröffentlicht: (2024)

Proto-Former: Unified Facial Landmark Detection by Prototype Transformer
von: Hu, Shengkai, et al.
Veröffentlicht: (2025)

UGD-IML: A Unified Generative Diffusion-based Framework for Constrained and Unconstrained Image Manipulation Localization
von: Mi, Yachun, et al.
Veröffentlicht: (2025)

Self-supervised Representation Learning with Local Aggregation for Image-based Profiling
von: Dai, Siran, et al.
Veröffentlicht: (2025)

Efficiency Follows Global-Local Decoupling
von: Yang, Zhenyu, et al.
Veröffentlicht: (2026)

Unifying Global-Local Representations in Salient Object Detection with Transformer
von: Ren, Sucheng, et al.
Veröffentlicht: (2021)

3D-LMVIC: Learning-based Multi-View Image Coding with 3D Gaussian Geometric Priors
von: Huang, Yujun, et al.
Veröffentlicht: (2024)

MUSE: Manipulating Unified Framework for Synthesizing Emotions in Images via Test-Time Optimization
von: Xia, Yingjie, et al.
Veröffentlicht: (2025)

BoostAdapter: Improving Vision-Language Test-Time Adaptation via Regional Bootstrapping
von: Zhang, Taolin, et al.
Veröffentlicht: (2024)

RelayGS: Reconstructing Dynamic Scenes with Large-Scale and Complex Motions via Relay Gaussians
von: Gao, Qiankun, et al.
Veröffentlicht: (2024)

MambaIR: A Simple Baseline for Image Restoration with State-Space Model
von: Guo, Hang, et al.
Veröffentlicht: (2024)

Fractal-IR: A Unified Framework for Efficient and Scalable Image Restoration
von: Li, Yawei, et al.
Veröffentlicht: (2025)

Pre-training Point Cloud Compact Model with Partial-aware Reconstruction
von: Zha, Yaohua, et al.
Veröffentlicht: (2024)

IML-ViT: Benchmarking Image Manipulation Localization by Vision Transformer
von: Ma, Xiaochen, et al.
Veröffentlicht: (2023)

UVL2: A Unified Framework for Video Tampering Localization
von: Pei, Pengfei
Veröffentlicht: (2023)

Unified Local and Global Attention Interaction Modeling for Vision Transformers
von: Nguyen, Tan, et al.
Veröffentlicht: (2024)

Attention to Detail: Global-Local Attention for High-Resolution AI-Generated Image Detection
von: Han, Lawrence
Veröffentlicht: (2026)