:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Mas, Ignasi, Morros, Ramon, Hidalgo, Javier-Ruiz, Huerta, Ivan
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2603.04568
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

2D Representation for Unguided Single-View 3D Super-Resolution in Real-Time
by: Mas, Ignasi, et al.
Published: (2025)

Fast Unsupervised Tensor Restoration via Low-rank Deconvolution
by: Reixach, David, et al.
Published: (2024)

Localized Control in Diffusion Models via Latent Vector Prediction
by: Domingo-Gregorio, Pablo, et al.
Published: (2026)

Spatial-Mamba: Effective Visual State Space Models via Structure-aware State Fusion
by: Xiao, Chaodong, et al.
Published: (2024)

Trajectory-aware Shifted State Space Models for Online Video Super-Resolution
by: Zhu, Qiang, et al.
Published: (2025)

SaMam: Style-aware State Space Model for Arbitrary Image Style Transfer
by: Liu, Hongda, et al.
Published: (2025)

Linguistics-aware Masked Image Modeling for Self-supervised Scene Text Recognition
by: Zhang, Yifei, et al.
Published: (2025)

S4Fusion: Saliency-aware Selective State Space Model for Infrared Visible Image Fusion
by: Ma, Haolong, et al.
Published: (2024)

Efficient Progressive Image Compression with Variance-aware Masking
by: Presta, Alberto, et al.
Published: (2024)

Audio-visual Controlled Video Diffusion with Masked Selective State Spaces Modeling for Natural Talking Head Generation
by: Hong, Fa-Ting, et al.
Published: (2025)

Granular Ball Guided Masking: Structure-aware Data Augmentation
by: Xia, Shuyin, et al.
Published: (2025)

MATRIX: Mask Track Alignment for Interaction-aware Video Generation
by: Jin, Siyoon, et al.
Published: (2025)

Leveraging Text Localization for Scene Text Removal via Text-aware Masked Image Modeling
by: Wang, Zixiao, et al.
Published: (2024)

SAMKD: Spatial-aware Adaptive Masking Knowledge Distillation for Object Detection
by: Zhang, Zhourui, et al.
Published: (2025)

DWIM: Towards Tool-aware Visual Reasoning via Discrepancy-aware Workflow Generation & Instruct-Masking Tuning
by: Ke, Fucai, et al.
Published: (2025)

MaskRIS: Semantic Distortion-aware Data Augmentation for Referring Image Segmentation
by: Lee, Minhyun, et al.
Published: (2024)

Pan-cancer Histopathology WSI Pre-training with Position-aware Masked Autoencoder
by: Wu, Kun, et al.
Published: (2024)

MASA: Motion-aware Masked Autoencoder with Semantic Alignment for Sign Language Recognition
by: Zhao, Weichao, et al.
Published: (2024)

DFREC: DeepFake Identity Recovery Based on Identity-aware Masked Autoencoder
by: Yu, Peipeng, et al.
Published: (2024)

LADMIM: Logical Anomaly Detection with Masked Image Modeling in Discrete Latent Space
by: Sakai, Shunsuke, et al.
Published: (2024)

MaskedCLIP: Bridging the Masked and CLIP Space for Semi-Supervised Medical Vision-Language Pre-training
by: Zhu, Lei, et al.
Published: (2025)

EAST: Early Action Prediction Sampling Strategy with Token Masking
by: Sović, Iva, et al.
Published: (2026)

MC-PanDA: Mask Confidence for Panoptic Domain Adaptation
by: Martinović, Ivan, et al.
Published: (2024)

MaskMatch: Boosting Semi-Supervised Learning Through Mask Autoencoder-Driven Feature Learning
by: Zhang, Wenjin, et al.
Published: (2024)

Token-Space Mask Prediction for Efficient Vision Transformer Segmentation
by: Galagain, Calvin, et al.
Published: (2026)

AMLRIS: Alignment-aware Masked Learning for Referring Image Segmentation
by: Chen, Tongfei, et al.
Published: (2026)

Adaptive Texture-aware Masking for Self-Supervised Learning in 3D Dental CBCT Analysis
by: Yang, Xinquan, et al.
Published: (2026)

VMamba: Visual State Space Model
by: Liu, Yue, et al.
Published: (2024)

Event-aided Direct Sparse Odometry
by: Hidalgo-Carrió, Javier, et al.
Published: (2022)

Learning Mask Invariant Mutual Information for Masked Image Modeling
by: Huang, Tao, et al.
Published: (2025)

MaTe3D: Mask-guided Text-based 3D-aware Portrait Editing
by: Zhou, Kangneng, et al.
Published: (2023)

Instance-aware Image Colorization with Controllable Textual Descriptions and Segmentation Masks
by: An, Yanru, et al.
Published: (2025)

Image Forgery Localization with State Space Models
by: Lou, Zijie, et al.
Published: (2024)

Flood Data Analysis on SpaceNet 8 Using Apache Sedona
by: Bai, Yanbing, et al.
Published: (2024)

MaskDiff: Modeling Mask Distribution with Diffusion Probabilistic Model for Few-Shot Instance Segmentation
by: Le, Minh-Quan, et al.
Published: (2023)

Exploring State Space Model in Wavelet Domain: An Infrared and Visible Image Fusion Network via Wavelet Transform and State Space Model
by: Zhang, Tianpei, et al.
Published: (2025)

MaskMamba: A Hybrid Mamba-Transformer Model for Masked Image Generation
by: Chen, Wenchao, et al.
Published: (2024)

MaskGWM: A Generalizable Driving World Model with Video Mask Reconstruction
by: Ni, Jingcheng, et al.
Published: (2025)

Generalizable Deepfake Detection Based on Forgery-aware Layer Masking and Multi-artifact Subspace Decomposition
by: Zhang, Xiang, et al.
Published: (2026)

HMID-Net: An Exploration of Masked Image Modeling and Knowledge Distillation in Hyperbolic Space
by: Wang, Changli, et al.
Published: (2025)