| Main Authors: | Mas, Ignasi; Morros, Ramon; Hidalgo, Javier-Ruiz; Huerta, Ivan |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2603.04568 |
Similar Items
2D Representation for Unguided Single-View 3D Super-Resolution in Real-Time
by: Mas, Ignasi, et al.
Published: (2025)
Fast Unsupervised Tensor Restoration via Low-rank Deconvolution
by: Reixach, David, et al.
Published: (2024)
Localized Control in Diffusion Models via Latent Vector Prediction
by: Domingo-Gregorio, Pablo, et al.
Published: (2026)
Spatial-Mamba: Effective Visual State Space Models via Structure-aware State Fusion
by: Xiao, Chaodong, et al.
Published: (2024)
Trajectory-aware Shifted State Space Models for Online Video Super-Resolution
by: Zhu, Qiang, et al.
Published: (2025)
SaMam: Style-aware State Space Model for Arbitrary Image Style Transfer
by: Liu, Hongda, et al.
Published: (2025)
Linguistics-aware Masked Image Modeling for Self-supervised Scene Text Recognition
by: Zhang, Yifei, et al.
Published: (2025)
S4Fusion: Saliency-aware Selective State Space Model for Infrared Visible Image Fusion
by: Ma, Haolong, et al.
Published: (2024)
Efficient Progressive Image Compression with Variance-aware Masking
by: Presta, Alberto, et al.
Published: (2024)
Audio-visual Controlled Video Diffusion with Masked Selective State Spaces Modeling for Natural Talking Head Generation
by: Hong, Fa-Ting, et al.
Published: (2025)
Granular Ball Guided Masking: Structure-aware Data Augmentation
by: Xia, Shuyin, et al.
Published: (2025)
MATRIX: Mask Track Alignment for Interaction-aware Video Generation
by: Jin, Siyoon, et al.
Published: (2025)
Leveraging Text Localization for Scene Text Removal via Text-aware Masked Image Modeling
by: Wang, Zixiao, et al.
Published: (2024)
SAMKD: Spatial-aware Adaptive Masking Knowledge Distillation for Object Detection
by: Zhang, Zhourui, et al.
Published: (2025)
DWIM: Towards Tool-aware Visual Reasoning via Discrepancy-aware Workflow Generation & Instruct-Masking Tuning
by: Ke, Fucai, et al.
Published: (2025)
MaskRIS: Semantic Distortion-aware Data Augmentation for Referring Image Segmentation
by: Lee, Minhyun, et al.
Published: (2024)
Pan-cancer Histopathology WSI Pre-training with Position-aware Masked Autoencoder
by: Wu, Kun, et al.
Published: (2024)
MASA: Motion-aware Masked Autoencoder with Semantic Alignment for Sign Language Recognition
by: Zhao, Weichao, et al.
Published: (2024)
DFREC: DeepFake Identity Recovery Based on Identity-aware Masked Autoencoder
by: Yu, Peipeng, et al.
Published: (2024)
LADMIM: Logical Anomaly Detection with Masked Image Modeling in Discrete Latent Space
by: Sakai, Shunsuke, et al.
Published: (2024)
MaskedCLIP: Bridging the Masked and CLIP Space for Semi-Supervised Medical Vision-Language Pre-training
by: Zhu, Lei, et al.
Published: (2025)
EAST: Early Action Prediction Sampling Strategy with Token Masking
by: Sović, Iva, et al.
Published: (2026)
MC-PanDA: Mask Confidence for Panoptic Domain Adaptation
by: Martinović, Ivan, et al.
Published: (2024)
MaskMatch: Boosting Semi-Supervised Learning Through Mask Autoencoder-Driven Feature Learning
by: Zhang, Wenjin, et al.
Published: (2024)
Token-Space Mask Prediction for Efficient Vision Transformer Segmentation
by: Galagain, Calvin, et al.
Published: (2026)
AMLRIS: Alignment-aware Masked Learning for Referring Image Segmentation
by: Chen, Tongfei, et al.
Published: (2026)
Adaptive Texture-aware Masking for Self-Supervised Learning in 3D Dental CBCT Analysis
by: Yang, Xinquan, et al.
Published: (2026)
VMamba: Visual State Space Model
by: Liu, Yue, et al.
Published: (2024)
Event-aided Direct Sparse Odometry
by: Hidalgo-Carrió, Javier, et al.
Published: (2022)
Learning Mask Invariant Mutual Information for Masked Image Modeling
by: Huang, Tao, et al.
Published: (2025)
MaTe3D: Mask-guided Text-based 3D-aware Portrait Editing
by: Zhou, Kangneng, et al.
Published: (2023)
Instance-aware Image Colorization with Controllable Textual Descriptions and Segmentation Masks
by: An, Yanru, et al.
Published: (2025)
Image Forgery Localization with State Space Models
by: Lou, Zijie, et al.
Published: (2024)
Flood Data Analysis on SpaceNet 8 Using Apache Sedona
by: Bai, Yanbing, et al.
Published: (2024)
MaskDiff: Modeling Mask Distribution with Diffusion Probabilistic Model for Few-Shot Instance Segmentation
by: Le, Minh-Quan, et al.
Published: (2023)
Exploring State Space Model in Wavelet Domain: An Infrared and Visible Image Fusion Network via Wavelet Transform and State Space Model
by: Zhang, Tianpei, et al.
Published: (2025)
MaskMamba: A Hybrid Mamba-Transformer Model for Masked Image Generation
by: Chen, Wenchao, et al.
Published: (2024)
MaskGWM: A Generalizable Driving World Model with Video Mask Reconstruction
by: Ni, Jingcheng, et al.
Published: (2025)
Generalizable Deepfake Detection Based on Forgery-aware Layer Masking and Multi-artifact Subspace Decomposition
by: Zhang, Xiang, et al.
Published: (2026)
HMID-Net: An Exploration of Masked Image Modeling and Knowledge Distillation in Hyperbolic Space
by: Wang, Changli, et al.
Published: (2025)