Saved in:
| Main Authors: | Ma, Haoyu, Mahdizadehaghdam, Shahin, Wu, Bichen, Fan, Zhipeng, Gu, Yuchao, Zhao, Wenliang, Shapira, Lior, Xie, Xiaohui |
|---|---|
| Format: | Preprint |
| Published: |
2023
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2312.12468 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Masked Self-distilled Transducer-based Keyword Spotting with Semi-autoregressive Decoding
by: Xi, Yu, et al.
Published: (2025)
by: Xi, Yu, et al.
Published: (2025)
FreeMask: Rethinking the Importance of Attention Masks for Zero-Shot Video Editing
by: Cai, Lingling, et al.
Published: (2024)
by: Cai, Lingling, et al.
Published: (2024)
Towards Effective and Efficient Non-autoregressive Decoding Using Block-based Attention Mask
by: Wang, Tianzi, et al.
Published: (2024)
by: Wang, Tianzi, et al.
Published: (2024)
MaskCRT: Masked Conditional Residual Transformer for Learned Video Compression
by: Chen, Yi-Hsin, et al.
Published: (2023)
by: Chen, Yi-Hsin, et al.
Published: (2023)
Masked Temporal Interpolation Diffusion for Procedure Planning in Instructional Videos
by: Zhou, Yufan, et al.
Published: (2025)
by: Zhou, Yufan, et al.
Published: (2025)
Towards Efficient Diffusion-Based Image Editing with Instant Attention Masks
by: Zou, Siyu, et al.
Published: (2024)
by: Zou, Siyu, et al.
Published: (2024)
Video Super-Resolution Transformer with Masked Inter&Intra-Frame Attention
by: Zhou, Xingyu, et al.
Published: (2024)
by: Zhou, Xingyu, et al.
Published: (2024)
Masked Transformer for Electrocardiogram Classification
by: Zhou, Ya, et al.
Published: (2023)
by: Zhou, Ya, et al.
Published: (2023)
MAKIMA: Tuning-free Multi-Attribute Open-domain Video Editing via Mask-Guided Attention Modulation
by: Zheng, Haoyu, et al.
Published: (2024)
by: Zheng, Haoyu, et al.
Published: (2024)
LLaDA-TTS: Unifying Speech Synthesis and Zero-Shot Editing via Masked Diffusion Modeling
by: Fan, Xiaoyu, et al.
Published: (2026)
by: Fan, Xiaoyu, et al.
Published: (2026)
Mask2IV: Interaction-Centric Video Generation via Mask Trajectories
by: Li, Gen, et al.
Published: (2025)
by: Li, Gen, et al.
Published: (2025)
Polynomial Property Testing
by: Gishboliner, Lior, et al.
Published: (2025)
by: Gishboliner, Lior, et al.
Published: (2025)
Hypergraph removal with polynomial bounds
by: Gishboliner, Lior, et al.
Published: (2022)
by: Gishboliner, Lior, et al.
Published: (2022)
3D Mesh Editing using Masked LRMs
by: Gao, Will, et al.
Published: (2024)
by: Gao, Will, et al.
Published: (2024)
MRT: Masked Region Transformer for Layered Image Generation and Editing at Scale
by: Tang, Zhicong, et al.
Published: (2026)
by: Tang, Zhicong, et al.
Published: (2026)
Towards Effective and Efficient Non-autoregressive decoders for Conformer and LLM-based ASR using Block-based Attention Mask
by: Wang, Tianzi, et al.
Published: (2025)
by: Wang, Tianzi, et al.
Published: (2025)
Segment-Based Attention Masking for GPTs
by: Katz, Shahar, et al.
Published: (2024)
by: Katz, Shahar, et al.
Published: (2024)
Click2Mask: Local Editing with Dynamic Mask Generation
by: Regev, Omer, et al.
Published: (2024)
by: Regev, Omer, et al.
Published: (2024)
Mask$^2$DiT: Dual Mask-based Diffusion Transformer for Multi-Scene Long Video Generation
by: Qi, Tianhao, et al.
Published: (2025)
by: Qi, Tianhao, et al.
Published: (2025)
Deep Active Speech Cancellation with Mamba-Masking Network
by: Mishaly, Yehuda, et al.
Published: (2025)
by: Mishaly, Yehuda, et al.
Published: (2025)
MAT-SED: A Masked Audio Transformer with Masked-Reconstruction Based Pre-training for Sound Event Detection
by: Cai, Pengfei, et al.
Published: (2024)
by: Cai, Pengfei, et al.
Published: (2024)
Masked Generative Transformer Is What You Need for Image Editing
by: Chow, Wei, et al.
Published: (2026)
by: Chow, Wei, et al.
Published: (2026)
Masked Latent Transformer with the Random Masking Ratio to Advance the Diagnosis of Dental Fluorosis
by: Wu, Yun, et al.
Published: (2024)
by: Wu, Yun, et al.
Published: (2024)
Polyline Path Masked Attention for Vision Transformer
by: Zhao, Zhongchen, et al.
Published: (2025)
by: Zhao, Zhongchen, et al.
Published: (2025)
DreamSwapV: Mask-guided Subject Swapping for Any Customized Video Editing
by: Wang, Weitao, et al.
Published: (2025)
by: Wang, Weitao, et al.
Published: (2025)
StableMask: Refining Causal Masking in Decoder-only Transformer
by: Yin, Qingyu, et al.
Published: (2024)
by: Yin, Qingyu, et al.
Published: (2024)
Edit as You See: Image-guided Video Editing via Masked Motion Modeling
by: Huang, Zhi-Lin, et al.
Published: (2025)
by: Huang, Zhi-Lin, et al.
Published: (2025)
MuseFace: Text-driven Face Editing via Diffusion-based Mask Generation Approach
by: Zhang, Xin, et al.
Published: (2025)
by: Zhang, Xin, et al.
Published: (2025)
Benchmarking Segmentation Models with Mask-Preserved Attribute Editing
by: Yin, Zijin, et al.
Published: (2024)
by: Yin, Zijin, et al.
Published: (2024)
OmniInsert: Mask-Free Video Insertion of Any Reference via Diffusion Transformer Models
by: Chen, Jinshu, et al.
Published: (2025)
by: Chen, Jinshu, et al.
Published: (2025)
MEMLA: Enhancing Multilingual Knowledge Editing with Neuron-Masked Low-Rank Adaptation
by: Xie, Jiakuan, et al.
Published: (2024)
by: Xie, Jiakuan, et al.
Published: (2024)
EditMGT: Unleashing Potentials of Masked Generative Transformers in Image Editing
by: Chow, Wei, et al.
Published: (2025)
by: Chow, Wei, et al.
Published: (2025)
Mask-ControlNet: Higher-Quality Image Generation with An Additional Mask Prompt
by: Huang, Zhiqi, et al.
Published: (2024)
by: Huang, Zhiqi, et al.
Published: (2024)
MaskAttn-UNet: A Mask Attention-Driven Framework for Universal Low-Resolution Image Segmentation
by: Cheng, Anzhe, et al.
Published: (2025)
by: Cheng, Anzhe, et al.
Published: (2025)
Text-Guided Video Masked Autoencoder
by: Fan, David, et al.
Published: (2024)
by: Fan, David, et al.
Published: (2024)
NANO3D: A Training-Free Approach for Efficient 3D Editing Without Masks
by: Ye, Junliang, et al.
Published: (2025)
by: Ye, Junliang, et al.
Published: (2025)
INT-DTT+: Low-Complexity Data-Dependent Transforms for Video Coding
by: Fernández-Menduiña, Samuel, et al.
Published: (2025)
by: Fernández-Menduiña, Samuel, et al.
Published: (2025)
Masked Generative Video-to-Audio Transformers with Enhanced Synchronicity
by: Pascual, Santiago, et al.
Published: (2024)
by: Pascual, Santiago, et al.
Published: (2024)
NLE: Non-autoregressive LLM-based ASR by Transcript Editing
by: Dekel, Avihu, et al.
Published: (2026)
by: Dekel, Avihu, et al.
Published: (2026)
Masked Completion via Structured Diffusion with White-Box Transformers
by: Pai, Druv, et al.
Published: (2024)
by: Pai, Druv, et al.
Published: (2024)
Similar Items
-
Masked Self-distilled Transducer-based Keyword Spotting with Semi-autoregressive Decoding
by: Xi, Yu, et al.
Published: (2025) -
FreeMask: Rethinking the Importance of Attention Masks for Zero-Shot Video Editing
by: Cai, Lingling, et al.
Published: (2024) -
Towards Effective and Efficient Non-autoregressive Decoding Using Block-based Attention Mask
by: Wang, Tianzi, et al.
Published: (2024) -
MaskCRT: Masked Conditional Residual Transformer for Learned Video Compression
by: Chen, Yi-Hsin, et al.
Published: (2023) -
Masked Temporal Interpolation Diffusion for Procedure Planning in Instructional Videos
by: Zhou, Yufan, et al.
Published: (2025)