Saved in:
| Main Authors: | Maity, Dipan, Mondal, Suman, Roy, Arindam |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.06014 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
SAFE: Stable Alignment Finetuning with Entropy-Aware Predictive Control for Reinforcement Learning from Human Feedback (RLHF)
by: Maity, Dipan
Published: (2026)
by: Maity, Dipan
Published: (2026)
AuON: A Linear-time Alternative to Orthogonal Momentum Updates
by: Maity, Dipan
Published: (2025)
by: Maity, Dipan
Published: (2025)
Self-Supervised Modality-Agnostic Pre-Training of Swin Transformers
by: Talasila, Abhiroop, et al.
Published: (2024)
by: Talasila, Abhiroop, et al.
Published: (2024)
GatedFWA: Linear Flash Windowed Attention with Gated Associative Memory
by: Liu, Jiaxu, et al.
Published: (2025)
by: Liu, Jiaxu, et al.
Published: (2025)
SparseSwin: Swin Transformer with Sparse Transformer Block
by: Pinasthika, Krisna, et al.
Published: (2023)
by: Pinasthika, Krisna, et al.
Published: (2023)
Brain Hematoma Marker Recognition Using Multitask Learning: SwinTransformer and Swin-Unet
by: Hirata, Kodai, et al.
Published: (2025)
by: Hirata, Kodai, et al.
Published: (2025)
Deep Reinforcement Learning with Swin Transformers
by: Meng, Li, et al.
Published: (2022)
by: Meng, Li, et al.
Published: (2022)
DynamicGate MLP Conditional Computation via Learned Structural Dropout and Input Dependent Gating for Functional Plasticity
by: Choi, Yong Il
Published: (2026)
by: Choi, Yong Il
Published: (2026)
DualSwinFusionSeg: Multimodal Martian Landslide Segmentation via Dual Swin Transformer with Multi-Scale Fusion and UNet++
by: Kabir, Shahriar, et al.
Published: (2026)
by: Kabir, Shahriar, et al.
Published: (2026)
Residual-SwinCA-Net: A Channel-Aware Integrated Residual CNN-Swin Transformer for Malignant Lesion Segmentation in BUSI
by: Naz, Saeeda, et al.
Published: (2025)
by: Naz, Saeeda, et al.
Published: (2025)
Differential Gated Self-Attention
by: Lygizou, Elpiniki Maria, et al.
Published: (2025)
by: Lygizou, Elpiniki Maria, et al.
Published: (2025)
DIAR: Deep Image Alignment and Reconstruction using Swin Transformers
by: Kwiatkowski, Monika, et al.
Published: (2023)
by: Kwiatkowski, Monika, et al.
Published: (2023)
Multi-Channel Swin Transformer Framework for Bearing Remaining Useful Life Prediction
by: Mohajerzarrinkelk, Ali, et al.
Published: (2025)
by: Mohajerzarrinkelk, Ali, et al.
Published: (2025)
SwinGNN: Rethinking Permutation Invariance in Diffusion Models for Graph Generation
by: Yan, Qi, et al.
Published: (2023)
by: Yan, Qi, et al.
Published: (2023)
Learnability Window in Gated Recurrent Neural Networks
by: Livi, Lorenzo
Published: (2025)
by: Livi, Lorenzo
Published: (2025)
TwoHead-SwinFPN: A Unified DL Architecture for Synthetic Manipulation, Detection and Localization in Identity Documents
by: Naseeb, Chan, et al.
Published: (2026)
by: Naseeb, Chan, et al.
Published: (2026)
Fast and explainable clustering in the Manhattan and Tanimoto distance
by: Güttel, Stefan, et al.
Published: (2026)
by: Güttel, Stefan, et al.
Published: (2026)
StrideNET: Swin Transformer for Terrain Recognition with Dynamic Roughness Extraction
by: Shelare, Maitreya, et al.
Published: (2024)
by: Shelare, Maitreya, et al.
Published: (2024)
GyroSwin: 5D Surrogates for Gyrokinetic Plasma Turbulence Simulations
by: Paischer, Fabian, et al.
Published: (2025)
by: Paischer, Fabian, et al.
Published: (2025)
SwinGS: Sliding Window Gaussian Splatting for Volumetric Video Streaming with Arbitrary Length
by: Liu, Bangya, et al.
Published: (2024)
by: Liu, Bangya, et al.
Published: (2024)
Rethinking Gating Mechanism in Sparse MoE: Handling Arbitrary Modality Inputs with Confidence-Guided Gate
by: Zheng, Liangwei Nathan, et al.
Published: (2025)
by: Zheng, Liangwei Nathan, et al.
Published: (2025)
Data-driven Mesoscale Weather Forecasting Combining Swin-Unet and Diffusion Models
by: Hirabayashi, Yuta, et al.
Published: (2025)
by: Hirabayashi, Yuta, et al.
Published: (2025)
Voxel-Level Brain States Prediction Using Swin Transformer
by: Sun, Yifei, et al.
Published: (2025)
by: Sun, Yifei, et al.
Published: (2025)
Explaining Modern Gated-Linear RNNs via a Unified Implicit Attention Formulation
by: Zimerman, Itamar, et al.
Published: (2024)
by: Zimerman, Itamar, et al.
Published: (2024)
Flash Window Attention: speedup the attention computation for Swin Transformer
by: Zhang, Zhendong
Published: (2025)
by: Zhang, Zhendong
Published: (2025)
Enhancing DR Classification with Swin Transformer and Shifted Window Attention
by: Boulaabi, Meher, et al.
Published: (2025)
by: Boulaabi, Meher, et al.
Published: (2025)
An Efficient Approach to Detecting Lung Nodules Using Swin Transformer
by: Shakuri, Saeed, et al.
Published: (2025)
by: Shakuri, Saeed, et al.
Published: (2025)
CNCast: Leveraging 3D Swin Transformer and DiT for Enhanced Regional Weather Forecasting
by: Liang, Hongli, et al.
Published: (2025)
by: Liang, Hongli, et al.
Published: (2025)
Gated Graph Attention Networks with Learnable Temperature
by: Ma, Zhongtian, et al.
Published: (2026)
by: Ma, Zhongtian, et al.
Published: (2026)
Swin-UMamba: Mamba-based UNet with ImageNet-based pretraining
by: Liu, Jiarun, et al.
Published: (2024)
by: Liu, Jiarun, et al.
Published: (2024)
PhysEDA: Physics-Aware Learning Framework for Efficient EDA With Manhattan Distance Decay
by: Yang, Zetao
Published: (2026)
by: Yang, Zetao
Published: (2026)
In-context KV-Cache Eviction for LLMs via Attention-Gate
by: Zeng, Zihao, et al.
Published: (2024)
by: Zeng, Zihao, et al.
Published: (2024)
Understanding Gated Neurons in Transformers from Their Input-Output Functionality
by: Gerstner, Sebastian, et al.
Published: (2025)
by: Gerstner, Sebastian, et al.
Published: (2025)
Gating is Weighting: Understanding Gated Linear Attention through In-context Learning
by: Li, Yingcong, et al.
Published: (2025)
by: Li, Yingcong, et al.
Published: (2025)
SSI-GAN: Semi-Supervised Swin-Inspired Generative Adversarial Networks for Neuronal Spike Classification
by: Sharifrazi, Danial, et al.
Published: (2026)
by: Sharifrazi, Danial, et al.
Published: (2026)
IO Transformer: Evaluating SwinV2-Based Reward Models for Computer Vision
by: Meyer, Maxwell, et al.
Published: (2024)
by: Meyer, Maxwell, et al.
Published: (2024)
Long-horizon prediction of three-dimensional wall-bounded turbulence with CTA-Swin-UNet and resolvent analysis
by: Chen, Bo, et al.
Published: (2026)
by: Chen, Bo, et al.
Published: (2026)
GateTS: Versatile and Efficient Forecasting via Attention-Inspired routed Mixture-of-Experts
by: Yemets, Kyrylo, et al.
Published: (2025)
by: Yemets, Kyrylo, et al.
Published: (2025)
Gating Enables Curvature: A Geometric Expressivity Gap in Attention
by: Bathula, Satwik, et al.
Published: (2026)
by: Bathula, Satwik, et al.
Published: (2026)
Quadratic Gating Mixture of Experts: Statistical Insights into Self-Attention
by: Akbarian, Pedram, et al.
Published: (2024)
by: Akbarian, Pedram, et al.
Published: (2024)
Similar Items
-
SAFE: Stable Alignment Finetuning with Entropy-Aware Predictive Control for Reinforcement Learning from Human Feedback (RLHF)
by: Maity, Dipan
Published: (2026) -
AuON: A Linear-time Alternative to Orthogonal Momentum Updates
by: Maity, Dipan
Published: (2025) -
Self-Supervised Modality-Agnostic Pre-Training of Swin Transformers
by: Talasila, Abhiroop, et al.
Published: (2024) -
GatedFWA: Linear Flash Windowed Attention with Gated Associative Memory
by: Liu, Jiaxu, et al.
Published: (2025) -
SparseSwin: Swin Transformer with Sparse Transformer Block
by: Pinasthika, Krisna, et al.
Published: (2023)