| Main Authors: | Tran, Huy, Milkert, Max, Hyde, David |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.12879 |
Similar Items
Compelling ReLU Networks to Exhibit Exponentially Many Linear Regions at Initialization and During Training
by: Milkert, Max, et al.
Published: (2023)
ESPFormer: Doubly-Stochastic Attention with Expected Sliced Transport Plans
by: Shahbazi, Ashkan, et al.
Published: (2025)
ASAP: Attention Sink Anchored Pruning
by: Lee, Jaehyuk, et al.
Published: (2026)
Amortized Optimal Transport from Sliced Potentials
by: Truong, Minh-Phuc, et al.
Published: (2026)
LOTFormer: Doubly-Stochastic Linear Attention via Low-Rank Optimal Transport
by: Shahbazi, Ashkan, et al.
Published: (2025)
Efficient Sliced Wasserstein Distance Computation via Adaptive Bayesian Optimization
by: Acharya, Manish, et al.
Published: (2025)
Tree-Sliced Wasserstein Distance with Nonlinear Projection
by: Tran, Thanh, et al.
Published: (2025)
Understanding Learning with Sliced-Wasserstein Requires Rethinking Informative Slices
by: Tran, Huy, et al.
Published: (2024)
ASAP: Attention-Shift-Aware Pruning for Efficient LVLM Inference
by: Pathak, Surendra, et al.
Published: (2026)
Stereographic Spherical Sliced Wasserstein Distances
by: Tran, Huy, et al.
Published: (2024)
Doubly Stochastic Adaptive Neighbors Clustering via the Marcus Mapping
by: Yuan, Jinghui, et al.
Published: (2024)
Demystifying SGD with Doubly Stochastic Gradients
by: Kim, Kyurae, et al.
Published: (2024)
Stochastic Amortization: A Unified Approach to Accelerate Feature and Data Attribution
by: Covert, Ian, et al.
Published: (2024)
Learning Symmetries via Weight-Sharing with Doubly Stochastic Tensors
by: van der Linden, Putri A., et al.
Published: (2024)
Amortized Variational Inference: When and Why?
by: Margossian, Charles C., et al.
Published: (2023)
Doubly Stochastic Mean-Shift Clustering
by: Trigano, Tom, et al.
Published: (2026)
Proximal Projection for Doubly Sparse Regularized Models
by: He, Jia Wei, et al.
Published: (2026)
Row-Stochastic Matrices Can Provably Outperform Doubly Stochastic Matrices in Decentralized Learning
by: Liu, Bing, et al.
Published: (2025)
Beyond the Laplacian: Doubly Stochastic Matrices for Graph Neural Networks
by: Hu, Zhaobo, et al.
Published: (2026)
Amortized Bayesian Workflow
by: Li, Chengkun, et al.
Published: (2024)
ASAP: Exploiting the Satisficing Generalization Edge in Neural Combinatorial Optimization
by: Fang, Han, et al.
Published: (2025)
EEG-Titans: Long-Horizon Seizure Forecasting via Dual-Branch Attention and Neural Memory
by: Pham, Tien-Dat, et al.
Published: (2026)
Markovian Sliced Wasserstein Distances: Beyond Independent Projections
by: Nguyen, Khai, et al.
Published: (2023)
ASAP: Unsupervised Post-training with Label Distribution Shift Adaptive Learning Rate
by: Park, Heewon, et al.
Published: (2025)
Quantum Doubly Stochastic Transformers
by: Born, Jannis, et al.
Published: (2025)
The Homogeneity Trap: Spectral Collapse in Doubly-Stochastic Deep Networks
by: Liu, Yizhi
Published: (2026)
Amortized Conditional Independence Testing
by: Duong, Bao, et al.
Published: (2025)
Amortized nonmyopic active search via deep imitation learning
by: Nguyen, Quan, et al.
Published: (2024)
Minimax-Optimal Two-Sample Test with Sliced Wasserstein
by: Tran, Binh Thuan, et al.
Published: (2025)
Intrinsic Reward Policy Optimization for Sparse-Reward Environments
by: Cho, Minjae, et al.
Published: (2026)
Amortized Bayesian Mixture Models
by: Kucharský, Šimon, et al.
Published: (2025)
Amortized Bayesian Multilevel Models
by: Habermann, Daniel, et al.
Published: (2024)
Neural Methods for Amortized Inference
by: Zammit-Mangion, Andrew, et al.
Published: (2024)
Doubly Adaptive Channel and Spatial Attention for Semantic Image Communication by IoT Devices
by: Miri, Soroosh, et al.
Published: (2026)
Tree-Sliced Wasserstein Distance: A Geometric Perspective
by: Tran, Viet-Hoang, et al.
Published: (2024)
Partially Observable Gaussian Process Network and Doubly Stochastic Variational Inference
by: Kiroriwal, Saksham, et al.
Published: (2025)
Amortizing Causal Sensitivity Analysis via Prior Data-Fitted Networks
by: Javurek, Emil, et al.
Published: (2026)
SurvivalPFN: Amortizing Survival Prediction via In-Context Bayesian Inference
by: Qi, Shi-ang, et al.
Published: (2026)
CausalPFN: Amortized Causal Effect Estimation via In-Context Learning
by: Balazadeh, Vahid, et al.
Published: (2025)
Amortized Inference of Causal Models via Conditional Fixed-Point Iterations
by: Mahajan, Divyat, et al.
Published: (2024)