Saved in:
| Main Authors: | Pan, Leyan, Cao, Xinyuan |
|---|---|
| Format: | Preprint |
| Published: |
2023
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2309.04644 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Wide Neural Networks Trained with Weight Decay Provably Exhibit Neural Collapse
by: Jacot, Arthur, et al.
Published: (2024)
by: Jacot, Arthur, et al.
Published: (2024)
Batch Normalization for Neural Networks on Complex Domains
by: Nguyen, Xuan Son, et al.
Published: (2026)
by: Nguyen, Xuan Son, et al.
Published: (2026)
Scaling Off-Policy Reinforcement Learning with Batch and Weight Normalization
by: Palenicek, Daniel, et al.
Published: (2025)
by: Palenicek, Daniel, et al.
Published: (2025)
Till the Layers Collapse: Compressing a Deep Neural Network through the Lenses of Batch Normalization Layers
by: Liao, Zhu, et al.
Published: (2024)
by: Liao, Zhu, et al.
Published: (2024)
Supervised Batch Normalization
by: Faye, Bilal, et al.
Published: (2024)
by: Faye, Bilal, et al.
Published: (2024)
Towards Better Generalization: Weight Decay Induces Low-rank Bias for Neural Networks
by: Chen, Ke, et al.
Published: (2024)
by: Chen, Ke, et al.
Published: (2024)
Power Lines: Scaling Laws for Weight Decay and Batch Size in LLM Pre-training
by: Bergsma, Shane, et al.
Published: (2025)
by: Bergsma, Shane, et al.
Published: (2025)
Batch Normalization Decomposed
by: Nachum, Ido, et al.
Published: (2024)
by: Nachum, Ido, et al.
Published: (2024)
Overcoming the Challenges of Batch Normalization in Federated Learning
by: Guerraoui, Rachid, et al.
Published: (2024)
by: Guerraoui, Rachid, et al.
Published: (2024)
Impact of Batch Normalization on Convolutional Network Representations
by: Potgieter, Hermanus L., et al.
Published: (2025)
by: Potgieter, Hermanus L., et al.
Published: (2025)
SGD and Weight Decay Secretly Minimize the Rank of Your Neural Network
by: Galanti, Tomer, et al.
Published: (2022)
by: Galanti, Tomer, et al.
Published: (2022)
Correction of Decoupled Weight Decay
by: Chou, Jason Chuan-Chih
Published: (2025)
by: Chou, Jason Chuan-Chih
Published: (2025)
Cautious Weight Decay
by: Chen, Lizhang, et al.
Published: (2025)
by: Chen, Lizhang, et al.
Published: (2025)
MimicNorm: Weight Mean and Last BN Layer Mimic the Dynamic of Batch Normalization
by: Fei, Wen, et al.
Published: (2020)
by: Fei, Wen, et al.
Published: (2020)
Towards Understanding and Mitigating Dimensional Collapse in Heterogeneous Federated Learning
by: Shi, Yujun, et al.
Published: (2022)
by: Shi, Yujun, et al.
Published: (2022)
On the Robustness of Neural Collapse and the Neural Collapse of Robustness
by: Su, Jingtong, et al.
Published: (2023)
by: Su, Jingtong, et al.
Published: (2023)
Batch Normalization Amplifies Memorization and Privacy Risks
by: Doan, Ngoc Phu, et al.
Published: (2026)
by: Doan, Ngoc Phu, et al.
Published: (2026)
Riemannian Batch Normalization: A Gyro Approach
by: Chen, Ziheng, et al.
Published: (2025)
by: Chen, Ziheng, et al.
Published: (2025)
Rotational Equilibrium: How Weight Decay Balances Learning Across Neural Networks
by: Kosson, Atli, et al.
Published: (2023)
by: Kosson, Atli, et al.
Published: (2023)
Can We Understand Plasticity Through Neural Collapse?
by: Bonifazi, Guglielmo, et al.
Published: (2024)
by: Bonifazi, Guglielmo, et al.
Published: (2024)
An Investigation of Batch Normalization in Off-Policy Actor-Critic Algorithms
by: Wang, Li, et al.
Published: (2025)
by: Wang, Li, et al.
Published: (2025)
Admissibility of Stein Shrinkage for Batch Normalization in the Presence of Adversarial Attacks
by: Ivolgina, Sofia, et al.
Published: (2025)
by: Ivolgina, Sofia, et al.
Published: (2025)
Stochastic Normalized Gradient Descent with Momentum for Large-Batch Training
by: Zhao, Shen-Yi, et al.
Published: (2020)
by: Zhao, Shen-Yi, et al.
Published: (2020)
Batch Normalization-Free Fully Integer Quantized Neural Networks via Progressive Tandem Learning
by: Sun, Pengfei, et al.
Published: (2025)
by: Sun, Pengfei, et al.
Published: (2025)
Making Batch Normalization Great in Federated Deep Learning
by: Zhong, Jike, et al.
Published: (2023)
by: Zhong, Jike, et al.
Published: (2023)
Adaptive Batch Normalization Networks for Adversarial Robustness
by: Lo, Shao-Yuan, et al.
Published: (2024)
by: Lo, Shao-Yuan, et al.
Published: (2024)
Robust Weight Imprinting: Insights from Neural Collapse and Proxy-Based Aggregation
by: Westerhoff, Justus, et al.
Published: (2025)
by: Westerhoff, Justus, et al.
Published: (2025)
Higher-Order Asymptotics of Test-Time Adaptation for Batch Normalization Statistics
by: Kimura, Masanari
Published: (2025)
by: Kimura, Masanari
Published: (2025)
BN-SCAFFOLD: controlling the drift of Batch Normalization statistics in Federated Learning
by: Quintana, Gonzalo Iñaki, et al.
Published: (2024)
by: Quintana, Gonzalo Iñaki, et al.
Published: (2024)
Rethinking Efficiency in Neural Combinatorial Optimization: Batched Preference Optimization with Mamba
by: Xu, Zhenxing, et al.
Published: (2026)
by: Xu, Zhenxing, et al.
Published: (2026)
Golden Ratio Weighting Prevents Model Collapse
by: He, Hengzhi, et al.
Published: (2025)
by: He, Hengzhi, et al.
Published: (2025)
StableGrad: Backward Scale Control without Batch Normalization
by: Mestre, Jose I., et al.
Published: (2026)
by: Mestre, Jose I., et al.
Published: (2026)
Feature Normalization Prevents Collapse of Non-contrastive Learning Dynamics
by: Bao, Han
Published: (2023)
by: Bao, Han
Published: (2023)
Understanding Representation of Deep Equilibrium Models from Neural Collapse Perspective
by: Sun, Haixiang, et al.
Published: (2024)
by: Sun, Haixiang, et al.
Published: (2024)
Unraveling Batch Normalization for Realistic Test-Time Adaptation
by: Su, Zixian, et al.
Published: (2023)
by: Su, Zixian, et al.
Published: (2023)
A Lie Group Approach to Riemannian Batch Normalization
by: Chen, Ziheng, et al.
Published: (2024)
by: Chen, Ziheng, et al.
Published: (2024)
Guiding Neural Collapse: Optimising Towards the Nearest Simplex Equiangular Tight Frame
by: Markou, Evan, et al.
Published: (2024)
by: Markou, Evan, et al.
Published: (2024)
Does Weight Decay Enhance Training Stability?
by: Saether, Marius, et al.
Published: (2026)
by: Saether, Marius, et al.
Published: (2026)
CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity
by: Bhatt, Aditya, et al.
Published: (2019)
by: Bhatt, Aditya, et al.
Published: (2019)
Training-Time Batch Normalization Reshapes Local Partition Geometry in Piecewise-Affine Networks
by: Qi, Xuan, et al.
Published: (2026)
by: Qi, Xuan, et al.
Published: (2026)
Similar Items
-
Wide Neural Networks Trained with Weight Decay Provably Exhibit Neural Collapse
by: Jacot, Arthur, et al.
Published: (2024) -
Batch Normalization for Neural Networks on Complex Domains
by: Nguyen, Xuan Son, et al.
Published: (2026) -
Scaling Off-Policy Reinforcement Learning with Batch and Weight Normalization
by: Palenicek, Daniel, et al.
Published: (2025) -
Till the Layers Collapse: Compressing a Deep Neural Network through the Lenses of Batch Normalization Layers
by: Liao, Zhu, et al.
Published: (2024) -
Supervised Batch Normalization
by: Faye, Bilal, et al.
Published: (2024)