Saved in:
| Main Author: | Han, Yankun |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.09423 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Optimal Condition for Initialization Variance in Deep Neural Networks: An SGD Dynamics Perspective
by: Horii, Hiroshi, et al.
Published: (2025)
by: Horii, Hiroshi, et al.
Published: (2025)
Optimized Weight Initialization on the Stiefel Manifold for Deep ReLU Neural Networks
by: Lee, Hyungu, et al.
Published: (2025)
by: Lee, Hyungu, et al.
Published: (2025)
CAWI: Copula-Aligned Weight Initialization for Randomized Neural Networks
by: Akhtar, Mushir, et al.
Published: (2026)
by: Akhtar, Mushir, et al.
Published: (2026)
Reducing Oversmoothing through Informed Weight Initialization in Graph Neural Networks
by: Kelesis, Dimitrios, et al.
Published: (2024)
by: Kelesis, Dimitrios, et al.
Published: (2024)
Effects of Initialization Biases on Deep Neural Network Training Dynamics
by: Pellegrino, Nicholas, et al.
Published: (2025)
by: Pellegrino, Nicholas, et al.
Published: (2025)
Deep Neural Network Initialization with Sparsity Inducing Activations
by: Price, Ilan, et al.
Published: (2024)
by: Price, Ilan, et al.
Published: (2024)
Posterior Inference on Shallow Infinitely Wide Bayesian Neural Networks under Weights with Unbounded Variance
by: Loría, Jorge, et al.
Published: (2023)
by: Loría, Jorge, et al.
Published: (2023)
Robust Weight Initialization for Tanh Neural Networks with Fixed Point Analysis
by: Lee, Hyunwoo, et al.
Published: (2024)
by: Lee, Hyunwoo, et al.
Published: (2024)
Deep Kernel Posterior Learning under Infinite Variance Prior Weights
by: Loría, Jorge, et al.
Published: (2024)
by: Loría, Jorge, et al.
Published: (2024)
Fair CoVariance Neural Networks
by: Cavallo, Andrea, et al.
Published: (2024)
by: Cavallo, Andrea, et al.
Published: (2024)
Teasing Apart Architecture and Initial Weights as Sources of Inductive Bias in Neural Networks
by: Bencomo, Gianluca, et al.
Published: (2025)
by: Bencomo, Gianluca, et al.
Published: (2025)
On the Weight Dynamics of Deep Normalized Networks
by: Mehmeti-Göpel, Christian H. X. Ali, et al.
Published: (2023)
by: Mehmeti-Göpel, Christian H. X. Ali, et al.
Published: (2023)
LDLT L-Lipschitz Network Weight Parameterization Initialization
by: Juston, Marius F. R., et al.
Published: (2026)
by: Juston, Marius F. R., et al.
Published: (2026)
Supervised Dynamic Dimension Reduction with Deep Neural Network
by: Luo, Zhanye, et al.
Published: (2025)
by: Luo, Zhanye, et al.
Published: (2025)
Weight-Parameterization in Continuous Time Deep Neural Networks for Surrogate Modeling
by: Rosso, Haley, et al.
Published: (2025)
by: Rosso, Haley, et al.
Published: (2025)
DeepWeightFlow: Re-Basined Flow Matching for Generating Neural Network Weights
by: Gupta, Saumya, et al.
Published: (2026)
by: Gupta, Saumya, et al.
Published: (2026)
Neural Weight Compression for Language Models
by: Ryu, Jegwang, et al.
Published: (2025)
by: Ryu, Jegwang, et al.
Published: (2025)
Early Directional Convergence in Deep Homogeneous Neural Networks for Small Initializations
by: Kumar, Akshay, et al.
Published: (2024)
by: Kumar, Akshay, et al.
Published: (2024)
Text2Weight: Bridging Natural Language and Neural Network Weight Spaces
by: Tian, Bowen, et al.
Published: (2025)
by: Tian, Bowen, et al.
Published: (2025)
Graph Neural Network Aided Deep Reinforcement Learning for Resource Allocation in Dynamic Terahertz UAV Networks
by: Hu, Zhifeng, et al.
Published: (2025)
by: Hu, Zhifeng, et al.
Published: (2025)
VeLU: Variance-enhanced Learning Unit for Deep Neural Networks
by: Shakarami, Ashkan, et al.
Published: (2025)
by: Shakarami, Ashkan, et al.
Published: (2025)
Exploring and Improving Initialization for Deep Graph Neural Networks: A Signal Propagation Perspective
by: Wang, Senmiao, et al.
Published: (2025)
by: Wang, Senmiao, et al.
Published: (2025)
Neural Variance-aware Dueling Bandits with Deep Representation and Shallow Exploration
by: Oh, Youngmin, et al.
Published: (2025)
by: Oh, Youngmin, et al.
Published: (2025)
Variance-Aware Linear UCB with Deep Representation for Neural Contextual Bandits
by: Bui, Ha Manh, et al.
Published: (2024)
by: Bui, Ha Manh, et al.
Published: (2024)
On the Variance of Neural Network Training with respect to Test Sets and Distributions
by: Jordan, Keller
Published: (2023)
by: Jordan, Keller
Published: (2023)
Variance-Aware Adaptive Weighting for Diffusion Model Training
by: Sun, Nanlong, et al.
Published: (2026)
by: Sun, Nanlong, et al.
Published: (2026)
Learning Guarantee of Reward Modeling Using Deep Neural Networks
by: Luo, Yuanhang, et al.
Published: (2025)
by: Luo, Yuanhang, et al.
Published: (2025)
Recovering Plasticity of Neural Networks via Soft Weight Rescaling
by: Oh, Seungwon, et al.
Published: (2025)
by: Oh, Seungwon, et al.
Published: (2025)
From SGD to Spectra: A Theory of Neural Network Weight Dynamics
by: Olsen, Brian Richard, et al.
Published: (2025)
by: Olsen, Brian Richard, et al.
Published: (2025)
VCRL: Variance-based Curriculum Reinforcement Learning for Large Language Models
by: Jiang, Guochao, et al.
Published: (2025)
by: Jiang, Guochao, et al.
Published: (2025)
Geometric Flow Models over Neural Network Weights
by: Erdogan, Ege
Published: (2025)
by: Erdogan, Ege
Published: (2025)
Principal Components for Neural Network Initialization
by: Phan, Nhan, et al.
Published: (2025)
by: Phan, Nhan, et al.
Published: (2025)
WAVE: Weight Templates for Adaptive Initialization of Variable-sized Models
by: Feng, Fu, et al.
Published: (2024)
by: Feng, Fu, et al.
Published: (2024)
Wormhole Dynamics in Deep Neural Networks
by: Lai, Yen-Lung, et al.
Published: (2025)
by: Lai, Yen-Lung, et al.
Published: (2025)
The SkipSponge Attack: Sponge Weight Poisoning of Deep Neural Networks
by: Lintelo, Jona te, et al.
Published: (2024)
by: Lintelo, Jona te, et al.
Published: (2024)
Dynamic Weight Adjusting Deep Q-Networks for Real-Time Environmental Adaptation
by: Zhang, Xinhao, et al.
Published: (2024)
by: Zhang, Xinhao, et al.
Published: (2024)
Variance-aware Reward Modeling with Anchor Guidance
by: Fang, Shuxing, et al.
Published: (2026)
by: Fang, Shuxing, et al.
Published: (2026)
CoVariance Filters and Neural Networks over Hilbert Spaces
by: Battiloro, Claudio, et al.
Published: (2025)
by: Battiloro, Claudio, et al.
Published: (2025)
GVPO: Group Variance Policy Optimization for Large Language Model Post-Training
by: Zhang, Kaichen, et al.
Published: (2025)
by: Zhang, Kaichen, et al.
Published: (2025)
LLaDA 1.5: Variance-Reduced Preference Optimization for Large Language Diffusion Models
by: Zhu, Fengqi, et al.
Published: (2025)
by: Zhu, Fengqi, et al.
Published: (2025)
Similar Items
-
Optimal Condition for Initialization Variance in Deep Neural Networks: An SGD Dynamics Perspective
by: Horii, Hiroshi, et al.
Published: (2025) -
Optimized Weight Initialization on the Stiefel Manifold for Deep ReLU Neural Networks
by: Lee, Hyungu, et al.
Published: (2025) -
CAWI: Copula-Aligned Weight Initialization for Randomized Neural Networks
by: Akhtar, Mushir, et al.
Published: (2026) -
Reducing Oversmoothing through Informed Weight Initialization in Graph Neural Networks
by: Kelesis, Dimitrios, et al.
Published: (2024) -
Effects of Initialization Biases on Deep Neural Network Training Dynamics
by: Pellegrino, Nicholas, et al.
Published: (2025)