Library Catalog

Bibliographic Details
Main Author:	Han, Yankun
Format:	Preprint
Published:	2025
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2510.09423

Similar Items

Optimal Condition for Initialization Variance in Deep Neural Networks: An SGD Dynamics Perspective
by: Horii, Hiroshi, et al.
Published: (2025)

Optimized Weight Initialization on the Stiefel Manifold for Deep ReLU Neural Networks
by: Lee, Hyungu, et al.
Published: (2025)

CAWI: Copula-Aligned Weight Initialization for Randomized Neural Networks
by: Akhtar, Mushir, et al.
Published: (2026)

Reducing Oversmoothing through Informed Weight Initialization in Graph Neural Networks
by: Kelesis, Dimitrios, et al.
Published: (2024)

Effects of Initialization Biases on Deep Neural Network Training Dynamics
by: Pellegrino, Nicholas, et al.
Published: (2025)

Deep Neural Network Initialization with Sparsity Inducing Activations
by: Price, Ilan, et al.
Published: (2024)

Posterior Inference on Shallow Infinitely Wide Bayesian Neural Networks under Weights with Unbounded Variance
by: Loría, Jorge, et al.
Published: (2023)

Robust Weight Initialization for Tanh Neural Networks with Fixed Point Analysis
by: Lee, Hyunwoo, et al.
Published: (2024)

Deep Kernel Posterior Learning under Infinite Variance Prior Weights
by: Loría, Jorge, et al.
Published: (2024)

Fair CoVariance Neural Networks
by: Cavallo, Andrea, et al.
Published: (2024)

Teasing Apart Architecture and Initial Weights as Sources of Inductive Bias in Neural Networks
by: Bencomo, Gianluca, et al.
Published: (2025)

On the Weight Dynamics of Deep Normalized Networks
by: Mehmeti-Göpel, Christian H. X. Ali, et al.
Published: (2023)

LDLT L-Lipschitz Network Weight Parameterization Initialization
by: Juston, Marius F. R., et al.
Published: (2026)

Supervised Dynamic Dimension Reduction with Deep Neural Network
by: Luo, Zhanye, et al.
Published: (2025)

Weight-Parameterization in Continuous Time Deep Neural Networks for Surrogate Modeling
by: Rosso, Haley, et al.
Published: (2025)

DeepWeightFlow: Re-Basined Flow Matching for Generating Neural Network Weights
by: Gupta, Saumya, et al.
Published: (2026)

Neural Weight Compression for Language Models
by: Ryu, Jegwang, et al.
Published: (2025)

Early Directional Convergence in Deep Homogeneous Neural Networks for Small Initializations
by: Kumar, Akshay, et al.
Published: (2024)

Text2Weight: Bridging Natural Language and Neural Network Weight Spaces
by: Tian, Bowen, et al.
Published: (2025)

Graph Neural Network Aided Deep Reinforcement Learning for Resource Allocation in Dynamic Terahertz UAV Networks
by: Hu, Zhifeng, et al.
Published: (2025)

VeLU: Variance-enhanced Learning Unit for Deep Neural Networks
by: Shakarami, Ashkan, et al.
Published: (2025)

Exploring and Improving Initialization for Deep Graph Neural Networks: A Signal Propagation Perspective
by: Wang, Senmiao, et al.
Published: (2025)

Neural Variance-aware Dueling Bandits with Deep Representation and Shallow Exploration
by: Oh, Youngmin, et al.
Published: (2025)

Variance-Aware Linear UCB with Deep Representation for Neural Contextual Bandits
by: Bui, Ha Manh, et al.
Published: (2024)

On the Variance of Neural Network Training with respect to Test Sets and Distributions
by: Jordan, Keller
Published: (2023)

Variance-Aware Adaptive Weighting for Diffusion Model Training
by: Sun, Nanlong, et al.
Published: (2026)

Learning Guarantee of Reward Modeling Using Deep Neural Networks
by: Luo, Yuanhang, et al.
Published: (2025)

Recovering Plasticity of Neural Networks via Soft Weight Rescaling
by: Oh, Seungwon, et al.
Published: (2025)

From SGD to Spectra: A Theory of Neural Network Weight Dynamics
by: Olsen, Brian Richard, et al.
Published: (2025)

VCRL: Variance-based Curriculum Reinforcement Learning for Large Language Models
by: Jiang, Guochao, et al.
Published: (2025)

Geometric Flow Models over Neural Network Weights
by: Erdogan, Ege
Published: (2025)

Principal Components for Neural Network Initialization
by: Phan, Nhan, et al.
Published: (2025)

WAVE: Weight Templates for Adaptive Initialization of Variable-sized Models
by: Feng, Fu, et al.
Published: (2024)

Wormhole Dynamics in Deep Neural Networks
by: Lai, Yen-Lung, et al.
Published: (2025)

The SkipSponge Attack: Sponge Weight Poisoning of Deep Neural Networks
by: Lintelo, Jona te, et al.
Published: (2024)

Dynamic Weight Adjusting Deep Q-Networks for Real-Time Environmental Adaptation
by: Zhang, Xinhao, et al.
Published: (2024)

Variance-aware Reward Modeling with Anchor Guidance
by: Fang, Shuxing, et al.
Published: (2026)

CoVariance Filters and Neural Networks over Hilbert Spaces
by: Battiloro, Claudio, et al.
Published: (2025)

GVPO: Group Variance Policy Optimization for Large Language Model Post-Training
by: Zhang, Kaichen, et al.
Published: (2025)

LLaDA 1.5: Variance-Reduced Preference Optimization for Large Language Diffusion Models
by: Zhu, Fengqi, et al.
Published: (2025)