:: Library Catalog

Bibliographic Details
Main Authors:	Xu, Zhi-Qin John; Zhang, Yaoyu; Luo, Tao
Format:	Preprint
Published:	2022
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2201.07395

Similar Items

An overview of condensation phenomenon in deep learning
by: Xu, Zhi-Qin John, et al.
Published: (2025)

A rationale from frequency perspective for grokking in training neural network
by: Zhou, Zhangchen, et al.
Published: (2024)

Embedding principle of homogeneous neural network for classification problem
by: Zhang, Jiahan, et al.
Published: (2025)

Embedding Principle in Depth for the Loss Landscape Analysis of Deep Neural Networks
by: Bai, Zhiwei, et al.
Published: (2022)

Frequency Principle: Fourier Analysis Sheds Light on Deep Neural Networks
by: Xu, Zhi-Qin John, et al.
Published: (2019)

Uncovering Critical Sets of Deep Neural Networks via Sample-Independent Critical Lifting
by: Zhang, Leyang, et al.
Published: (2025)

Initialization is Critical to Whether Transformers Fit Composite Functions by Reasoning or Memorizing
by: Zhang, Zhongwang, et al.
Published: (2024)

Mitigating spectral bias for the multiscale operator learning
by: Liu, Xinliang, et al.
Published: (2022)

Geometry and Local Recovery of Global Minima of Two-layer Neural Networks at Overparameterization
by: Zhang, Leyang, et al.
Published: (2023)

Geometry of Critical Sets and Existence of Saddle Branches for Two-layer Neural Networks
by: Zhang, Leyang, et al.
Published: (2024)

Complexity Control Facilitates Reasoning-Based Compositional Generalization in Transformers
by: Zhang, Zhongwang, et al.
Published: (2025)

On understanding and overcoming spectral biases of deep neural network learning methods for solving PDEs
by: Xu, Zhi-Qin John, et al.
Published: (2025)

Focus and Dilution: The Multi-stage Learning Process of Attention
by: Chen, Zheng-An, et al.
Published: (2026)

Inductive biases in deep learning models for weather prediction
by: Thuemmel, Jannik, et al.
Published: (2023)

Training instability in deep learning follows low-dimensional dynamical principles
by: Zhang, Zhipeng, et al.
Published: (2026)

On the expressiveness and spectral bias of KANs
by: Wang, Yixuan, et al.
Published: (2024)

Adaptive Preconditioners Trigger Loss Spikes in Adam
by: Bai, Zhiwei, et al.
Published: (2025)

An approach of deep reinforcement learning for maximizing the net present value of stochastic projects
by: Xu, Wei, et al.
Published: (2025)

When do spectral gradient updates help in deep learning?
by: Davis, Damek, et al.
Published: (2025)

Understanding the dynamics of the frequency bias in neural networks
by: Molina, Juan, et al.
Published: (2024)

Connectivity Shapes Implicit Regularization in Matrix Factorization Models for Matrix Completion
by: Bai, Zhiwei, et al.
Published: (2024)

Disentangle Sample Size and Initialization Effect on Perfect Generalization for Single-Neuron Target
by: Zhao, Jiajie, et al.
Published: (2024)

Probability Signature: Bridging Data Semantics and Embedding Structure in Language Models
by: Yao, Junjie, et al.
Published: (2025)

Efficient and Flexible Method for Reducing Moderate-size Deep Neural Networks with Condensation
by: Chen, Tianyi, et al.
Published: (2024)

Loss Spike in Training Neural Networks
by: Li, Xiaolong, et al.
Published: (2023)

Neural Force Field: Few-shot Learning of Generalized Physical Reasoning
by: Li, Shiqian, et al.
Published: (2025)

Understanding the Language Model to Solve the Symbolic Multi-Step Reasoning Problem from the Perspective of Buffer Mechanism
by: Wang, Zhiwei, et al.
Published: (2024)

Implicit bias produces neural scaling laws in learning curves, from perceptrons to deep networks
by: D'Amico, Francesco, et al.
Published: (2025)

Local Linear Recovery Guarantee of Deep Neural Networks at Overparameterization
by: Zhang, Yaoyu, et al.
Published: (2024)

Reasoning Bias of Next Token Prediction Training
by: Lin, Pengxiao, et al.
Published: (2025)

An Analysis for Reasoning Bias of Language Models with Small Initialization
by: Yao, Junjie, et al.
Published: (2025)

Determinism in the Undetermined: Deterministic Output in Charge-Conserving Continuous-Time Neuromorphic Systems with Temporal Stochasticity
by: Yan, Jing, et al.
Published: (2026)

Accelerating superconductor discovery through tempered deep learning of the electron-phonon spectral function
by: Gibson, Jason B., et al.
Published: (2024)

A Contrastive Diffusion-based Network (CDNet) for Time Series Classification
by: Zhang, Yaoyu, et al.
Published: (2025)

Loss Jump During Loss Switch in Solving PDEs with Neural Networks
by: Wang, Zhiwei, et al.
Published: (2024)

Xeno-learning: knowledge transfer across species in deep learning-based spectral image analysis
by: Sellner, Jan, et al.
Published: (2024)

Memorization in deep learning: A survey
by: Wei, Jiaheng, et al.
Published: (2024)

Position: Many generalization measures for deep learning are fragile
by: Zhang, Shuofeng, et al.
Published: (2025)

Active learning with biased non-response to label requests
by: Robinson, Thomas, et al.
Published: (2023)

AcL: Action Learner for Fault-Tolerant Quadruped Locomotion Control
by: Xu, Tianyu, et al.
Published: (2025)