Saved in:
| Main Authors: | Xu, Zhi-Qin John, Zhang, Yaoyu, Luo, Tao |
|---|---|
| Format: | Preprint |
| Published: |
2022
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2201.07395 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
An overview of condensation phenomenon in deep learning
by: Xu, Zhi-Qin John, et al.
Published: (2025)
by: Xu, Zhi-Qin John, et al.
Published: (2025)
A rationale from frequency perspective for grokking in training neural network
by: Zhou, Zhangchen, et al.
Published: (2024)
by: Zhou, Zhangchen, et al.
Published: (2024)
Embedding principle of homogeneous neural network for classification problem
by: Zhang, Jiahan, et al.
Published: (2025)
by: Zhang, Jiahan, et al.
Published: (2025)
Embedding Principle in Depth for the Loss Landscape Analysis of Deep Neural Networks
by: Bai, Zhiwei, et al.
Published: (2022)
by: Bai, Zhiwei, et al.
Published: (2022)
Frequency Principle: Fourier Analysis Sheds Light on Deep Neural Networks
by: Xu, Zhi-Qin John, et al.
Published: (2019)
by: Xu, Zhi-Qin John, et al.
Published: (2019)
Uncovering Critical Sets of Deep Neural Networks via Sample-Independent Critical Lifting
by: Zhang, Leyang, et al.
Published: (2025)
by: Zhang, Leyang, et al.
Published: (2025)
Initialization is Critical to Whether Transformers Fit Composite Functions by Reasoning or Memorizing
by: Zhang, Zhongwang, et al.
Published: (2024)
by: Zhang, Zhongwang, et al.
Published: (2024)
Mitigating spectral bias for the multiscale operator learning
by: Liu, Xinliang, et al.
Published: (2022)
by: Liu, Xinliang, et al.
Published: (2022)
Geometry and Local Recovery of Global Minima of Two-layer Neural Networks at Overparameterization
by: Zhang, Leyang, et al.
Published: (2023)
by: Zhang, Leyang, et al.
Published: (2023)
Geometry of Critical Sets and Existence of Saddle Branches for Two-layer Neural Networks
by: Zhang, Leyang, et al.
Published: (2024)
by: Zhang, Leyang, et al.
Published: (2024)
Complexity Control Facilitates Reasoning-Based Compositional Generalization in Transformers
by: Zhang, Zhongwang, et al.
Published: (2025)
by: Zhang, Zhongwang, et al.
Published: (2025)
On understanding and overcoming spectral biases of deep neural network learning methods for solving PDEs
by: Xu, Zhi-Qin John, et al.
Published: (2025)
by: Xu, Zhi-Qin John, et al.
Published: (2025)
Focus and Dilution: The Multi-stage Learning Process of Attention
by: Chen, Zheng-An, et al.
Published: (2026)
by: Chen, Zheng-An, et al.
Published: (2026)
Inductive biases in deep learning models for weather prediction
by: Thuemmel, Jannik, et al.
Published: (2023)
by: Thuemmel, Jannik, et al.
Published: (2023)
Training instability in deep learning follows low-dimensional dynamical principles
by: Zhang, Zhipeng, et al.
Published: (2026)
by: Zhang, Zhipeng, et al.
Published: (2026)
On the expressiveness and spectral bias of KANs
by: Wang, Yixuan, et al.
Published: (2024)
by: Wang, Yixuan, et al.
Published: (2024)
Adaptive Preconditioners Trigger Loss Spikes in Adam
by: Bai, Zhiwei, et al.
Published: (2025)
by: Bai, Zhiwei, et al.
Published: (2025)
An approach of deep reinforcement learning for maximizing the net present value of stochastic projects
by: Xu, Wei, et al.
Published: (2025)
by: Xu, Wei, et al.
Published: (2025)
When do spectral gradient updates help in deep learning?
by: Davis, Damek, et al.
Published: (2025)
by: Davis, Damek, et al.
Published: (2025)
Understanding the dynamics of the frequency bias in neural networks
by: Molina, Juan, et al.
Published: (2024)
by: Molina, Juan, et al.
Published: (2024)
Connectivity Shapes Implicit Regularization in Matrix Factorization Models for Matrix Completion
by: Bai, Zhiwei, et al.
Published: (2024)
by: Bai, Zhiwei, et al.
Published: (2024)
Disentangle Sample Size and Initialization Effect on Perfect Generalization for Single-Neuron Target
by: Zhao, Jiajie, et al.
Published: (2024)
by: Zhao, Jiajie, et al.
Published: (2024)
Probability Signature: Bridging Data Semantics and Embedding Structure in Language Models
by: Yao, Junjie, et al.
Published: (2025)
by: Yao, Junjie, et al.
Published: (2025)
Efficient and Flexible Method for Reducing Moderate-size Deep Neural Networks with Condensation
by: Chen, Tianyi, et al.
Published: (2024)
by: Chen, Tianyi, et al.
Published: (2024)
Loss Spike in Training Neural Networks
by: Li, Xiaolong, et al.
Published: (2023)
by: Li, Xiaolong, et al.
Published: (2023)
Neural Force Field: Few-shot Learning of Generalized Physical Reasoning
by: Li, Shiqian, et al.
Published: (2025)
by: Li, Shiqian, et al.
Published: (2025)
Understanding the Language Model to Solve the Symbolic Multi-Step Reasoning Problem from the Perspective of Buffer Mechanism
by: Wang, Zhiwei, et al.
Published: (2024)
by: Wang, Zhiwei, et al.
Published: (2024)
Implicit bias produces neural scaling laws in learning curves, from perceptrons to deep networks
by: D'Amico, Francesco, et al.
Published: (2025)
by: D'Amico, Francesco, et al.
Published: (2025)
Local Linear Recovery Guarantee of Deep Neural Networks at Overparameterization
by: Zhang, Yaoyu, et al.
Published: (2024)
by: Zhang, Yaoyu, et al.
Published: (2024)
Reasoning Bias of Next Token Prediction Training
by: Lin, Pengxiao, et al.
Published: (2025)
by: Lin, Pengxiao, et al.
Published: (2025)
An Analysis for Reasoning Bias of Language Models with Small Initialization
by: Yao, Junjie, et al.
Published: (2025)
by: Yao, Junjie, et al.
Published: (2025)
Determinism in the Undetermined: Deterministic Output in Charge-Conserving Continuous-Time Neuromorphic Systems with Temporal Stochasticity
by: Yan, Jing, et al.
Published: (2026)
by: Yan, Jing, et al.
Published: (2026)
Accelerating superconductor discovery through tempered deep learning of the electron-phonon spectral function
by: Gibson, Jason B., et al.
Published: (2024)
by: Gibson, Jason B., et al.
Published: (2024)
A Contrastive Diffusion-based Network (CDNet) for Time Series Classification
by: Zhang, Yaoyu, et al.
Published: (2025)
by: Zhang, Yaoyu, et al.
Published: (2025)
Loss Jump During Loss Switch in Solving PDEs with Neural Networks
by: Wang, Zhiwei, et al.
Published: (2024)
by: Wang, Zhiwei, et al.
Published: (2024)
Xeno-learning: knowledge transfer across species in deep learning-based spectral image analysis
by: Sellner, Jan, et al.
Published: (2024)
by: Sellner, Jan, et al.
Published: (2024)
Memorization in deep learning: A survey
by: Wei, Jiaheng, et al.
Published: (2024)
by: Wei, Jiaheng, et al.
Published: (2024)
Position: Many generalization measures for deep learning are fragile
by: Zhang, Shuofeng, et al.
Published: (2025)
by: Zhang, Shuofeng, et al.
Published: (2025)
Active learning with biased non-response to label requests
by: Robinson, Thomas, et al.
Published: (2023)
by: Robinson, Thomas, et al.
Published: (2023)
AcL: Action Learner for Fault-Tolerant Quadruped Locomotion Control
by: Xu, Tianyu, et al.
Published: (2025)
by: Xu, Tianyu, et al.
Published: (2025)
Similar Items
-
An overview of condensation phenomenon in deep learning
by: Xu, Zhi-Qin John, et al.
Published: (2025) -
A rationale from frequency perspective for grokking in training neural network
by: Zhou, Zhangchen, et al.
Published: (2024) -
Embedding principle of homogeneous neural network for classification problem
by: Zhang, Jiahan, et al.
Published: (2025) -
Embedding Principle in Depth for the Loss Landscape Analysis of Deep Neural Networks
by: Bai, Zhiwei, et al.
Published: (2022) -
Frequency Principle: Fourier Analysis Sheds Light on Deep Neural Networks
by: Xu, Zhi-Qin John, et al.
Published: (2019)