| Main Authors: | Xu, Yizhou; Ziyin, Liu |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2401.07085 |
Similar Items
Remove Symmetries to Control Model Expressivity and Improve Optimization
by: Ziyin, Liu, et al.
Published: (2024)
Parameter Symmetry Potentially Unifies Deep Learning Theory
by: Ziyin, Liu, et al.
Published: (2025)
Noise Balance and Stationary Distribution of Stochastic Gradient Descent
by: Ziyin, Liu, et al.
Published: (2023)
When Does Learning Renormalize? Sufficient Conditions for Power Law Spectral Dynamics
by: Zhang, Yizhou
Published: (2025)
Scaling Laws and Spectra of Shallow Neural Networks in the Feature Learning Regime
by: Defilippis, Leonardo, et al.
Published: (2025)
Compositional Generalization via Forced Rendering of Disentangled Latents
by: Liang, Qiyao, et al.
Published: (2025)
Learning Hierarchical Polynomials of Multiple Nonlinear Features with Three-Layer Networks
by: Fu, Hengyu, et al.
Published: (2024)
Finite Sample Analysis of Linear Temporal Difference Learning with Arbitrary Features
by: Xie, Zixuan, et al.
Published: (2025)
Enhancing Imbalanced Node Classification via Curriculum-Guided Feature Learning and Three-Stage Attention Network
by: Fofanah, Abdul Joseph, et al.
Published: (2026)
Understanding the Emergence of Multimodal Representation Alignment
by: Tjandrasuwita, Megan, et al.
Published: (2025)
Universal One-third Time Scaling in Learning Peaked Distributions
by: Liu, Yizhou, et al.
Published: (2026)
The Weight Gram Matrix Captures Sequential Feature Linearization in Deep Networks
by: Cha, Taehun, et al.
Published: (2026)
Feature contamination: Neural networks learn uncorrelated features and fail to generalize
by: Zhang, Tianren, et al.
Published: (2024)
Self-cross Feature based Spiking Neural Networks for Efficient Few-shot Learning
by: Xu, Qi, et al.
Published: (2025)
Almost Sure Convergence of Linear Temporal Difference Learning with Arbitrary Features
by: Wang, Jiuqi, et al.
Published: (2024)
Thermodynamic Irreversibility of Training Algorithms
by: Ziyin, Liu, et al.
Published: (2026)
Shortcut Features as Top Eigenfunctions of NTK: A Linear Neural Network Case and More
by: Lim, Jinwoo, et al.
Published: (2026)
Hamiltonian Mechanics of Feature Learning: Bottleneck Structure in Leaky ResNets
by: Jacot, Arthur, et al.
Published: (2024)
Learning Probabilities of Causation with Mask-Augmented Data
by: Wang, Shuai, et al.
Published: (2025)
Three Pathways to Neurosymbolic Reinforcement Learning with Interpretable Model and Policy Networks
by: Graf, Peter, et al.
Published: (2024)
Data Curation Through the Lens of Spectral Dynamics: Static Limits, Dynamic Acceleration, and Practical Oracles
by: Zhang, Yizhou, et al.
Published: (2025)
Learning Distinguishable Representations in Deep Q-Networks for Linear Transfer
by: Sathish, Sooraj, et al.
Published: (2025)
TANGNN: a Concise, Scalable and Effective Graph Neural Networks with Top-m Attention Mechanism for Graph Representation Learning
by: E, Jiawei, et al.
Published: (2024)
Robust Offline Reinforcement Learning with Linearly Structured f-Divergence Regularization
by: Tang, Cheng, et al.
Published: (2024)
Feature Repulsion and Spectral Lock-in: An Empirical Study of Two-Layer Network Grokking
by: Xu, Yongzhong
Published: (2026)
Transform then Explore: a Simple and Effective Technique for Exploratory Combinatorial Optimization with Reinforcement Learning
by: Pu, Tianle, et al.
Published: (2024)
On the Learning Dynamics of Two-layer Linear Networks with Label Noise SGD
by: Zhang, Tongcheng, et al.
Published: (2026)
TimeDiT: General-purpose Diffusion Transformers for Time Series Foundation Model
by: Cao, Defu, et al.
Published: (2024)
Identifying General Mechanism Shifts in Linear Causal Representations
by: Chen, Tianyu, et al.
Published: (2024)
Continual Model-Based Reinforcement Learning with Hypernetworks
by: Huang, Yizhou, et al.
Published: (2020)
Operator Feature Neural Network for Symbolic Regression
by: Deng, Yusong, et al.
Published: (2024)
Latent Context Compilation: Distilling Long Context into Compact Portable Memory
by: Li, Zeju, et al.
Published: (2026)
Superposition Yields Robust Neural Scaling
by: Liu, Yizhou, et al.
Published: (2025)
Provable Effects of Data Replay in Continual Learning: A Feature Learning Perspective
by: Ding, Meng, et al.
Published: (2026)
Three Dogmas of Reinforcement Learning
by: Abel, David, et al.
Published: (2024)
Contextual Linear Bandits under Noisy Features: Towards Bayesian Oracles
by: Kim, Jung-hun, et al.
Published: (2017)
Solving Deep Reinforcement Learning Tasks with Evolution Strategies and Linear Policy Networks
by: Wong, Annie, et al.
Published: (2024)
Feature Learning Dynamics in Infinite-Depth Neural Networks
by: Yao, Zihan, et al.
Published: (2025)
Learning Unified Distance Metric for Heterogeneous Attribute Data Clustering
by: Zhang, Yiqun, et al.
Published: (2026)
Continual Driving Policy Optimization with Closed-Loop Individualized Curricula
by: Niu, Haoyi, et al.
Published: (2023)