| Main Authors: | Yang, Greg; Simon, James B.; Bernstein, Jeremy |
|---|---|
| Format: | Preprint |
| Published: | 2023 |
| Online Access: | https://arxiv.org/abs/2310.17813 |
Similar Items
Modular Duality in Deep Learning
by: Bernstein, Jeremy, et al.
Published: (2024)
Old Optimizer, New Norm: An Anthology
by: Bernstein, Jeremy, et al.
Published: (2024)
Spectral Conditioning of Attention Improves Transformer Performance
by: Saratchandran, Hemanth, et al.
Published: (2026)
SketchOGD: Memory-Efficient Continual Learning
by: Min, Youngjae, et al.
Published: (2023)
The Optimization Landscape of SGD Across the Feature Learning Strength
by: Atanasov, Alexander, et al.
Published: (2024)
Extending $μ$P: Spectral Conditions for Feature Learning Across Optimizers
by: Gupta, Akshita, et al.
Published: (2026)
Scalable Optimization in the Modular Norm
by: Large, Tim, et al.
Published: (2024)
Spectrally Informed Learning of Fluid Flows
by: Shaffer, Benjamin D., et al.
Published: (2024)
Demystifying Spectral Feature Learning for Instrumental Variable Regression
by: Meunier, Dimitri, et al.
Published: (2025)
Outcome-Aware Spectral Feature Learning for Instrumental Variable Regression
by: Meunier, Dimitri, et al.
Published: (2025)
Spectral Regularization for Diffusion Models
by: Chandran, Satish, et al.
Published: (2026)
Global Convergence and Rich Feature Learning in $L$-Layer Infinite-Width Neural Networks under $μ$P Parametrization
by: Chen, Zixiang, et al.
Published: (2025)
A Spectral View of Adversarially Robust Features
by: Garg, Shivam, et al.
Published: (2018)
Alternating Gradient Flows: A Theory of Feature Learning in Two-layer Neural Networks
by: Kunin, Daniel, et al.
Published: (2025)
Spectral Convolutional Conditional Neural Processes
by: Mohseni, Peiman, et al.
Published: (2024)
Spectral Self-supervised Feature Selection
by: Segal, Daniel, et al.
Published: (2024)
The Features at Convergence Theorem: a first-principles alternative to the Neural Feature Ansatz for how networks learn representations
by: Boix-Adsera, Enric, et al.
Published: (2025)
Closed-Form Training Dynamics Reveal Learned Features and Linear Structure in Word2Vec-like Models
by: Karkada, Dhruva, et al.
Published: (2025)
On the Relationship Between Activation Outliers and Feature Death in Sparse Autoencoders
by: Simon, Elana, et al.
Published: (2026)
Statistical Significance of Feature Importance Rankings
by: Goldwasser, Jeremy, et al.
Published: (2024)
Training Transformers with Enforced Lipschitz Constants
by: Newhouse, Laker, et al.
Published: (2025)
SpectraKAN: Conditioning Spectral Operators
by: Cheng, Chun-Wun, et al.
Published: (2026)
Robust Multi-Modal Forecasting: Integrating Static and Dynamic Features
by: Qin, Jeremy
Published: (2025)
When Does Learning Renormalize? Sufficient Conditions for Power Law Spectral Dynamics
by: Zhang, Yizhou
Published: (2025)
Spectral Condition for $μ$P under Width-Depth Scaling
by: Zheng, Chenyu, et al.
Published: (2026)
Toward Scalable and Valid Conditional Independence Testing with Spectral Representations
by: Frohlich, Alek, et al.
Published: (2025)
Review Non-convex Optimization Method for Machine Learning
by: Fotopoulos, Greg B, et al.
Published: (2024)
Spectral Superposition: A Theory of Feature Geometry
by: Ivanov, Georgi, et al.
Published: (2026)
Deep Learning as Neural Low-Degree Filtering: A Spectral Theory of Hierarchical Feature Learning
by: Dandi, Yatin, et al.
Published: (2026)
Graph Classification Gaussian Processes via Hodgelet Spectral Features
by: Alain, Mathieu, et al.
Published: (2024)
A Matched Spectral Benchmark of Quantum Inspired Feature Maps
by: Ogunade, Toheeb, et al.
Published: (2026)
Training Neural Networks from Scratch with Parallel Low-Rank Adapters
by: Huh, Minyoung, et al.
Published: (2024)
A Cryptographic Perspective on Mitigation vs. Detection in Machine Learning
by: Gluch, Greg, et al.
Published: (2025)
Descriptor-Injected Cross-Modal Learning: A Systematic Exploration of Audio-MIDI Alignment via Spectral and Melodic Features
by: Méndez, Mariano Fernández
Published: (2026)
Overcoming Data Limitations in Internet Traffic Forecasting: LSTM Models with Transfer Learning and Wavelet Augmentation
by: Saha, Sajal, et al.
Published: (2024)
More is Better in Modern Machine Learning: when Infinite Overparameterization is Optimal and Overfitting is Obligatory
by: Simon, James B., et al.
Published: (2023)
InterPLM: Discovering Interpretable Features in Protein Language Models via Sparse Autoencoders
by: Simon, Elana, et al.
Published: (2024)
SpectralKrum: A Spectral-Geometric Defense Against Byzantine Attacks in Federated Learning
by: Tripathi, Aditya, et al.
Published: (2025)
Machine Learning with High-Cardinality Categorical Features in Actuarial Applications
by: Avanzi, Benjamin, et al.
Published: (2023)
Machine Learning based Analysis for Radiomics Features Robustness in Real-World Deployment Scenarios
by: Khan, Sarmad Ahmad, et al.
Published: (2025)