Saved in:
| Main Authors: | Luzi, Lorenzo, Dar, Yehuda, Baraniuk, Richard |
|---|---|
| Format: | Preprint |
| Published: |
2021
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2106.04003 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
The Common Intuition to Transfer Learning Can Win or Lose: Case Studies for Linear Regression
by: Dar, Yehuda, et al.
Published: (2021)
by: Dar, Yehuda, et al.
Published: (2021)
Mixture of Many Zero-Compute Experts: A High-Rate Quantization Theory Perspective
by: Dar, Yehuda
Published: (2025)
by: Dar, Yehuda
Published: (2025)
Stable and Privacy-Preserving Synthetic Educational Data with Empirical Marginals: A Copula-Based Approach
by: Ramos, Gabriel Diaz, et al.
Published: (2026)
by: Ramos, Gabriel Diaz, et al.
Published: (2026)
Improving Fairness and Mitigating MADness in Generative Models
by: Mayer, Paul, et al.
Published: (2024)
by: Mayer, Paul, et al.
Published: (2024)
Boomerang: Local sampling on image manifolds using diffusion models
by: Luzi, Lorenzo, et al.
Published: (2022)
by: Luzi, Lorenzo, et al.
Published: (2022)
How Do the Architecture and Optimizer Affect Representation Learning? On the Training Dynamics of Representations in Deep Neural Networks
by: Sharon, Yuval, et al.
Published: (2024)
by: Sharon, Yuval, et al.
Published: (2024)
TL-PCA: Transfer Learning of Principal Component Analysis
by: Hendy, Sharon, et al.
Published: (2024)
by: Hendy, Sharon, et al.
Published: (2024)
How Does Overparameterization Affect Machine Unlearning of Deep Neural Networks?
by: Alon, Gal, et al.
Published: (2025)
by: Alon, Gal, et al.
Published: (2025)
How Much Training Data is Memorized in Overparameterized Autoencoders? An Inverse Problem Perspective on Memorization Evaluation
by: Abitbul, Koren, et al.
Published: (2023)
by: Abitbul, Koren, et al.
Published: (2023)
Transfer Learning of Linear Regression with Multiple Pretrained Models: Benefiting from More Pretrained Models via Overparameterization Debiasing
by: Boharon, Daniel, et al.
Published: (2026)
by: Boharon, Daniel, et al.
Published: (2026)
SPI-GAN: Denoising Diffusion GANs with Straight-Path Interpolations
by: Jeon, Jinsung, et al.
Published: (2022)
by: Jeon, Jinsung, et al.
Published: (2022)
Randomness and Interpolation Improve Gradient Descent
by: Li, Jiawen, et al.
Published: (2025)
by: Li, Jiawen, et al.
Published: (2025)
In-context Learning and Gradient Descent Revisited
by: Deutch, Gilad, et al.
Published: (2023)
by: Deutch, Gilad, et al.
Published: (2023)
Using Skew to Assess the Quality of GAN-generated Image Features
by: Luzi, Lorenzo, et al.
Published: (2023)
by: Luzi, Lorenzo, et al.
Published: (2023)
Synthetic Context Generation for Question Generation
by: Liu, Naiming, et al.
Published: (2024)
by: Liu, Naiming, et al.
Published: (2024)
Circuit Complexity of Hierarchical Knowledge Tracing and Implications for Log-Precision Transformers
by: Liu, Naiming, et al.
Published: (2026)
by: Liu, Naiming, et al.
Published: (2026)
Bayesian Double Descent
by: Polson, Nick, et al.
Published: (2025)
by: Polson, Nick, et al.
Published: (2025)
Manipulating Sparse Double Descent
by: Zhang, Ya Shi
Published: (2024)
by: Zhang, Ya Shi
Published: (2024)
MazeNet: An Accurate, Fast, and Scalable Deep Learning Solution for Steiner Minimum Trees
by: Ramos, Gabriel Díaz, et al.
Published: (2024)
by: Ramos, Gabriel Díaz, et al.
Published: (2024)
Dropout Drops Double Descent
by: Yang, Tian-Le, et al.
Published: (2023)
by: Yang, Tian-Le, et al.
Published: (2023)
Neon: Negative Extrapolation From Self-Training Improves Image Generation
by: Alemohammad, Sina, et al.
Published: (2025)
by: Alemohammad, Sina, et al.
Published: (2025)
The Linear Centroids Hypothesis: Features as Directions Learned by Local Experts
by: Walker, Thomas, et al.
Published: (2026)
by: Walker, Thomas, et al.
Published: (2026)
GrokAlign: Geometric Characterisation and Acceleration of Grokking
by: Walker, Thomas, et al.
Published: (2025)
by: Walker, Thomas, et al.
Published: (2025)
Faster Convergence of Stochastic Accelerated Gradient Descent under Interpolation
by: Mishkin, Aaron, et al.
Published: (2024)
by: Mishkin, Aaron, et al.
Published: (2024)
On the Lipschitz Constant of Deep Networks and Double Descent
by: Gamba, Matteo, et al.
Published: (2023)
by: Gamba, Matteo, et al.
Published: (2023)
[Re] The Discriminative Kalman Filter for Bayesian Filtering with Nonlinear and Non-Gaussian Observation Models
by: Casco-Rodriguez, Josue, et al.
Published: (2024)
by: Casco-Rodriguez, Josue, et al.
Published: (2024)
On the regularization of Wasserstein GANs
by: Petzka, Henning, et al.
Published: (2017)
by: Petzka, Henning, et al.
Published: (2017)
Improving Routing in Sparse Mixture of Experts with Graph of Tokens
by: Nguyen, Tam, et al.
Published: (2025)
by: Nguyen, Tam, et al.
Published: (2025)
ODE-Constrained Generative Modeling of Cardiac Dynamics for 12-Lead ECG Synthesis
by: Yehuda, Yakir, et al.
Published: (2024)
by: Yehuda, Yakir, et al.
Published: (2024)
Deep Networks Always Grok and Here is Why
by: Humayun, Ahmed Imtiaz, et al.
Published: (2024)
by: Humayun, Ahmed Imtiaz, et al.
Published: (2024)
On the Geometry of Deep Learning
by: Balestriero, Randall, et al.
Published: (2024)
by: Balestriero, Randall, et al.
Published: (2024)
On The Presence of Double-Descent in Deep Reinforcement Learning
by: Veselý, Viktor, et al.
Published: (2025)
by: Veselý, Viktor, et al.
Published: (2025)
Mitigating over-exploration in latent space optimization using LES
by: Ronen, Omer, et al.
Published: (2024)
by: Ronen, Omer, et al.
Published: (2024)
The Geometric Structure of Models Learning Sparse Data
by: Walker, Thomas, et al.
Published: (2026)
by: Walker, Thomas, et al.
Published: (2026)
Data Cleansing for GANs
by: Terashita, Naoyuki, et al.
Published: (2025)
by: Terashita, Naoyuki, et al.
Published: (2025)
W4S4: WaLRUS Meets S4 for Long-Range Sequence Modeling
by: Babaei, Hossein, et al.
Published: (2025)
by: Babaei, Hossein, et al.
Published: (2025)
Class-wise Activation Unravelling the Engima of Deep Double Descent
by: Gu, Yufei
Published: (2024)
by: Gu, Yufei
Published: (2024)
Minimizing Collateral Damage in Activation Steering
by: Nguyen, Tam, et al.
Published: (2026)
by: Nguyen, Tam, et al.
Published: (2026)
Privacy-Preserving Federated Convex Optimization: Balancing Partial-Participation and Efficiency via Noise Cancellation
by: Reshef, Roie, et al.
Published: (2025)
by: Reshef, Roie, et al.
Published: (2025)
Multi-Scale Texture Loss for CT denoising with GANs
by: Di Feola, Francesco, et al.
Published: (2024)
by: Di Feola, Francesco, et al.
Published: (2024)
Similar Items
-
The Common Intuition to Transfer Learning Can Win or Lose: Case Studies for Linear Regression
by: Dar, Yehuda, et al.
Published: (2021) -
Mixture of Many Zero-Compute Experts: A High-Rate Quantization Theory Perspective
by: Dar, Yehuda
Published: (2025) -
Stable and Privacy-Preserving Synthetic Educational Data with Empirical Marginals: A Copula-Based Approach
by: Ramos, Gabriel Diaz, et al.
Published: (2026) -
Improving Fairness and Mitigating MADness in Generative Models
by: Mayer, Paul, et al.
Published: (2024) -
Boomerang: Local sampling on image manifolds using diffusion models
by: Luzi, Lorenzo, et al.
Published: (2022)