:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Luzi, Lorenzo, Dar, Yehuda, Baraniuk, Richard
Format:	Preprint
Published:	2021
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2106.04003
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

The Common Intuition to Transfer Learning Can Win or Lose: Case Studies for Linear Regression
by: Dar, Yehuda, et al.
Published: (2021)

Mixture of Many Zero-Compute Experts: A High-Rate Quantization Theory Perspective
by: Dar, Yehuda
Published: (2025)

Stable and Privacy-Preserving Synthetic Educational Data with Empirical Marginals: A Copula-Based Approach
by: Ramos, Gabriel Diaz, et al.
Published: (2026)

Improving Fairness and Mitigating MADness in Generative Models
by: Mayer, Paul, et al.
Published: (2024)

Boomerang: Local sampling on image manifolds using diffusion models
by: Luzi, Lorenzo, et al.
Published: (2022)

How Do the Architecture and Optimizer Affect Representation Learning? On the Training Dynamics of Representations in Deep Neural Networks
by: Sharon, Yuval, et al.
Published: (2024)

TL-PCA: Transfer Learning of Principal Component Analysis
by: Hendy, Sharon, et al.
Published: (2024)

How Does Overparameterization Affect Machine Unlearning of Deep Neural Networks?
by: Alon, Gal, et al.
Published: (2025)

How Much Training Data is Memorized in Overparameterized Autoencoders? An Inverse Problem Perspective on Memorization Evaluation
by: Abitbul, Koren, et al.
Published: (2023)

Transfer Learning of Linear Regression with Multiple Pretrained Models: Benefiting from More Pretrained Models via Overparameterization Debiasing
by: Boharon, Daniel, et al.
Published: (2026)

SPI-GAN: Denoising Diffusion GANs with Straight-Path Interpolations
by: Jeon, Jinsung, et al.
Published: (2022)

Randomness and Interpolation Improve Gradient Descent
by: Li, Jiawen, et al.
Published: (2025)

In-context Learning and Gradient Descent Revisited
by: Deutch, Gilad, et al.
Published: (2023)

Using Skew to Assess the Quality of GAN-generated Image Features
by: Luzi, Lorenzo, et al.
Published: (2023)

Synthetic Context Generation for Question Generation
by: Liu, Naiming, et al.
Published: (2024)

Circuit Complexity of Hierarchical Knowledge Tracing and Implications for Log-Precision Transformers
by: Liu, Naiming, et al.
Published: (2026)

Bayesian Double Descent
by: Polson, Nick, et al.
Published: (2025)

Manipulating Sparse Double Descent
by: Zhang, Ya Shi
Published: (2024)

MazeNet: An Accurate, Fast, and Scalable Deep Learning Solution for Steiner Minimum Trees
by: Ramos, Gabriel Díaz, et al.
Published: (2024)

Dropout Drops Double Descent
by: Yang, Tian-Le, et al.
Published: (2023)

Neon: Negative Extrapolation From Self-Training Improves Image Generation
by: Alemohammad, Sina, et al.
Published: (2025)

The Linear Centroids Hypothesis: Features as Directions Learned by Local Experts
by: Walker, Thomas, et al.
Published: (2026)

GrokAlign: Geometric Characterisation and Acceleration of Grokking
by: Walker, Thomas, et al.
Published: (2025)

Faster Convergence of Stochastic Accelerated Gradient Descent under Interpolation
by: Mishkin, Aaron, et al.
Published: (2024)

On the Lipschitz Constant of Deep Networks and Double Descent
by: Gamba, Matteo, et al.
Published: (2023)

[Re] The Discriminative Kalman Filter for Bayesian Filtering with Nonlinear and Non-Gaussian Observation Models
by: Casco-Rodriguez, Josue, et al.
Published: (2024)

On the regularization of Wasserstein GANs
by: Petzka, Henning, et al.
Published: (2017)

Improving Routing in Sparse Mixture of Experts with Graph of Tokens
by: Nguyen, Tam, et al.
Published: (2025)

ODE-Constrained Generative Modeling of Cardiac Dynamics for 12-Lead ECG Synthesis
by: Yehuda, Yakir, et al.
Published: (2024)

Deep Networks Always Grok and Here is Why
by: Humayun, Ahmed Imtiaz, et al.
Published: (2024)

On the Geometry of Deep Learning
by: Balestriero, Randall, et al.
Published: (2024)

On The Presence of Double-Descent in Deep Reinforcement Learning
by: Veselý, Viktor, et al.
Published: (2025)

Mitigating over-exploration in latent space optimization using LES
by: Ronen, Omer, et al.
Published: (2024)

The Geometric Structure of Models Learning Sparse Data
by: Walker, Thomas, et al.
Published: (2026)

Data Cleansing for GANs
by: Terashita, Naoyuki, et al.
Published: (2025)

W4S4: WaLRUS Meets S4 for Long-Range Sequence Modeling
by: Babaei, Hossein, et al.
Published: (2025)

Class-wise Activation Unravelling the Engima of Deep Double Descent
by: Gu, Yufei
Published: (2024)

Minimizing Collateral Damage in Activation Steering
by: Nguyen, Tam, et al.
Published: (2026)

Privacy-Preserving Federated Convex Optimization: Balancing Partial-Participation and Efficiency via Noise Cancellation
by: Reshef, Roie, et al.
Published: (2025)

Multi-Scale Texture Loss for CT denoising with GANs
by: Di Feola, Francesco, et al.
Published: (2024)