Saved in:
| Main Authors: | Zhang, Erica, Sagan, Naomi, Tse, Danny, Zhang, Fangzhao, Pilanci, Mert, Blanchet, Jose |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2601.21410 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Active Learning of Deep Neural Networks via Gradient-Free Cutting Planes
by: Zhang, Erica, et al.
Published: (2024)
by: Zhang, Erica, et al.
Published: (2024)
Analyzing Neural Network-Based Generative Diffusion Models through Convex Optimization
by: Zhang, Fangzhao, et al.
Published: (2024)
by: Zhang, Fangzhao, et al.
Published: (2024)
Spectral Adapter: Fine-Tuning in Spectral Space
by: Zhang, Fangzhao, et al.
Published: (2024)
by: Zhang, Fangzhao, et al.
Published: (2024)
Riemannian Preconditioned LoRA for Fine-Tuning Foundation Models
by: Zhang, Fangzhao, et al.
Published: (2024)
by: Zhang, Fangzhao, et al.
Published: (2024)
LLM-Lasso: A Robust Framework for Domain-Informed Feature Selection and Regularization
by: Zhang, Erica, et al.
Published: (2025)
by: Zhang, Erica, et al.
Published: (2025)
Newton Meets Marchenko-Pastur: Massively Parallel Second-Order Optimization with Hessian Sketching and Debiasing
by: Romanov, Elad, et al.
Published: (2024)
by: Romanov, Elad, et al.
Published: (2024)
Optimizer-Induced Mode Connectivity: From AdamW to Muon
by: Zhang, Fangzhao, et al.
Published: (2026)
by: Zhang, Fangzhao, et al.
Published: (2026)
When Should Humans Step In? Optimal Human Dispatching in AI-Assisted Decisions
by: Tan, Lezhi, et al.
Published: (2026)
by: Tan, Lezhi, et al.
Published: (2026)
Compressing Large Language Models using Low Rank and Low Precision Decomposition
by: Saha, Rajarshi, et al.
Published: (2024)
by: Saha, Rajarshi, et al.
Published: (2024)
Optimal Shrinkage for Distributed Second-Order Optimization
by: Zhang, Fangzhao, et al.
Published: (2024)
by: Zhang, Fangzhao, et al.
Published: (2024)
Unveiling Hidden Convexity in Deep Learning: a Sparse Signal Processing Perspective
by: Zeger, Emi, et al.
Published: (2026)
by: Zeger, Emi, et al.
Published: (2026)
From Complexity to Clarity: Analytical Expressions of Deep Neural Network Weights via Clifford's Geometric Algebra and Convexity
by: Pilanci, Mert
Published: (2023)
by: Pilanci, Mert
Published: (2023)
Optimal Sets and Solution Paths of ReLU Networks
by: Mishkin, Aaron, et al.
Published: (2023)
by: Mishkin, Aaron, et al.
Published: (2023)
Convex Distillation: Efficient Compression of Deep Networks via Convex Optimization
by: Varshney, Prateek, et al.
Published: (2024)
by: Varshney, Prateek, et al.
Published: (2024)
Black Boxes and Looking Glasses: Multilevel Symmetries, Reflection Planes, and Convex Optimization in Deep Networks
by: Zeger, Emi, et al.
Published: (2024)
by: Zeger, Emi, et al.
Published: (2024)
Convex Relaxations of ReLU Neural Networks Approximate Global Optima in Polynomial Time
by: Kim, Sungyoon, et al.
Published: (2024)
by: Kim, Sungyoon, et al.
Published: (2024)
A Recovery Guarantee for Sparse Neural Networks
by: Fridovich-Keil, Sara, et al.
Published: (2025)
by: Fridovich-Keil, Sara, et al.
Published: (2025)
CRONOS: Enhancing Deep Learning with Scalable GPU Accelerated Convex Neural Networks
by: Feng, Miria, et al.
Published: (2024)
by: Feng, Miria, et al.
Published: (2024)
Convex Optimization for Alignment and Preference Learning on a Single GPU
by: Feng, Miria, et al.
Published: (2026)
by: Feng, Miria, et al.
Published: (2026)
Large Language Models Implicitly Learn to See and Hear Just By Reading
by: Verma, Prateek, et al.
Published: (2025)
by: Verma, Prateek, et al.
Published: (2025)
Thinking While Listening: Simple Test Time Scaling For Audio Classification
by: Verma, Prateek, et al.
Published: (2025)
by: Verma, Prateek, et al.
Published: (2025)
Exploring the loss landscape of regularized neural networks via convex duality
by: Kim, Sungyoon, et al.
Published: (2024)
by: Kim, Sungyoon, et al.
Published: (2024)
Fast Convex Optimization for Two-Layer ReLU Networks: Equivalent Model Classes and Cone Decompositions
by: Mishkin, Aaron, et al.
Published: (2022)
by: Mishkin, Aaron, et al.
Published: (2022)
LLM-Prior: A Framework for Knowledge-Driven Prior Elicitation and Aggregation
by: Huang, Yongchao
Published: (2025)
by: Huang, Yongchao
Published: (2025)
Adaptive Large Language Models By Layerwise Attention Shortcuts
by: Verma, Prateek, et al.
Published: (2024)
by: Verma, Prateek, et al.
Published: (2024)
Towards Signal Processing In Large Language Models
by: Verma, Prateek, et al.
Published: (2024)
by: Verma, Prateek, et al.
Published: (2024)
Faster Convergence of Stochastic Accelerated Gradient Descent under Interpolation
by: Mishkin, Aaron, et al.
Published: (2024)
by: Mishkin, Aaron, et al.
Published: (2024)
Adversarial Training of Two-Layer Polynomial and ReLU Activation Networks via Convex Optimization
by: Kuelbs, Daniel, et al.
Published: (2024)
by: Kuelbs, Daniel, et al.
Published: (2024)
AdaPTwin: Low-Cost Adaptive Compression of Product Twins in Transformers
by: Biju, Emil, et al.
Published: (2024)
by: Biju, Emil, et al.
Published: (2024)
Convex Low-resource Accent-Robust Language Detection in Speech Recognition
by: Feng, Miria, et al.
Published: (2026)
by: Feng, Miria, et al.
Published: (2026)
Adaptive Inference: Theoretical Limits and Unexplored Opportunities
by: Hor, Soheil, et al.
Published: (2024)
by: Hor, Soheil, et al.
Published: (2024)
MatRL: Provably Generalizable Iterative Algorithm Discovery via Monte-Carlo Tree Search
by: Kim, Sungyoon, et al.
Published: (2025)
by: Kim, Sungyoon, et al.
Published: (2025)
PRCD-MAP: Learning How Much to Trust Imperfect Priors in Causal Discovery
by: Shan, Xihang, et al.
Published: (2026)
by: Shan, Xihang, et al.
Published: (2026)
Residual Prior Diffusion: A Probabilistic Framework Integrating Coarse Latent Priors with Diffusion Models
by: Kutsuna, Takuro
Published: (2025)
by: Kutsuna, Takuro
Published: (2025)
When Priors Backfire: On the Vulnerability of Unlearnable Examples to Pretraining
by: Li, Zhihao, et al.
Published: (2026)
by: Li, Zhihao, et al.
Published: (2026)
LLM Priors for ERM over Programs
by: Singhal, Shivam, et al.
Published: (2025)
by: Singhal, Shivam, et al.
Published: (2025)
Randomized Geometric Algebra Methods for Convex Neural Networks
by: Wang, Yifei, et al.
Published: (2024)
by: Wang, Yifei, et al.
Published: (2024)
Prior Learning in Introspective VAEs
by: Athanasiadis, Ioannis, et al.
Published: (2024)
by: Athanasiadis, Ioannis, et al.
Published: (2024)
LLM Sparsity Prior for Robust Feature Selection
by: Skinner, Caleb, et al.
Published: (2026)
by: Skinner, Caleb, et al.
Published: (2026)
A KL-regularization Framework for Learning to Plan with Adaptive Priors
by: Serra-Gomez, Álvaro, et al.
Published: (2025)
by: Serra-Gomez, Álvaro, et al.
Published: (2025)
Similar Items
-
Active Learning of Deep Neural Networks via Gradient-Free Cutting Planes
by: Zhang, Erica, et al.
Published: (2024) -
Analyzing Neural Network-Based Generative Diffusion Models through Convex Optimization
by: Zhang, Fangzhao, et al.
Published: (2024) -
Spectral Adapter: Fine-Tuning in Spectral Space
by: Zhang, Fangzhao, et al.
Published: (2024) -
Riemannian Preconditioned LoRA for Fine-Tuning Foundation Models
by: Zhang, Fangzhao, et al.
Published: (2024) -
LLM-Lasso: A Robust Framework for Domain-Informed Feature Selection and Regularization
by: Zhang, Erica, et al.
Published: (2025)