Saved in:
| Main Authors: | Hor, Soheil, Qian, Ying, Pilanci, Mert, Arbabian, Amin |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2402.04359 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Adaptive Large Language Models By Layerwise Attention Shortcuts
by: Verma, Prateek, et al.
Published: (2024)
by: Verma, Prateek, et al.
Published: (2024)
From Complexity to Clarity: Analytical Expressions of Deep Neural Network Weights via Clifford's Geometric Algebra and Convexity
by: Pilanci, Mert
Published: (2023)
by: Pilanci, Mert
Published: (2023)
Convex Distillation: Efficient Compression of Deep Networks via Convex Optimization
by: Varshney, Prateek, et al.
Published: (2024)
by: Varshney, Prateek, et al.
Published: (2024)
Black Boxes and Looking Glasses: Multilevel Symmetries, Reflection Planes, and Convex Optimization in Deep Networks
by: Zeger, Emi, et al.
Published: (2024)
by: Zeger, Emi, et al.
Published: (2024)
Optimal Sets and Solution Paths of ReLU Networks
by: Mishkin, Aaron, et al.
Published: (2023)
by: Mishkin, Aaron, et al.
Published: (2023)
AdaPTwin: Low-Cost Adaptive Compression of Product Twins in Transformers
by: Biju, Emil, et al.
Published: (2024)
by: Biju, Emil, et al.
Published: (2024)
Analyzing Neural Network-Based Generative Diffusion Models through Convex Optimization
by: Zhang, Fangzhao, et al.
Published: (2024)
by: Zhang, Fangzhao, et al.
Published: (2024)
Convex Relaxations of ReLU Neural Networks Approximate Global Optima in Polynomial Time
by: Kim, Sungyoon, et al.
Published: (2024)
by: Kim, Sungyoon, et al.
Published: (2024)
Spectral Adapter: Fine-Tuning in Spectral Space
by: Zhang, Fangzhao, et al.
Published: (2024)
by: Zhang, Fangzhao, et al.
Published: (2024)
Unveiling Hidden Convexity in Deep Learning: a Sparse Signal Processing Perspective
by: Zeger, Emi, et al.
Published: (2026)
by: Zeger, Emi, et al.
Published: (2026)
Riemannian Preconditioned LoRA for Fine-Tuning Foundation Models
by: Zhang, Fangzhao, et al.
Published: (2024)
by: Zhang, Fangzhao, et al.
Published: (2024)
A Recovery Guarantee for Sparse Neural Networks
by: Fridovich-Keil, Sara, et al.
Published: (2025)
by: Fridovich-Keil, Sara, et al.
Published: (2025)
Thinking While Listening: Simple Test Time Scaling For Audio Classification
by: Verma, Prateek, et al.
Published: (2025)
by: Verma, Prateek, et al.
Published: (2025)
Exploring the loss landscape of regularized neural networks via convex duality
by: Kim, Sungyoon, et al.
Published: (2024)
by: Kim, Sungyoon, et al.
Published: (2024)
Fast Convex Optimization for Two-Layer ReLU Networks: Equivalent Model Classes and Cone Decompositions
by: Mishkin, Aaron, et al.
Published: (2022)
by: Mishkin, Aaron, et al.
Published: (2022)
Towards Signal Processing In Large Language Models
by: Verma, Prateek, et al.
Published: (2024)
by: Verma, Prateek, et al.
Published: (2024)
Faster Convergence of Stochastic Accelerated Gradient Descent under Interpolation
by: Mishkin, Aaron, et al.
Published: (2024)
by: Mishkin, Aaron, et al.
Published: (2024)
Active Learning of Deep Neural Networks via Gradient-Free Cutting Planes
by: Zhang, Erica, et al.
Published: (2024)
by: Zhang, Erica, et al.
Published: (2024)
CRONOS: Enhancing Deep Learning with Scalable GPU Accelerated Convex Neural Networks
by: Feng, Miria, et al.
Published: (2024)
by: Feng, Miria, et al.
Published: (2024)
Adversarial Training of Two-Layer Polynomial and ReLU Activation Networks via Convex Optimization
by: Kuelbs, Daniel, et al.
Published: (2024)
by: Kuelbs, Daniel, et al.
Published: (2024)
Large Language Models Implicitly Learn to See and Hear Just By Reading
by: Verma, Prateek, et al.
Published: (2025)
by: Verma, Prateek, et al.
Published: (2025)
Convex Optimization for Alignment and Preference Learning on a Single GPU
by: Feng, Miria, et al.
Published: (2026)
by: Feng, Miria, et al.
Published: (2026)
Convex Low-resource Accent-Robust Language Detection in Speech Recognition
by: Feng, Miria, et al.
Published: (2026)
by: Feng, Miria, et al.
Published: (2026)
Newton Meets Marchenko-Pastur: Massively Parallel Second-Order Optimization with Hessian Sketching and Debiasing
by: Romanov, Elad, et al.
Published: (2024)
by: Romanov, Elad, et al.
Published: (2024)
MatRL: Provably Generalizable Iterative Algorithm Discovery via Monte-Carlo Tree Search
by: Kim, Sungyoon, et al.
Published: (2025)
by: Kim, Sungyoon, et al.
Published: (2025)
NanoFlux: Adversarial Dual-LLM Evaluation and Distillation For Multi-Domain Reasoning
by: Anantha, Raviteja, et al.
Published: (2025)
by: Anantha, Raviteja, et al.
Published: (2025)
Randomized Geometric Algebra Methods for Convex Neural Networks
by: Wang, Yifei, et al.
Published: (2024)
by: Wang, Yifei, et al.
Published: (2024)
Learning When to Trust LLM Priors: A Validated Framework for Semantic Prior Integration
by: Zhang, Erica, et al.
Published: (2026)
by: Zhang, Erica, et al.
Published: (2026)
Optimizer-Induced Mode Connectivity: From AdamW to Muon
by: Zhang, Fangzhao, et al.
Published: (2026)
by: Zhang, Fangzhao, et al.
Published: (2026)
A Measure-Theoretic Finite-Sample Theory for Adaptive-Data Fitted Q-Iteration
by: Haussmann, Manuel, et al.
Published: (2026)
by: Haussmann, Manuel, et al.
Published: (2026)
Compressing Large Language Models using Low Rank and Low Precision Decomposition
by: Saha, Rajarshi, et al.
Published: (2024)
by: Saha, Rajarshi, et al.
Published: (2024)
Deep-Learning-Directed Preventive Dynamic Security Control via Coordinated Demand Response
by: Masoumi, Amin, et al.
Published: (2025)
by: Masoumi, Amin, et al.
Published: (2025)
Distributed Markov Chain Monte Carlo Sampling based on the Alternating Direction Method of Multipliers
by: Tzikas, Alexandros E., et al.
Published: (2024)
by: Tzikas, Alexandros E., et al.
Published: (2024)
Birefringence-free photoelastic modulator with centimeter-square aperture operating at 2.7 MHz with sub-watt drive power
by: Atalar, Okan, et al.
Published: (2024)
by: Atalar, Okan, et al.
Published: (2024)
Polarization-insensitive wide-angle resonant acousto-optic phase modulator
by: Atalar, Okan, et al.
Published: (2024)
by: Atalar, Okan, et al.
Published: (2024)
A Library of Mirrors: Deep Neural Nets in Low Dimensions are Convex Lasso Models with Reflection Features
by: Zeger, Emi, et al.
Published: (2024)
by: Zeger, Emi, et al.
Published: (2024)
ConvexECG: Lightweight and Explainable Neural Networks for Personalized, Continuous Cardiac Monitoring
by: Ansari, Rayan, et al.
Published: (2024)
by: Ansari, Rayan, et al.
Published: (2024)
Unexplored flaws in multiple-choice VQA evaluations
by: Rosenthal, Fabio, et al.
Published: (2025)
by: Rosenthal, Fabio, et al.
Published: (2025)
Gradient Coding with Iterative Block Leverage Score Sampling
by: Charalambides, Neophytos, et al.
Published: (2023)
by: Charalambides, Neophytos, et al.
Published: (2023)
Subtractive Training for Music Stem Insertion using Latent Diffusion Models
by: Villa-Renteria, Ivan, et al.
Published: (2024)
by: Villa-Renteria, Ivan, et al.
Published: (2024)
Similar Items
-
Adaptive Large Language Models By Layerwise Attention Shortcuts
by: Verma, Prateek, et al.
Published: (2024) -
From Complexity to Clarity: Analytical Expressions of Deep Neural Network Weights via Clifford's Geometric Algebra and Convexity
by: Pilanci, Mert
Published: (2023) -
Convex Distillation: Efficient Compression of Deep Networks via Convex Optimization
by: Varshney, Prateek, et al.
Published: (2024) -
Black Boxes and Looking Glasses: Multilevel Symmetries, Reflection Planes, and Convex Optimization in Deep Networks
by: Zeger, Emi, et al.
Published: (2024) -
Optimal Sets and Solution Paths of ReLU Networks
by: Mishkin, Aaron, et al.
Published: (2023)