:: Library Catalog

Bibliographic Details
Main Authors:	Hor, Soheil, Qian, Ying, Pilanci, Mert, Arbabian, Amin
Format:	Preprint
Published:	2024
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2402.04359

Similar Items

Adaptive Large Language Models By Layerwise Attention Shortcuts
by: Verma, Prateek, et al.
Published: (2024)

From Complexity to Clarity: Analytical Expressions of Deep Neural Network Weights via Clifford's Geometric Algebra and Convexity
by: Pilanci, Mert
Published: (2023)

Convex Distillation: Efficient Compression of Deep Networks via Convex Optimization
by: Varshney, Prateek, et al.
Published: (2024)

Black Boxes and Looking Glasses: Multilevel Symmetries, Reflection Planes, and Convex Optimization in Deep Networks
by: Zeger, Emi, et al.
Published: (2024)

Optimal Sets and Solution Paths of ReLU Networks
by: Mishkin, Aaron, et al.
Published: (2023)

AdaPTwin: Low-Cost Adaptive Compression of Product Twins in Transformers
by: Biju, Emil, et al.
Published: (2024)

Analyzing Neural Network-Based Generative Diffusion Models through Convex Optimization
by: Zhang, Fangzhao, et al.
Published: (2024)

Convex Relaxations of ReLU Neural Networks Approximate Global Optima in Polynomial Time
by: Kim, Sungyoon, et al.
Published: (2024)

Spectral Adapter: Fine-Tuning in Spectral Space
by: Zhang, Fangzhao, et al.
Published: (2024)

Unveiling Hidden Convexity in Deep Learning: a Sparse Signal Processing Perspective
by: Zeger, Emi, et al.
Published: (2026)

Riemannian Preconditioned LoRA for Fine-Tuning Foundation Models
by: Zhang, Fangzhao, et al.
Published: (2024)

A Recovery Guarantee for Sparse Neural Networks
by: Fridovich-Keil, Sara, et al.
Published: (2025)

Thinking While Listening: Simple Test Time Scaling For Audio Classification
by: Verma, Prateek, et al.
Published: (2025)

Exploring the loss landscape of regularized neural networks via convex duality
by: Kim, Sungyoon, et al.
Published: (2024)

Fast Convex Optimization for Two-Layer ReLU Networks: Equivalent Model Classes and Cone Decompositions
by: Mishkin, Aaron, et al.
Published: (2022)

Towards Signal Processing In Large Language Models
by: Verma, Prateek, et al.
Published: (2024)

Faster Convergence of Stochastic Accelerated Gradient Descent under Interpolation
by: Mishkin, Aaron, et al.
Published: (2024)

Active Learning of Deep Neural Networks via Gradient-Free Cutting Planes
by: Zhang, Erica, et al.
Published: (2024)

CRONOS: Enhancing Deep Learning with Scalable GPU Accelerated Convex Neural Networks
by: Feng, Miria, et al.
Published: (2024)

Adversarial Training of Two-Layer Polynomial and ReLU Activation Networks via Convex Optimization
by: Kuelbs, Daniel, et al.
Published: (2024)

Large Language Models Implicitly Learn to See and Hear Just By Reading
by: Verma, Prateek, et al.
Published: (2025)

Convex Optimization for Alignment and Preference Learning on a Single GPU
by: Feng, Miria, et al.
Published: (2026)

Convex Low-resource Accent-Robust Language Detection in Speech Recognition
by: Feng, Miria, et al.
Published: (2026)

Newton Meets Marchenko-Pastur: Massively Parallel Second-Order Optimization with Hessian Sketching and Debiasing
by: Romanov, Elad, et al.
Published: (2024)

MatRL: Provably Generalizable Iterative Algorithm Discovery via Monte-Carlo Tree Search
by: Kim, Sungyoon, et al.
Published: (2025)

NanoFlux: Adversarial Dual-LLM Evaluation and Distillation For Multi-Domain Reasoning
by: Anantha, Raviteja, et al.
Published: (2025)

Randomized Geometric Algebra Methods for Convex Neural Networks
by: Wang, Yifei, et al.
Published: (2024)

Learning When to Trust LLM Priors: A Validated Framework for Semantic Prior Integration
by: Zhang, Erica, et al.
Published: (2026)

Optimizer-Induced Mode Connectivity: From AdamW to Muon
by: Zhang, Fangzhao, et al.
Published: (2026)

A Measure-Theoretic Finite-Sample Theory for Adaptive-Data Fitted Q-Iteration
by: Haussmann, Manuel, et al.
Published: (2026)

Compressing Large Language Models using Low Rank and Low Precision Decomposition
by: Saha, Rajarshi, et al.
Published: (2024)

Deep-Learning-Directed Preventive Dynamic Security Control via Coordinated Demand Response
by: Masoumi, Amin, et al.
Published: (2025)

Distributed Markov Chain Monte Carlo Sampling based on the Alternating Direction Method of Multipliers
by: Tzikas, Alexandros E., et al.
Published: (2024)

Birefringence-free photoelastic modulator with centimeter-square aperture operating at 2.7 MHz with sub-watt drive power
by: Atalar, Okan, et al.
Published: (2024)

Polarization-insensitive wide-angle resonant acousto-optic phase modulator
by: Atalar, Okan, et al.
Published: (2024)

A Library of Mirrors: Deep Neural Nets in Low Dimensions are Convex Lasso Models with Reflection Features
by: Zeger, Emi, et al.
Published: (2024)

ConvexECG: Lightweight and Explainable Neural Networks for Personalized, Continuous Cardiac Monitoring
by: Ansari, Rayan, et al.
Published: (2024)

Unexplored flaws in multiple-choice VQA evaluations
by: Rosenthal, Fabio, et al.
Published: (2025)

Gradient Coding with Iterative Block Leverage Score Sampling
by: Charalambides, Neophytos, et al.
Published: (2023)

Subtractive Training for Music Stem Insertion using Latent Diffusion Models
by: Villa-Renteria, Ivan, et al.
Published: (2024)