| Main Authors: | Ang, Calvin, Kim, Sungyoon, Pilanci, Mert |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Online Access: | https://arxiv.org/abs/2603.19559 |
Similar Items
Optimal Quantization for Matrix Multiplication
by: Ordentlich, Or, et al.
Published: (2024)
High-Rate Quantized Matrix Multiplication I
by: Ordentlich, Or, et al.
Published: (2026)
High-Rate Quantized Matrix Multiplication II
by: Ordentlich, Or, et al.
Published: (2026)
Optimizer-Induced Mode Connectivity: From AdamW to Muon
by: Zhang, Fangzhao, et al.
Published: (2026)
Optimizing Learned Image Compression on Scalar and Entropy-Constraint Quantization
by: Borzechowski, Florian, et al.
Published: (2025)
Convex Relaxations of ReLU Neural Networks Approximate Global Optima in Polynomial Time
by: Kim, Sungyoon, et al.
Published: (2024)
NestQuant: Nested Lattice Quantization for Matrix Products and LLMs
by: Savkin, Semyon, et al.
Published: (2025)
Persistent Entropy as a Detector of Phase Transitions
by: Rucco, Matteo
Published: (2026)
PrismQuant: Rate-Distortion-Optimal Vector Quantization for Gaussian-Mixture Sources
by: Park, Bumsu, et al.
Published: (2026)
From Complexity to Clarity: Analytical Expressions of Deep Neural Network Weights via Clifford's Geometric Algebra and Convexity
by: Pilanci, Mert
Published: (2023)
FibQuant: Universal Vector Quantization for Random-Access KV-Cache Compression
by: Lee, Namyoon, et al.
Published: (2026)
Deep Learning and Matrix Completion-aided IoT Network Localization in the Outlier Scenarios
by: Kim, Sunwoo
Published: (2025)
Exploring the loss landscape of regularized neural networks via convex duality
by: Kim, Sungyoon, et al.
Published: (2024)
Fibbinary-Based Compression and Quantization for Efficient Neural Radio Receivers
by: Fiandaca, Roberta, et al.
Published: (2025)
Spectral Adapter: Fine-Tuning in Spectral Space
by: Zhang, Fangzhao, et al.
Published: (2024)
A SER-based Device Selection Mechanism in Multi-bits Quantization Federated Learning
by: Sun, Pengcheng, et al.
Published: (2024)
Newton Meets Marchenko-Pastur: Massively Parallel Second-Order Optimization with Hessian Sketching and Debiasing
by: Romanov, Elad, et al.
Published: (2024)
Three Quantization Regimes for ReLU Networks
by: Ou, Weigutian, et al.
Published: (2024)
The Impact of Quantization on Retrieval-Augmented Generation: An Analysis of Small LLMs
by: Yazan, Mert, et al.
Published: (2024)
FlashSketch: Sketch-Kernel Co-Design for Fast Sparse Sketching on GPUs
by: Dwaraknath, Rajat Vadiraj, et al.
Published: (2026)
An Information-Theoretic Perspective on LLM Tokenizers
by: Erdogan, Mete, et al.
Published: (2026)
Thinking While Listening: Simple Test Time Scaling For Audio Classification
by: Verma, Prateek, et al.
Published: (2025)
An Optimal, Universal and Agnostic Decoding Method for Message Reconstruction, Bio and Technosignature Detection
by: Zenil, Hector, et al.
Published: (2023)
End-to-End NOMA with Perfect and Quantized CSI Over Rayleigh Fading Channels
by: Benouadah, Selma, et al.
Published: (2026)
Rate-Distortion Guided Knowledge Graph Construction from Lecture Notes Using Gromov-Wasserstein Optimal Transport
by: An, Yuan, et al.
Published: (2025)
Adaptive Large Language Models By Layerwise Attention Shortcuts
by: Verma, Prateek, et al.
Published: (2024)
Towards Signal Processing In Large Language Models
by: Verma, Prateek, et al.
Published: (2024)
Turbo-CF: Matrix Decomposition-Free Graph Filtering for Fast Recommendation
by: Park, Jin-Duk, et al.
Published: (2024)
SQuat: Subspace-orthogonal KV Cache Quantization
by: Wang, Hao, et al.
Published: (2025)
Order-Optimal Sample Complexity of Rectified Flows
by: Sahoo, Hari Krishna, et al.
Published: (2026)
LightCode: Light Analytical and Neural Codes for Channels with Feedback
by: Ankireddy, Sravan Kumar, et al.
Published: (2024)
Information-Theoretic Equivalence of Entropic Multi-Marginal Optimal Transport: A Theory for Multi-Agent Communication
by: Wang, Shuchan
Published: (2022)
The Causal Information Bottleneck and Optimal Causal Variable Abstractions
by: Simoes, Francisco N. F. Q., et al.
Published: (2024)
Generative Decompression: Optimal Lossy Decoding Against Distribution Mismatch
by: Khosravirad, Saeed R., et al.
Published: (2026)
Optimal Multi-Objective Best Arm Identification with Fixed Confidence
by: Chen, Zhirui, et al.
Published: (2025)
The Normalized Cross Density Functional: A Framework to Quantify Statistical Dependence for Random Processes
by: Hu, Bo, et al.
Published: (2022)
Exploiting Information Redundancy in Attention Maps for Extreme Quantization of Vision Transformers
by: Maisonnave, Lucas, et al.
Published: (2025)
Asymptotically Optimal Linear Best Feasible Arm Identification with Fixed Budget
by: Bian, Jie, et al.
Published: (2025)
Large Language Models Implicitly Learn to See and Hear Just By Reading
by: Verma, Prateek, et al.
Published: (2025)
Almost Minimax Optimal Best Arm Identification in Piecewise Stationary Linear Bandits
by: Hou, Yunlong, et al.
Published: (2024)