| Main Authors: | Tseng, Albert, De Sa, Christopher |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2601.21461 |
Similar Items
Model-Preserving Adaptive Rounding
by: Tseng, Albert, et al.
Published: (2025)
QuIP#: Even Better LLM Quantization with Hadamard Incoherence and Lattice Codebooks
by: Tseng, Albert, et al.
Published: (2024)
LOOKAT: Lookup-Optimized Key-Attention for Memory-Efficient Transformers
by: Karmore, Aryan
Published: (2026)
Lookup multivariate Kolmogorov-Arnold Networks
by: Pozdnyakov, Sergey, et al.
Published: (2025)
From Arithmetic to Logic: The Resilience of Logic and Lookup-Based Neural Networks Under Parameter Bit-Flips
by: Bacellar, Alan T. L., et al.
Published: (2026)
ModuLoRA: Finetuning 2-Bit LLMs on Consumer GPUs by Integrating with Modular Quantizers
by: Yin, Junjie, et al.
Published: (2023)
LUT-DLA: Lookup Table as Efficient Extreme Low-Bit Deep Learning Accelerator
by: Li, Guoyu, et al.
Published: (2025)
LookupViT: Compressing visual information to a limited number of tokens
by: Koner, Rajat, et al.
Published: (2024)
Pixel Embedding: Fully Quantized Convolutional Neural Network with Differentiable Lookup Table
by: Tokunaga, Hiroyuki, et al.
Published: (2024)
STAT: Shrinking Transformers After Training
by: Flynn, Megan, et al.
Published: (2024)
Compute-Optimal LLMs Provably Generalize Better With Scale
by: Finzi, Marc, et al.
Published: (2025)
QTIP: Quantization with Trellises and Incoherence Processing
by: Tseng, Albert, et al.
Published: (2024)
SG-XDEAT: Sparsity-Guided Cross-Dimensional and Cross-Encoding Attention with Target-Aware Conditioning in Tabular Learning
by: Cheng, Chih-Chuan, et al.
Published: (2025)
On the Limits of Layer Pruning for Generative Reasoning in Large Language Models
by: Shrestha, Safal, et al.
Published: (2026)
Outlier-Efficient Hopfield Layers for Large Transformer-Based Models
by: Hu, Jerry Yao-Chieh, et al.
Published: (2024)
Golden Layers and Where to Find Them: Improved Knowledge Editing for Large Language Models Via Layer Gradient Analysis
by: Datta, Shrestha, et al.
Published: (2026)
The Structural Scalpel: Automated Contiguous Layer Pruning for Large Language Models
by: Lu, Yao, et al.
Published: (2025)
LayerIF: Estimating Layer Quality for Large Language Models using Influence Functions
by: Askari, Hadi, et al.
Published: (2025)
Hierarchically branched diffusion models leverage dataset structure for class-conditional generation
by: Tseng, Alex M., et al.
Published: (2022)
Understanding and Guiding Layer Placement in Parameter-Efficient Fine-Tuning of Large Language Models
by: Xu, Yichen, et al.
Published: (2026)
Multimodal Survival Analysis with Locally Deployable Large Language Models
by: Gögl, Moritz, et al.
Published: (2026)
ECG-Soup: Harnessing Multi-Layer Synergy for ECG Foundation Models
by: Nguyen, Phu X., et al.
Published: (2025)
CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication through Reinforcement Learning
by: Su, Songqiao, et al.
Published: (2025)
MID-L: Matrix-Interpolated Dropout Layer with Layer-wise Neuron Selection
by: Shaeri, Pouya, et al.
Published: (2025)
L3Ms -- Lagrange Large Language Models
by: Dhillon, Guneet S., et al.
Published: (2024)
Zeroth-Order Fine-Tuning of LLMs with Extreme Sparsity
by: Guo, Wentao, et al.
Published: (2024)
How to Measure the Intelligence of Large Language Models?
by: Körber, Nils, et al.
Published: (2024)
Ablate and Rescue: A Causal Analysis of Residual Stream Hyper-Connections
by: Peng, William, et al.
Published: (2026)
Towards Evolutionary-based Automated Machine Learning for Small Molecule Pharmacokinetic Prediction
by: de Sá, Alex G. C., et al.
Published: (2024)
TEON: Tensorized Orthonormalization Beyond Layer-Wise Muon for Large Language Model Pre-Training
by: Zhang, Ruijie, et al.
Published: (2026)
One Permutation Is All You Need: Fast, Reliable Variable Importance and Model Stress-Testing
by: Dorador, Albert
Published: (2025)
RACE Attention: A Strictly Linear-Time Attention Layer for Training on Outrageously Large Contexts
by: Joshi, Sahil, et al.
Published: (2025)
CALR: Corrective Adaptive Low-Rank Decomposition for Efficient Large Language Model Layer Compression
by: Kautsar, Muchammad Daniyal, et al.
Published: (2025)
Quantifying Empirical Compute-Supervision Tradeoffs in RLVR
by: Mitsuhashi, Ryo, et al.
Published: (2026)
Can Go AIs be adversarially robust?
by: Tseng, Tom, et al.
Published: (2024)
Shadow Cones: A Generalized Framework for Partial Order Embeddings
by: Yu, Tao, et al.
Published: (2023)
On the Nonlinearity of Layer Normalization
by: Ni, Yunhao, et al.
Published: (2024)
Machine learning based radiative parameterization scheme and its performance in operational reforecast experiments
by: Jing, Hao, et al.
Published: (2026)
Arbitrariness and Social Prediction: The Confounding Role of Variance in Fair Classification
by: Cooper, A. Feder, et al.
Published: (2023)
Not Only the Last-Layer Features for Spurious Correlations: All Layer Deep Feature Reweighting
by: Hameed, Humza Wajid, et al.
Published: (2024)