:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Tseng, Albert, De Sa, Christopher
Format:	Preprint
Published:	2026
Subjects:	Machine Learning Artificial Intelligence
Online Access:	https://arxiv.org/abs/2601.21461
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Model-Preserving Adaptive Rounding
by: Tseng, Albert, et al.
Published: (2025)

QuIP#: Even Better LLM Quantization with Hadamard Incoherence and Lattice Codebooks
by: Tseng, Albert, et al.
Published: (2024)

LOOKAT: Lookup-Optimized Key-Attention for Memory-Efficient Transformers
by: Karmore, Aryan
Published: (2026)

Lookup multivariate Kolmogorov-Arnold Networks
by: Pozdnyakov, Sergey, et al.
Published: (2025)

From Arithmetic to Logic: The Resilience of Logic and Lookup-Based Neural Networks Under Parameter Bit-Flips
by: Bacellar, Alan T. L., et al.
Published: (2026)

ModuLoRA: Finetuning 2-Bit LLMs on Consumer GPUs by Integrating with Modular Quantizers
by: Yin, Junjie, et al.
Published: (2023)

LUT-DLA: Lookup Table as Efficient Extreme Low-Bit Deep Learning Accelerator
by: Li, Guoyu, et al.
Published: (2025)

LookupViT: Compressing visual information to a limited number of tokens
by: Koner, Rajat, et al.
Published: (2024)

Pixel Embedding: Fully Quantized Convolutional Neural Network with Differentiable Lookup Table
by: Tokunaga, Hiroyuki, et al.
Published: (2024)

STAT: Shrinking Transformers After Training
by: Flynn, Megan, et al.
Published: (2024)

Compute-Optimal LLMs Provably Generalize Better With Scale
by: Finzi, Marc, et al.
Published: (2025)

QTIP: Quantization with Trellises and Incoherence Processing
by: Tseng, Albert, et al.
Published: (2024)

SG-XDEAT: Sparsity-Guided Cross-Dimensional and Cross-Encoding Attention with Target-Aware Conditioning in Tabular Learning
by: Cheng, Chih-Chuan, et al.
Published: (2025)

On the Limits of Layer Pruning for Generative Reasoning in Large Language Models
by: Shrestha, Safal, et al.
Published: (2026)

Outlier-Efficient Hopfield Layers for Large Transformer-Based Models
by: Hu, Jerry Yao-Chieh, et al.
Published: (2024)

Golden Layers and Where to Find Them: Improved Knowledge Editing for Large Language Models Via Layer Gradient Analysis
by: Datta, Shrestha, et al.
Published: (2026)

The Structural Scalpel: Automated Contiguous Layer Pruning for Large Language Models
by: Lu, Yao, et al.
Published: (2025)

LayerIF: Estimating Layer Quality for Large Language Models using Influence Functions
by: Askari, Hadi, et al.
Published: (2025)

Hierarchically branched diffusion models leverage dataset structure for class-conditional generation
by: Tseng, Alex M., et al.
Published: (2022)

Understanding and Guiding Layer Placement in Parameter-Efficient Fine-Tuning of Large Language Models
by: Xu, Yichen, et al.
Published: (2026)

Multimodal Survival Analysis with Locally Deployable Large Language Models
by: Gögl, Moritz, et al.
Published: (2026)

ECG-Soup: Harnessing Multi-Layer Synergy for ECG Foundation Models
by: Nguyen, Phu X., et al.
Published: (2025)

CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication through Reinforcement Learning
by: Su, Songqiao, et al.
Published: (2025)

MID-L: Matrix-Interpolated Dropout Layer with Layer-wise Neuron Selection
by: Shaeri, Pouya, et al.
Published: (2025)

L3Ms -- Lagrange Large Language Models
by: Dhillon, Guneet S., et al.
Published: (2024)

Zeroth-Order Fine-Tuning of LLMs with Extreme Sparsity
by: Guo, Wentao, et al.
Published: (2024)

How to Measure the Intelligence of Large Language Models?
by: Körber, Nils, et al.
Published: (2024)

Ablate and Rescue: A Causal Analysis of Residual Stream Hyper-Connections
by: Peng, William, et al.
Published: (2026)

Towards Evolutionary-based Automated Machine Learning for Small Molecule Pharmacokinetic Prediction
by: de Sá, Alex G. C., et al.
Published: (2024)

TEON: Tensorized Orthonormalization Beyond Layer-Wise Muon for Large Language Model Pre-Training
by: Zhang, Ruijie, et al.
Published: (2026)

One Permutation Is All You Need: Fast, Reliable Variable Importance and Model Stress-Testing
by: Dorador, Albert
Published: (2025)

RACE Attention: A Strictly Linear-Time Attention Layer for Training on Outrageously Large Contexts
by: Joshi, Sahil, et al.
Published: (2025)

CALR: Corrective Adaptive Low-Rank Decomposition for Efficient Large Language Model Layer Compression
by: Kautsar, Muchammad Daniyal, et al.
Published: (2025)

Quantifying Empirical Compute-Supervision Tradeoffs in RLVR
by: Mitsuhashi, Ryo, et al.
Published: (2026)

Can Go AIs be adversarially robust?
by: Tseng, Tom, et al.
Published: (2024)

Shadow Cones: A Generalized Framework for Partial Order Embeddings
by: Yu, Tao, et al.
Published: (2023)

On the Nonlinearity of Layer Normalization
by: Ni, Yunhao, et al.
Published: (2024)

Machine learning based radiative parameterization scheme and its performance in operational reforecast experiments
by: Jing, Hao, et al.
Published: (2026)

Arbitrariness and Social Prediction: The Confounding Role of Variance in Fair Classification
by: Cooper, A. Feder, et al.
Published: (2023)

Not Only the Last-Layer Features for Spurious Correlations: All Layer Deep Feature Reweighting
by: Hameed, Humza Wajid, et al.
Published: (2024)