:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Choudhary, Sakshi, Saxena, Utkarsh, Roy, Kaushik
Format:	Preprint
Published:	2026
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2604.09406
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Eigen Attention: Attention in Low-Rank Space for KV Cache Compression
by: Saxena, Utkarsh, et al.
Published: (2024)

TRIM: Token-wise Attention-Derived Saliency for Data-Efficient Instruction Tuning
by: Nagaraj, Manish, et al.
Published: (2025)

KVLinC : KV Cache Quantization with Hadamard Rotation and Linear Correction
by: Saxena, Utkarsh, et al.
Published: (2025)

Averaging Rate Scheduler for Decentralized Learning on Heterogeneous Data
by: Aketi, Sai Aparna, et al.
Published: (2024)

SADDLe: Sharpness-Aware Decentralized Deep Learning with Heterogeneous Data
by: Choudhary, Sakshi, et al.
Published: (2024)

CODE-CL: Conceptor-Based Gradient Projection for Deep Continual Learning
by: Apolinario, Marco Paul E., et al.
Published: (2024)

ResQ: Mixed-Precision Quantization of Large Language Models with Low-Rank Residuals
by: Saxena, Utkarsh, et al.
Published: (2024)

Memory-Efficient LLM Training with Online Subspace Descent
by: Liang, Kaizhao, et al.
Published: (2024)

PRAC: Principal-Random Subspace for LLM Activation Compression and Memory-Efficient Training
by: Li, Yanyi, et al.
Published: (2026)

LANCE: Low Rank Activation Compression for Efficient On-Device Continual Learning
by: Apolinario, Marco Paul E., et al.
Published: (2025)

SPREAD: Subspace Representation Distillation for Lifelong Imitation Learning
by: Roy, Kaushik, et al.
Published: (2026)

SlimDiff: Training-Free, Activation-Guided Hands-free Slimming of Diffusion Models
by: Roy, Arani, et al.
Published: (2025)

Learning When to Attend: Conditional Memory Access for Long-Context LLMs
by: Choudhary, Sakshi, et al.
Published: (2026)

MANGO: Meta-Adaptive Network Gradient Optimization for Online Continual Learning
by: Awasthi, Ankita, et al.
Published: (2026)

A Memory Efficient Randomized Subspace Optimization Method for Training Large Language Models
by: Chen, Yiming, et al.
Published: (2025)

HCiM: ADC-Less Hybrid Analog-Digital Compute in Memory Accelerator for Deep Learning Workloads
by: Negi, Shubham, et al.
Published: (2024)

Scalable Neural Network Training over Distributed Graphs
by: Kolluri, Aashish, et al.
Published: (2023)

SUMO: Subspace-Aware Moment-Orthogonalization for Accelerating Memory-Efficient LLM Training
by: Refael, Yehonathan, et al.
Published: (2025)

Post Training Quantization of Large Language Models with Microscaling Formats
by: Sharify, Sayeh, et al.
Published: (2024)

On the Noise Stability and Robustness of Adversarially Trained Networks on NVM Crossbars
by: Tao, Chun, et al.
Published: (2021)

CompAct: Compressed Activations for Memory-Efficient LLM Training
by: Shamshoum, Yara, et al.
Published: (2024)

EigenLoRAx: Recycling Adapters to Find Principal Subspaces for Resource-Efficient Adaptation and Inference
by: Kaushik, Prakhar, et al.
Published: (2025)

CoMERA: Computing- and Memory-Efficient Training via Rank-Adaptive Tensor Optimization
by: Yang, Zi, et al.
Published: (2024)

Learning Scalable Model Soup on a Single GPU: An Efficient Subspace Training Strategy
by: Li, Tao, et al.
Published: (2024)

Randomized Gradient Subspaces for Efficient Large Language Model Training
by: Rajabi, Sahar, et al.
Published: (2025)

Efficient Resource-Constrained Training of Transformers via Subspace Optimization
by: Nguyen, Le-Trung, et al.
Published: (2025)

Attacking Byzantine Robust Aggregation in High Dimensions
by: Choudhary, Sarthak, et al.
Published: (2023)

SAP: Corrective Machine Unlearning with Scaled Activation Projection for Label Noise Robustness
by: Kodge, Sangamesh, et al.
Published: (2024)

ELLA: Efficient Lifelong Learning for Adapters in Large Language Models
by: Biswas, Shristi Das, et al.
Published: (2026)

COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 Training
by: Xi, Haocheng, et al.
Published: (2024)

Understanding In-context Learning of Addition via Activation Subspaces
by: Hu, Xinyan, et al.
Published: (2025)

Thinking Forward: Memory-Efficient Federated Finetuning of Language Models
by: Panchal, Kunjal, et al.
Published: (2024)

Shared LoRA Subspaces for almost Strict Continual Learning
by: Kaushik, Prakhar, et al.
Published: (2026)

FINDER: Stochastic Mirroring of Noisy Quasi-Newton Search and Deep Network Training
by: Suman, Uttam, et al.
Published: (2024)

WWW: What, When, Where to Compute-in-Memory
by: Sharma, Tanvi, et al.
Published: (2023)

Inverted Activations: Reducing Memory Footprint in Neural Network Training
by: Novikov, Georgii, et al.
Published: (2024)

The Universal Weight Subspace Hypothesis
by: Kaushik, Prakhar, et al.
Published: (2025)

Auction-Based Online Policy Adaptation for Evolving Objectives
by: Shabadi, Guruprerana, et al.
Published: (2026)

AdaGossip: Adaptive Consensus Step-size for Decentralized Deep Learning with Communication Compression
by: Aketi, Sai Aparna, et al.
Published: (2024)

Curvature Clues: Decoding Deep Learning Privacy with Input Loss Curvature
by: Ravikumar, Deepak, et al.
Published: (2024)