Saved in:
| Main Authors: | Choudhary, Sakshi, Saxena, Utkarsh, Roy, Kaushik |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.09406 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Eigen Attention: Attention in Low-Rank Space for KV Cache Compression
by: Saxena, Utkarsh, et al.
Published: (2024)
by: Saxena, Utkarsh, et al.
Published: (2024)
TRIM: Token-wise Attention-Derived Saliency for Data-Efficient Instruction Tuning
by: Nagaraj, Manish, et al.
Published: (2025)
by: Nagaraj, Manish, et al.
Published: (2025)
KVLinC : KV Cache Quantization with Hadamard Rotation and Linear Correction
by: Saxena, Utkarsh, et al.
Published: (2025)
by: Saxena, Utkarsh, et al.
Published: (2025)
Averaging Rate Scheduler for Decentralized Learning on Heterogeneous Data
by: Aketi, Sai Aparna, et al.
Published: (2024)
by: Aketi, Sai Aparna, et al.
Published: (2024)
SADDLe: Sharpness-Aware Decentralized Deep Learning with Heterogeneous Data
by: Choudhary, Sakshi, et al.
Published: (2024)
by: Choudhary, Sakshi, et al.
Published: (2024)
CODE-CL: Conceptor-Based Gradient Projection for Deep Continual Learning
by: Apolinario, Marco Paul E., et al.
Published: (2024)
by: Apolinario, Marco Paul E., et al.
Published: (2024)
ResQ: Mixed-Precision Quantization of Large Language Models with Low-Rank Residuals
by: Saxena, Utkarsh, et al.
Published: (2024)
by: Saxena, Utkarsh, et al.
Published: (2024)
Memory-Efficient LLM Training with Online Subspace Descent
by: Liang, Kaizhao, et al.
Published: (2024)
by: Liang, Kaizhao, et al.
Published: (2024)
PRAC: Principal-Random Subspace for LLM Activation Compression and Memory-Efficient Training
by: Li, Yanyi, et al.
Published: (2026)
by: Li, Yanyi, et al.
Published: (2026)
LANCE: Low Rank Activation Compression for Efficient On-Device Continual Learning
by: Apolinario, Marco Paul E., et al.
Published: (2025)
by: Apolinario, Marco Paul E., et al.
Published: (2025)
SPREAD: Subspace Representation Distillation for Lifelong Imitation Learning
by: Roy, Kaushik, et al.
Published: (2026)
by: Roy, Kaushik, et al.
Published: (2026)
SlimDiff: Training-Free, Activation-Guided Hands-free Slimming of Diffusion Models
by: Roy, Arani, et al.
Published: (2025)
by: Roy, Arani, et al.
Published: (2025)
Learning When to Attend: Conditional Memory Access for Long-Context LLMs
by: Choudhary, Sakshi, et al.
Published: (2026)
by: Choudhary, Sakshi, et al.
Published: (2026)
MANGO: Meta-Adaptive Network Gradient Optimization for Online Continual Learning
by: Awasthi, Ankita, et al.
Published: (2026)
by: Awasthi, Ankita, et al.
Published: (2026)
A Memory Efficient Randomized Subspace Optimization Method for Training Large Language Models
by: Chen, Yiming, et al.
Published: (2025)
by: Chen, Yiming, et al.
Published: (2025)
HCiM: ADC-Less Hybrid Analog-Digital Compute in Memory Accelerator for Deep Learning Workloads
by: Negi, Shubham, et al.
Published: (2024)
by: Negi, Shubham, et al.
Published: (2024)
Scalable Neural Network Training over Distributed Graphs
by: Kolluri, Aashish, et al.
Published: (2023)
by: Kolluri, Aashish, et al.
Published: (2023)
SUMO: Subspace-Aware Moment-Orthogonalization for Accelerating Memory-Efficient LLM Training
by: Refael, Yehonathan, et al.
Published: (2025)
by: Refael, Yehonathan, et al.
Published: (2025)
Post Training Quantization of Large Language Models with Microscaling Formats
by: Sharify, Sayeh, et al.
Published: (2024)
by: Sharify, Sayeh, et al.
Published: (2024)
On the Noise Stability and Robustness of Adversarially Trained Networks on NVM Crossbars
by: Tao, Chun, et al.
Published: (2021)
by: Tao, Chun, et al.
Published: (2021)
CompAct: Compressed Activations for Memory-Efficient LLM Training
by: Shamshoum, Yara, et al.
Published: (2024)
by: Shamshoum, Yara, et al.
Published: (2024)
EigenLoRAx: Recycling Adapters to Find Principal Subspaces for Resource-Efficient Adaptation and Inference
by: Kaushik, Prakhar, et al.
Published: (2025)
by: Kaushik, Prakhar, et al.
Published: (2025)
CoMERA: Computing- and Memory-Efficient Training via Rank-Adaptive Tensor Optimization
by: Yang, Zi, et al.
Published: (2024)
by: Yang, Zi, et al.
Published: (2024)
Learning Scalable Model Soup on a Single GPU: An Efficient Subspace Training Strategy
by: Li, Tao, et al.
Published: (2024)
by: Li, Tao, et al.
Published: (2024)
Randomized Gradient Subspaces for Efficient Large Language Model Training
by: Rajabi, Sahar, et al.
Published: (2025)
by: Rajabi, Sahar, et al.
Published: (2025)
Efficient Resource-Constrained Training of Transformers via Subspace Optimization
by: Nguyen, Le-Trung, et al.
Published: (2025)
by: Nguyen, Le-Trung, et al.
Published: (2025)
Attacking Byzantine Robust Aggregation in High Dimensions
by: Choudhary, Sarthak, et al.
Published: (2023)
by: Choudhary, Sarthak, et al.
Published: (2023)
SAP: Corrective Machine Unlearning with Scaled Activation Projection for Label Noise Robustness
by: Kodge, Sangamesh, et al.
Published: (2024)
by: Kodge, Sangamesh, et al.
Published: (2024)
ELLA: Efficient Lifelong Learning for Adapters in Large Language Models
by: Biswas, Shristi Das, et al.
Published: (2026)
by: Biswas, Shristi Das, et al.
Published: (2026)
COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 Training
by: Xi, Haocheng, et al.
Published: (2024)
by: Xi, Haocheng, et al.
Published: (2024)
Understanding In-context Learning of Addition via Activation Subspaces
by: Hu, Xinyan, et al.
Published: (2025)
by: Hu, Xinyan, et al.
Published: (2025)
Thinking Forward: Memory-Efficient Federated Finetuning of Language Models
by: Panchal, Kunjal, et al.
Published: (2024)
by: Panchal, Kunjal, et al.
Published: (2024)
Shared LoRA Subspaces for almost Strict Continual Learning
by: Kaushik, Prakhar, et al.
Published: (2026)
by: Kaushik, Prakhar, et al.
Published: (2026)
FINDER: Stochastic Mirroring of Noisy Quasi-Newton Search and Deep Network Training
by: Suman, Uttam, et al.
Published: (2024)
by: Suman, Uttam, et al.
Published: (2024)
WWW: What, When, Where to Compute-in-Memory
by: Sharma, Tanvi, et al.
Published: (2023)
by: Sharma, Tanvi, et al.
Published: (2023)
Inverted Activations: Reducing Memory Footprint in Neural Network Training
by: Novikov, Georgii, et al.
Published: (2024)
by: Novikov, Georgii, et al.
Published: (2024)
The Universal Weight Subspace Hypothesis
by: Kaushik, Prakhar, et al.
Published: (2025)
by: Kaushik, Prakhar, et al.
Published: (2025)
Auction-Based Online Policy Adaptation for Evolving Objectives
by: Shabadi, Guruprerana, et al.
Published: (2026)
by: Shabadi, Guruprerana, et al.
Published: (2026)
AdaGossip: Adaptive Consensus Step-size for Decentralized Deep Learning with Communication Compression
by: Aketi, Sai Aparna, et al.
Published: (2024)
by: Aketi, Sai Aparna, et al.
Published: (2024)
Curvature Clues: Decoding Deep Learning Privacy with Input Loss Curvature
by: Ravikumar, Deepak, et al.
Published: (2024)
by: Ravikumar, Deepak, et al.
Published: (2024)
Similar Items
-
Eigen Attention: Attention in Low-Rank Space for KV Cache Compression
by: Saxena, Utkarsh, et al.
Published: (2024) -
TRIM: Token-wise Attention-Derived Saliency for Data-Efficient Instruction Tuning
by: Nagaraj, Manish, et al.
Published: (2025) -
KVLinC : KV Cache Quantization with Hadamard Rotation and Linear Correction
by: Saxena, Utkarsh, et al.
Published: (2025) -
Averaging Rate Scheduler for Decentralized Learning on Heterogeneous Data
by: Aketi, Sai Aparna, et al.
Published: (2024) -
SADDLe: Sharpness-Aware Decentralized Deep Learning with Heterogeneous Data
by: Choudhary, Sakshi, et al.
Published: (2024)