Saved in:
| Main Authors: | Gul, Hongyaoxing, Hu, Lijuan, Niu, Shuzi, Liu, Fangfang |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2601.05684 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
TileQ: Efficient Low-Rank Quantization of Mixture-of-Experts with 2D Tiling
by: Gu, Hongyaoxing, et al.
Published: (2026)
by: Gu, Hongyaoxing, et al.
Published: (2026)
LoPRo: Enhancing Low-Rank Quantization via Permuted Block-Wise Rotation
by: Gu, Hongyaoxing, et al.
Published: (2026)
by: Gu, Hongyaoxing, et al.
Published: (2026)
Self-Supervised Learning for Sparse Matrix Reordering
by: Li, Ziwei, et al.
Published: (2026)
by: Li, Ziwei, et al.
Published: (2026)
A method of using RSVD in residual calculation of LowBit GEMM
by: Gu, Hongyaoxing
Published: (2024)
by: Gu, Hongyaoxing
Published: (2024)
Bridging the Gap between Sparse Matrix Reordering and Factorization: A Deep Learning Framework for Fill-in Reduction
by: Li, Ziwei, et al.
Published: (2026)
by: Li, Ziwei, et al.
Published: (2026)
Factorization-in-Loop: Proximal Fill-in Minimization for Sparse Matrix Reordering
by: Li, Ziwei, et al.
Published: (2025)
by: Li, Ziwei, et al.
Published: (2025)
Sketching Low-Rank Plus Diagonal Matrices
by: Fernandez, Andres, et al.
Published: (2025)
by: Fernandez, Andres, et al.
Published: (2025)
SERQ: Saliency-Aware Low-Rank Error Reconstruction for LLM Quantization
by: Park, Yeonsik, et al.
Published: (2026)
by: Park, Yeonsik, et al.
Published: (2026)
LLM-Sketch: Enhancing Network Sketches with LLM
by: Li, Yuanpeng, et al.
Published: (2025)
by: Li, Yuanpeng, et al.
Published: (2025)
Learning Fill-in Reduction Ordering via Graph Policy Optimization for Sparse Matrices
by: Li, Ziwei, et al.
Published: (2026)
by: Li, Ziwei, et al.
Published: (2026)
Low-Rank Correction for Quantized LLMs
by: Scetbon, Meyer, et al.
Published: (2024)
by: Scetbon, Meyer, et al.
Published: (2024)
UltraSketchLLM: Saliency-Driven Sketching for Ultra-Low Bit LLM Compression
by: Zou, Sunan, et al.
Published: (2025)
by: Zou, Sunan, et al.
Published: (2025)
Faster Linear Systems and Matrix Norm Approximation via Multi-level Sketched Preconditioning
by: Dereziński, Michał, et al.
Published: (2024)
by: Dereziński, Michał, et al.
Published: (2024)
Tailed Low-Rank Matrix Factorization for Similarity Matrix Completion
by: Ma, Changyi, et al.
Published: (2024)
by: Ma, Changyi, et al.
Published: (2024)
Breaking the Blocks: Continuous Low-Rank Decomposed Scaling for Unified LLM Quantization and Adaptation
by: Tang, Pingzhi, et al.
Published: (2026)
by: Tang, Pingzhi, et al.
Published: (2026)
Quantization-Robust LLM Unlearning via Low-Rank Adaptation
by: Abitante, João Vitor Boer, et al.
Published: (2026)
by: Abitante, João Vitor Boer, et al.
Published: (2026)
Low-Rank Extragradient Method for Nonsmooth and Low-Rank Matrix Optimization Problems
by: Garber, Dan, et al.
Published: (2022)
by: Garber, Dan, et al.
Published: (2022)
Low-Rank Mirror-Prox for Nonsmooth and Low-Rank Matrix Optimization Problems
by: Garber, Dan, et al.
Published: (2022)
by: Garber, Dan, et al.
Published: (2022)
FlexLoRA: Entropy-Guided Flexible Low-Rank Adaptation
by: Liu, Muqing, et al.
Published: (2026)
by: Liu, Muqing, et al.
Published: (2026)
CoreFlow: Low-Rank Matrix Generative Models
by: Wu, Dongze, et al.
Published: (2026)
by: Wu, Dongze, et al.
Published: (2026)
A Probabilistic Basis for Low-Rank Matrix Learning
by: Segert, Simon, et al.
Published: (2025)
by: Segert, Simon, et al.
Published: (2025)
Training Acceleration of Low-Rank Decomposed Networks using Sequential Freezing and Rank Quantization
by: Hajimolahoseini, Habib, et al.
Published: (2023)
by: Hajimolahoseini, Habib, et al.
Published: (2023)
Efficient Low-Rank Matrix Estimation, Experimental Design, and Arm-Set-Dependent Low-Rank Bandits
by: Jang, Kyoungseok, et al.
Published: (2024)
by: Jang, Kyoungseok, et al.
Published: (2024)
Heaviside Low-Rank Support Matrix Machine
by: Xiu, Xianchao, et al.
Published: (2026)
by: Xiu, Xianchao, et al.
Published: (2026)
Efficient Federated Low Rank Matrix Completion
by: Abbasi, Ahmed Ali, et al.
Published: (2024)
by: Abbasi, Ahmed Ali, et al.
Published: (2024)
Accelerating Power Method with Fast Sketching for Stronger Low-Rank Approximation
by: Chenakkod, Shabarish, et al.
Published: (2026)
by: Chenakkod, Shabarish, et al.
Published: (2026)
LQER: Low-Rank Quantization Error Reconstruction for LLMs
by: Zhang, Cheng, et al.
Published: (2024)
by: Zhang, Cheng, et al.
Published: (2024)
LoQT: Low-Rank Adapters for Quantized Pretraining
by: Loeschcke, Sebastian, et al.
Published: (2024)
by: Loeschcke, Sebastian, et al.
Published: (2024)
Low-Rank Quantization-Aware Training for LLMs
by: Bondarenko, Yelysei, et al.
Published: (2024)
by: Bondarenko, Yelysei, et al.
Published: (2024)
Sketch to Adapt: Fine-Tunable Sketches for Efficient LLM Adaptation
by: Zhang, Tianyi, et al.
Published: (2024)
by: Zhang, Tianyi, et al.
Published: (2024)
Revisiting Matrix Sketching in Linear Bandits: Achieving Sublinear Regret via Dyadic Block Sketching
by: Wen, Dongxie, et al.
Published: (2024)
by: Wen, Dongxie, et al.
Published: (2024)
FrameQuant: Flexible Low-Bit Quantization for Transformers
by: Adepu, Harshavardhan, et al.
Published: (2024)
by: Adepu, Harshavardhan, et al.
Published: (2024)
Efficient Frameworks for Generalized Low-Rank Matrix Bandit Problems
by: Kang, Yue, et al.
Published: (2024)
by: Kang, Yue, et al.
Published: (2024)
Fundamental limits of Non-Linear Low-Rank Matrix Estimation
by: Mergny, Pierre, et al.
Published: (2024)
by: Mergny, Pierre, et al.
Published: (2024)
Generalized Low-Rank Matrix Contextual Bandits with Graph Information
by: Wang, Yao, et al.
Published: (2025)
by: Wang, Yao, et al.
Published: (2025)
Low-Rank Matrix Approximation for Neural Network Compression
by: Cherukuri, Kalyan, et al.
Published: (2025)
by: Cherukuri, Kalyan, et al.
Published: (2025)
Matrix Low-Rank Approximation For Policy Gradient Methods
by: Rozada, Sergio, et al.
Published: (2024)
by: Rozada, Sergio, et al.
Published: (2024)
Matrix Low-Rank Trust Region Policy Optimization
by: Rozada, Sergio, et al.
Published: (2024)
by: Rozada, Sergio, et al.
Published: (2024)
New Hardness Results for Low-Rank Matrix Completion
by: Chawin, Dror, et al.
Published: (2025)
by: Chawin, Dror, et al.
Published: (2025)
Clustering-Based Low-Rank Matrix Approximation for Medical Image Compression
by: Hamlomo, Sisipho, et al.
Published: (2025)
by: Hamlomo, Sisipho, et al.
Published: (2025)
Similar Items
-
TileQ: Efficient Low-Rank Quantization of Mixture-of-Experts with 2D Tiling
by: Gu, Hongyaoxing, et al.
Published: (2026) -
LoPRo: Enhancing Low-Rank Quantization via Permuted Block-Wise Rotation
by: Gu, Hongyaoxing, et al.
Published: (2026) -
Self-Supervised Learning for Sparse Matrix Reordering
by: Li, Ziwei, et al.
Published: (2026) -
A method of using RSVD in residual calculation of LowBit GEMM
by: Gu, Hongyaoxing
Published: (2024) -
Bridging the Gap between Sparse Matrix Reordering and Factorization: A Deep Learning Framework for Fill-in Reduction
by: Li, Ziwei, et al.
Published: (2026)