Saved in:
| Main Authors: | Ahn, Myeonghwan, Yoo, Sungjoo |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2505.11170 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
LAQuant: A Simple Overhead-free Large Reasoning Model Quantization by Layer-wise Lookahead Loss
by: Choi, Euntae, et al.
Published: (2026)
by: Choi, Euntae, et al.
Published: (2026)
NSNQuant: A Double Normalization Approach for Calibration-Free Low-Bit Vector Quantization of KV Cache
by: Son, Donghyun, et al.
Published: (2025)
by: Son, Donghyun, et al.
Published: (2025)
Rotate, Clip, and Partition: Towards W2A4KV4 Quantization by Integrating Rotation and Learnable Non-uniform Quantizer
by: Choi, Euntae, et al.
Published: (2025)
by: Choi, Euntae, et al.
Published: (2025)
Grouped Sequency-arranged Rotation: Optimizing Rotation Transformation for Quantization for Free
by: Choi, Euntae, et al.
Published: (2025)
by: Choi, Euntae, et al.
Published: (2025)
MetaMix: Meta-state Precision Searcher for Mixed-precision Activation Quantization
by: Kim, Han-Byul, et al.
Published: (2023)
by: Kim, Han-Byul, et al.
Published: (2023)
Rethinking Weight Tying: Pseudo-Inverse Tying for LM Stable Training and Updates
by: Gu, Jian, et al.
Published: (2026)
by: Gu, Jian, et al.
Published: (2026)
Efficient Multi-bit Quantization Network Training via Weight Bias Correction and Bit-wise Coreset Sampling
by: Kim, Jinhee, et al.
Published: (2025)
by: Kim, Jinhee, et al.
Published: (2025)
Breaking MLPerf Training: A Case Study on Optimizing BERT
by: Kim, Yongdeok, et al.
Published: (2024)
by: Kim, Yongdeok, et al.
Published: (2024)
Sample Weight Averaging for Stable Prediction
by: Yu, Han, et al.
Published: (2025)
by: Yu, Han, et al.
Published: (2025)
LRQ: Optimizing Post-Training Quantization for Large Language Models by Learning Low-Rank Weight-Scaling Matrices
by: Lee, Jung Hyun, et al.
Published: (2024)
by: Lee, Jung Hyun, et al.
Published: (2024)
Energy-Weighted Flow Matching: Unlocking Continuous Normalizing Flows for Efficient and Scalable Boltzmann Sampling
by: Dern, Niclas, et al.
Published: (2025)
by: Dern, Niclas, et al.
Published: (2025)
ADMM-Q: An Improved Hessian-based Weight Quantizer for Post-Training Quantization of Large Language Models
by: Lucas, Ryan, et al.
Published: (2026)
by: Lucas, Ryan, et al.
Published: (2026)
Scalable Bayesian Structure Learning for Gaussian Graphical Models Using Marginal Pseudo-likelihood
by: Mohammadi, Reza, et al.
Published: (2023)
by: Mohammadi, Reza, et al.
Published: (2023)
Infinite Sampling: Efficient and Stable Grouped RL Training for Large Language Models
by: Wang, Liangyu, et al.
Published: (2025)
by: Wang, Liangyu, et al.
Published: (2025)
StableQAT: Stable Quantization-Aware Training at Ultra-Low Bitwidths
by: Chen, Tianyi, et al.
Published: (2026)
by: Chen, Tianyi, et al.
Published: (2026)
DAQ: Density-Aware Post-Training Weight-Only Quantization For LLMs
by: Luo, Yingsong, et al.
Published: (2024)
by: Luo, Yingsong, et al.
Published: (2024)
MagR: Weight Magnitude Reduction for Enhancing Post-Training Quantization
by: Zhang, Aozhong, et al.
Published: (2024)
by: Zhang, Aozhong, et al.
Published: (2024)
DAQ: Delta-Aware Quantization for Post-Training LLM Weight Compression
by: Yu, Xiaoming, et al.
Published: (2026)
by: Yu, Xiaoming, et al.
Published: (2026)
EfQAT: An Efficient Framework for Quantization-Aware Training
by: Ashkboos, Saleh, et al.
Published: (2024)
by: Ashkboos, Saleh, et al.
Published: (2024)
Pseudo-Asynchronous Local SGD: Robust and Efficient Data-Parallel Training
by: Naganuma, Hiroki, et al.
Published: (2025)
by: Naganuma, Hiroki, et al.
Published: (2025)
Sampling and Loss Weights in Multi-Domain Training
by: Salmani, Mahdi, et al.
Published: (2025)
by: Salmani, Mahdi, et al.
Published: (2025)
Neural Quantum Spectral Operator Learning for Solving Partial Differential Equations
by: Kim, Chanyoung, et al.
Published: (2026)
by: Kim, Chanyoung, et al.
Published: (2026)
ECO: Quantized Training without Full-Precision Master Weights
by: Nikdan, Mahdi, et al.
Published: (2026)
by: Nikdan, Mahdi, et al.
Published: (2026)
Training-Free Vector Quantization via Gaussian VAEs
by: Xu, Tongda, et al.
Published: (2025)
by: Xu, Tongda, et al.
Published: (2025)
Training In-Context and In-Weights Mixtures Via Contrastive Context Sampling
by: Malu, Deeptanshu, et al.
Published: (2026)
by: Malu, Deeptanshu, et al.
Published: (2026)
PolarQuant: Optimal Gaussian Weight Quantization via Hadamard Rotation for LLM Compression
by: Vicentino, Caio
Published: (2026)
by: Vicentino, Caio
Published: (2026)
discretize_distributions: Efficient Quantization of Gaussian Mixtures with Guarantees in Wasserstein Distance
by: Adams, Steven, et al.
Published: (2025)
by: Adams, Steven, et al.
Published: (2025)
Symmetric Replay Training: Enhancing Sample Efficiency in Deep Reinforcement Learning for Combinatorial Optimization
by: Kim, Hyeonah, et al.
Published: (2023)
by: Kim, Hyeonah, et al.
Published: (2023)
MatGPTQ: Accurate and Efficient Post-Training Matryoshka Quantization
by: Kleinegger, Maximilian, et al.
Published: (2026)
by: Kleinegger, Maximilian, et al.
Published: (2026)
Exploring Quantization for Efficient Pre-Training of Transformer Language Models
by: Chitsaz, Kamran, et al.
Published: (2024)
by: Chitsaz, Kamran, et al.
Published: (2024)
Scalable Power Sampling: Unlocking Efficient, Training-Free Reasoning for LLMs via Distribution Sharpening
by: Ji, Xiaotong, et al.
Published: (2026)
by: Ji, Xiaotong, et al.
Published: (2026)
QuEST: Stable Training of LLMs with 1-Bit Weights and Activations
by: Panferov, Andrei, et al.
Published: (2025)
by: Panferov, Andrei, et al.
Published: (2025)
D$^2$Quant: Accurate Low-bit Post-Training Weight Quantization for LLMs
by: Yan, Xianglong, et al.
Published: (2026)
by: Yan, Xianglong, et al.
Published: (2026)
Transition Path Sampling with Improved Off-Policy Training of Diffusion Path Samplers
by: Seong, Kiyoung, et al.
Published: (2024)
by: Seong, Kiyoung, et al.
Published: (2024)
Coverage-Based Calibration for Post-Training Quantization via Weighted Set Cover over Outlier Channels
by: Shihab, Ibne Farabi, et al.
Published: (2026)
by: Shihab, Ibne Farabi, et al.
Published: (2026)
Quantize-then-Rectify: Efficient VQ-VAE Training
by: Zhang, Borui, et al.
Published: (2025)
by: Zhang, Borui, et al.
Published: (2025)
Pseudo-Quantized Actor-Critic Algorithm for Robustness to Noisy Temporal Difference Error
by: Kobayashi, Taisuke
Published: (2026)
by: Kobayashi, Taisuke
Published: (2026)
Is Temperature Sample Efficient for Softmax Gaussian Mixture of Experts?
by: Nguyen, Huy, et al.
Published: (2024)
by: Nguyen, Huy, et al.
Published: (2024)
Phys3DGS: Physically-based 3D Gaussian Splatting for Inverse Rendering
by: Choi, Euntae, et al.
Published: (2024)
by: Choi, Euntae, et al.
Published: (2024)
Robust Ultra Low-Bit Post-Training Quantization via Stable Diagonal Curvature Estimate
by: Kim, Jaemin, et al.
Published: (2026)
by: Kim, Jaemin, et al.
Published: (2026)
Similar Items
-
LAQuant: A Simple Overhead-free Large Reasoning Model Quantization by Layer-wise Lookahead Loss
by: Choi, Euntae, et al.
Published: (2026) -
NSNQuant: A Double Normalization Approach for Calibration-Free Low-Bit Vector Quantization of KV Cache
by: Son, Donghyun, et al.
Published: (2025) -
Rotate, Clip, and Partition: Towards W2A4KV4 Quantization by Integrating Rotation and Learnable Non-uniform Quantizer
by: Choi, Euntae, et al.
Published: (2025) -
Grouped Sequency-arranged Rotation: Optimizing Rotation Transformation for Quantization for Free
by: Choi, Euntae, et al.
Published: (2025) -
MetaMix: Meta-state Precision Searcher for Mixed-precision Activation Quantization
by: Kim, Han-Byul, et al.
Published: (2023)