| Main Authors: | Catalan-Tatjer, Albert; Ajroldi, Niccolò; Geiping, Jonas |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2510.06213 |
Similar Items
When, Where and Why to Average Weights?
by: Ajroldi, Niccolò, et al.
Published: (2025)
Can you Finetune your Binoculars? Embedding Text Watermarks into the Weights of Large Language Models
by: Elhassan, Fay, et al.
Published: (2025)
Layer Collapse in Diffusion Language Models
by: Conzelmann, Alexander, et al.
Published: (2026)
Enhancing Optimizer Stability: Momentum Adaptation of The NGN Step-size
by: Islamov, Rustem, et al.
Published: (2025)
Loss Landscape Characterization of Neural Networks without Over-Parametrization
by: Islamov, Rustem, et al.
Published: (2024)
Training Data Reconstruction: Privacy due to Uncertainty?
by: Runkel, Christina, et al.
Published: (2024)
Assessing the Potential for Catastrophic Failure in Dynamic Post-Training Quantization
by: Frank, Logan, et al.
Published: (2025)
Fine, I'll Merge It Myself: A Multi-Fidelity Framework for Automated Model Merging
by: Su, Guinan, et al.
Published: (2025)
SKIM: Any-bit Quantization Pushing The Limits of Post-Training Quantization
by: Bai, Runsheng, et al.
Published: (2024)
Efficiently Dispatching Flash Attention For Partially Filled Attention Masks
by: Sharma, Agniv, et al.
Published: (2024)
Data Generation for Hardware-Friendly Post-Training Quantization
by: Dikstein, Lior, et al.
Published: (2024)
Robust Ultra Low-Bit Post-Training Quantization via Stable Diagonal Curvature Estimate
by: Kim, Jaemin, et al.
Published: (2026)
Quantization Error Propagation: Revisiting Layer-Wise Post-Training Quantization
by: Arai, Yamato, et al.
Published: (2025)
Deriving Hyperparameter Scaling Laws via Modern Optimization Theory
by: Shulgin, Egor, et al.
Published: (2026)
Astro: Activation-guided Structured Regularization for Outlier-Robust LLM Post-Training Quantization
by: Chen, Xi, et al.
Published: (2026)
Improving Quantization with Post-Training Model Expansion
by: Franco, Giuseppe, et al.
Published: (2025)
Understanding the Difficulty of Low-Precision Post-Training Quantization for LLMs
by: Xu, Zifei, et al.
Published: (2024)
MatGPTQ: Accurate and Efficient Post-Training Matryoshka Quantization
by: Kleinegger, Maximilian, et al.
Published: (2026)
Beacon: Post-Training Quantization with Integrated Grid Selection
by: Zhang, Shihao, et al.
Published: (2025)
Pushing the Limits of Block Rotations in Post-Training Quantization
by: Sanjeet, Sai, et al.
Published: (2026)
A Quantized VAE-MLP Botnet Detection Model: A Systematic Evaluation of Quantization-Aware Training and Post-Training Quantization Strategies
by: Wasswa, Hassan, et al.
Published: (2025)
Layer-Wise High-Impact Parameter Ratio Optimization in Post-Training Quantization for Large Language Models
by: Pham, Cuong, et al.
Published: (2025)
FTFT: Efficient and Robust Fine-Tuning by Transferring Training Dynamics
by: Du, Yupei, et al.
Published: (2023)
Efficient Parallel Samplers for Recurrent-Depth Models and Their Connection to Diffusion Language Models
by: Geiping, Jonas, et al.
Published: (2025)
Is your batch size the problem? Revisiting the Adam-SGD gap in language modeling
by: Srećković, Teodora, et al.
Published: (2025)
Robust Training of Vector Quantized Bottleneck Models
by: Łańcucki, Adrian, et al.
Published: (2020)
CLAQ: Pushing the Limits of Low-Bit Post-Training Quantization for LLMs
by: Wang, Haoyu, et al.
Published: (2024)
Scaling Laws for Post Training Quantized Large Language Models
by: Xu, Zifei, et al.
Published: (2024)
Activation Sensitivity as a Unifying Principle for Post-Training Quantization
by: Xu, Bruce Changlong
Published: (2026)
Post Training Quantization of Large Language Models with Microscaling Formats
by: Sharify, Sayeh, et al.
Published: (2024)
ADMM-Q: An Improved Hessian-based Weight Quantizer for Post-Training Quantization of Large Language Models
by: Lucas, Ryan, et al.
Published: (2026)
Rethinking Post-Training Quantization: Introducing a Statistical Pre-Calibration Approach
by: Ghaffari, Alireza, et al.
Published: (2025)
Adaptive Layer-Wise Transformations for Post-Training Quantization of Large Language Models
by: Pham, Cuong, et al.
Published: (2025)
Achieving binary weight and activation for LLMs using Post-Training Quantization
by: Song, Siqing, et al.
Published: (2025)
PTQTP: Post-Training Quantization to Trit-Planes for Large Language Models
by: Xiao, He, et al.
Published: (2025)
Bits for Privacy: Evaluating Post-Training Quantization via Membership Inference
by: Zhang, Chenxiang, et al.
Published: (2025)
DAQ: Density-Aware Post-Training Weight-Only Quantization For LLMs
by: Luo, Yingsong, et al.
Published: (2024)
MagR: Weight Magnitude Reduction for Enhancing Post-Training Quantization
by: Zhang, Aozhong, et al.
Published: (2024)
Towards Next-Level Post-Training Quantization of Hyper-Scale Transformers
by: Kim, Junhan, et al.
Published: (2024)
DAQ: Delta-Aware Quantization for Post-Training LLM Weight Compression
by: Yu, Xiaoming, et al.
Published: (2026)