Saved in:
| Main Authors: | Zhou, Changhai, Zhang, Shiyang, Zhou, Yuhua, Qiao, Qian, Gao, Jun, Jin, Cheng, Qin, Kaizhou, Zhang, Weizhong |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.22268 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Balancing Fidelity and Plasticity: Aligning Mixed-Precision Fine-Tuning with Linguistic Hierarchies
by: Zhou, Changhai, et al.
Published: (2025)
by: Zhou, Changhai, et al.
Published: (2025)
AutoMixQ: Self-Adjusting Quantization for High Performance Memory-Efficient Fine-Tuning
by: Zhou, Changhai, et al.
Published: (2024)
by: Zhou, Changhai, et al.
Published: (2024)
Large Language Model Compression with Global Rank and Sparsity Optimization
by: Zhou, Changhai, et al.
Published: (2025)
by: Zhou, Changhai, et al.
Published: (2025)
CoRE: A Fine-Grained Code Reasoning Benchmark Beyond Output Prediction
by: Gao, Jun, et al.
Published: (2026)
by: Gao, Jun, et al.
Published: (2026)
QPruner: Probabilistic Decision Quantization for Structured Pruning in Large Language Models
by: Zhou, Changhai, et al.
Published: (2024)
by: Zhou, Changhai, et al.
Published: (2024)
RankAdaptor: Hierarchical Rank Allocation for Efficient Fine-Tuning Pruned LLMs via Performance Model
by: Zhou, Changhai, et al.
Published: (2024)
by: Zhou, Changhai, et al.
Published: (2024)
Data-Free Quantization via Mixed-Precision Compensation without Fine-Tuning
by: Chen, Jun, et al.
Published: (2023)
by: Chen, Jun, et al.
Published: (2023)
Exploring Training on Heterogeneous Data with Mixture of Low-rank Adapters
by: Zhou, Yuhang, et al.
Published: (2024)
by: Zhou, Yuhang, et al.
Published: (2024)
Scalable Variational Bayesian Fine-Tuning of LLMs via Orthogonalized Low-Rank Adapters
by: Xiang, Haotian, et al.
Published: (2026)
by: Xiang, Haotian, et al.
Published: (2026)
Spectral Adapter: Fine-Tuning in Spectral Space
by: Zhang, Fangzhao, et al.
Published: (2024)
by: Zhang, Fangzhao, et al.
Published: (2024)
Pear: Pruning and Sharing Adapters in Visual Parameter-Efficient Fine-Tuning
by: Zhong, Yibo, et al.
Published: (2024)
by: Zhong, Yibo, et al.
Published: (2024)
ExpertWeave: Efficiently Serving Expert-Specialized Fine-Tuned Adapters at Scale
by: Shi, Ge, et al.
Published: (2025)
by: Shi, Ge, et al.
Published: (2025)
Safeti QRA- more than just compliance
by: Cavanagh, Nic
Published: (2007)
by: Cavanagh, Nic
Published: (2007)
FineQ: Software-Hardware Co-Design for Low-Bit Fine-Grained Mixed-Precision Quantization of LLMs
by: Xie, Xilong, et al.
Published: (2025)
by: Xie, Xilong, et al.
Published: (2025)
SliM-LLM: Salience-Driven Mixed-Precision Quantization for Large Language Models
by: Huang, Wei, et al.
Published: (2024)
by: Huang, Wei, et al.
Published: (2024)
FGMP: Fine-Grained Mixed-Precision Weight and Activation Quantization for Hardware-Accelerated LLM Inference
by: Hooper, Coleman, et al.
Published: (2025)
by: Hooper, Coleman, et al.
Published: (2025)
MELoRA: Mini-Ensemble Low-Rank Adapters for Parameter-Efficient Fine-Tuning
by: Ren, Pengjie, et al.
Published: (2024)
by: Ren, Pengjie, et al.
Published: (2024)
SMoA: Spectrum Modulation Adapter for Parameter-Efficient Fine-Tuning
by: Liu, Yongkang, et al.
Published: (2026)
by: Liu, Yongkang, et al.
Published: (2026)
EfficientDM: Efficient Quantization-Aware Fine-Tuning of Low-Bit Diffusion Models
by: He, Yefei, et al.
Published: (2023)
by: He, Yefei, et al.
Published: (2023)
Iterative Low-rank Network for Hyperspectral Image Denoising
by: Ye, Jin, et al.
Published: (2025)
by: Ye, Jin, et al.
Published: (2025)
On-Chip Hardware-Aware Quantization for Mixed Precision Neural Networks
by: Huang, Wei, et al.
Published: (2023)
by: Huang, Wei, et al.
Published: (2023)
MPQ-DM: Mixed Precision Quantization for Extremely Low Bit Diffusion Models
by: Feng, Weilun, et al.
Published: (2024)
by: Feng, Weilun, et al.
Published: (2024)
Parameter-Efficient Fine-Tuning With Adapters
by: Chen, Keyu, et al.
Published: (2024)
by: Chen, Keyu, et al.
Published: (2024)
Low-rank Attention Side-Tuning for Parameter-Efficient Fine-Tuning
by: Tang, Ningyuan, et al.
Published: (2024)
by: Tang, Ningyuan, et al.
Published: (2024)
Cocktail: Chunk-Adaptive Mixed-Precision Quantization for Long-Context LLM Inference
by: Tao, Wei, et al.
Published: (2025)
by: Tao, Wei, et al.
Published: (2025)
Mixture of Physical Priors Adapter for Parameter-Efficient Fine-Tuning
by: Wang, Zhaozhi, et al.
Published: (2024)
by: Wang, Zhaozhi, et al.
Published: (2024)
A Coarse-to-Fine Human Pose Estimation Method based on Two-stage Distillation and Progressive Graph Neural Network
by: Ji, Zhangjian, et al.
Published: (2025)
by: Ji, Zhangjian, et al.
Published: (2025)
SONIQ: System-Optimized Noise-Injected Ultra-Low-Precision Quantization with Full-Precision Parity
by: Zhou, Cyrus, et al.
Published: (2023)
by: Zhou, Cyrus, et al.
Published: (2023)
Mixed-Precision Graph Neural Quantization for Low Bit Large Language Models
by: Liu, Wanlong, et al.
Published: (2025)
by: Liu, Wanlong, et al.
Published: (2025)
AutoFT: Learning an Objective for Robust Fine-Tuning
by: Choi, Caroline, et al.
Published: (2024)
by: Choi, Caroline, et al.
Published: (2024)
ProjQ: Project-and-Quantize for Adapter-Aware LLM Compression
by: Yu, Wneya, et al.
Published: (2026)
by: Yu, Wneya, et al.
Published: (2026)
OMPQ: Orthogonal Mixed Precision Quantization
by: Ma, Yuexiao, et al.
Published: (2021)
by: Ma, Yuexiao, et al.
Published: (2021)
STaMP: Sequence Transformation and Mixed Precision for Low-Precision Activation Quantization
by: Federici, Marco, et al.
Published: (2025)
by: Federici, Marco, et al.
Published: (2025)
Noni Fruit Polysaccharide Derivatives: Preparation, Structural Characterization, and Antioxidant Activities Evaluation
by: Shiyang Zhou, et al.
Published: (2026)
by: Shiyang Zhou, et al.
Published: (2026)
Low-rank Optimization Trajectories Modeling for LLM RLVR Acceleration
by: Chen, Zhipeng, et al.
Published: (2026)
by: Chen, Zhipeng, et al.
Published: (2026)
MoQAE: Mixed-Precision Quantization for Long-Context LLM Inference via Mixture of Quantization-Aware Experts
by: Tao, Wei, et al.
Published: (2025)
by: Tao, Wei, et al.
Published: (2025)
AutoPEFT: Automatic Configuration Search for Parameter-Efficient Fine-Tuning
by: Zhou, Han, et al.
Published: (2023)
by: Zhou, Han, et al.
Published: (2023)
Symbiosis: Multi-Adapter Inference and Fine-Tuning
by: Gupta, Saransh, et al.
Published: (2025)
by: Gupta, Saransh, et al.
Published: (2025)
QEFT: Quantization for Efficient Fine-Tuning of LLMs
by: Lee, Changhun, et al.
Published: (2024)
by: Lee, Changhun, et al.
Published: (2024)
TAGCOS: Task-agnostic Gradient Clustered Coreset Selection for Instruction Tuning Data
by: Zhang, Jipeng, et al.
Published: (2024)
by: Zhang, Jipeng, et al.
Published: (2024)
Similar Items
-
Balancing Fidelity and Plasticity: Aligning Mixed-Precision Fine-Tuning with Linguistic Hierarchies
by: Zhou, Changhai, et al.
Published: (2025) -
AutoMixQ: Self-Adjusting Quantization for High Performance Memory-Efficient Fine-Tuning
by: Zhou, Changhai, et al.
Published: (2024) -
Large Language Model Compression with Global Rank and Sparsity Optimization
by: Zhou, Changhai, et al.
Published: (2025) -
CoRE: A Fine-Grained Code Reasoning Benchmark Beyond Output Prediction
by: Gao, Jun, et al.
Published: (2026) -
QPruner: Probabilistic Decision Quantization for Structured Pruning in Large Language Models
by: Zhou, Changhai, et al.
Published: (2024)