Saved in:
| Main Author: | Chen, Xinye |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2504.14268 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Precision autotuning for linear solvers via contextual bandit-based RL
by: Carson, Erin, et al.
Published: (2026)
by: Carson, Erin, et al.
Published: (2026)
Pychop: Emulating Low-Precision Arithmetic in Numerical Methods and Neural Networks
by: Carson, Erin, et al.
Published: (2025)
by: Carson, Erin, et al.
Published: (2025)
A Metric Driven Approach to Mixed Precision Training
by: Rasquinha, Mitchelle, et al.
Published: (2024)
by: Rasquinha, Mitchelle, et al.
Published: (2024)
Data-Free Quantization via Mixed-Precision Compensation without Fine-Tuning
by: Chen, Jun, et al.
Published: (2023)
by: Chen, Jun, et al.
Published: (2023)
Balancing Fidelity and Plasticity: Aligning Mixed-Precision Fine-Tuning with Linguistic Hierarchies
by: Zhou, Changhai, et al.
Published: (2025)
by: Zhou, Changhai, et al.
Published: (2025)
MPX: Mixed Precision Training for JAX
by: Gräfe, Alexander, et al.
Published: (2025)
by: Gräfe, Alexander, et al.
Published: (2025)
OMPQ: Orthogonal Mixed Precision Quantization
by: Ma, Yuexiao, et al.
Published: (2021)
by: Ma, Yuexiao, et al.
Published: (2021)
Learning Sparse Approximate Inverse Preconditioners for Conjugate Gradient Solvers on GPUs
by: Yang, Zherui, et al.
Published: (2025)
by: Yang, Zherui, et al.
Published: (2025)
From Imitation to Refinement -- Residual RL for Precise Assembly
by: Ankile, Lars, et al.
Published: (2024)
by: Ankile, Lars, et al.
Published: (2024)
Enhancing Trustworthiness with Mixed Precision: Benchmarks, Opportunities, and Challenges
by: Lu, Guanxi, et al.
Published: (2025)
by: Lu, Guanxi, et al.
Published: (2025)
Accelerating Conjugate Gradient Solvers for Homogenization Problems with Unitary Neural Operators
by: Herb, Julius, et al.
Published: (2025)
by: Herb, Julius, et al.
Published: (2025)
STaMP: Sequence Transformation and Mixed Precision for Low-Precision Activation Quantization
by: Federici, Marco, et al.
Published: (2025)
by: Federici, Marco, et al.
Published: (2025)
Mixed-Precision Federated Learning via Multi-Precision Over-The-Air Aggregation
by: Yuan, Jinsheng, et al.
Published: (2024)
by: Yuan, Jinsheng, et al.
Published: (2024)
Value-Driven Mixed-Precision Quantization for Patch-Based Inference on Microcontrollers
by: Tao, Wei, et al.
Published: (2024)
by: Tao, Wei, et al.
Published: (2024)
TMPDiff: Temporal Mixed-Precision for Diffusion Models
by: Lewandowski, Basile, et al.
Published: (2026)
by: Lewandowski, Basile, et al.
Published: (2026)
OTARo: Once Tuning for All Precisions toward Robust On-Device LLMs
by: Chen, Shaoyuan, et al.
Published: (2025)
by: Chen, Shaoyuan, et al.
Published: (2025)
Deep Learning-Enhanced Preconditioning for Efficient Conjugate Gradient Solvers in Large-Scale PDE Systems
by: Li, Rui, et al.
Published: (2024)
by: Li, Rui, et al.
Published: (2024)
Progressive Mixed-Precision Decoding for Efficient LLM Inference
by: Chen, Hao Mark, et al.
Published: (2024)
by: Chen, Hao Mark, et al.
Published: (2024)
SliM-LLM: Salience-Driven Mixed-Precision Quantization for Large Language Models
by: Huang, Wei, et al.
Published: (2024)
by: Huang, Wei, et al.
Published: (2024)
AutoQRA: Joint Optimization of Mixed-Precision Quantization and Low-rank Adapters for Efficient LLM Fine-Tuning
by: Zhou, Changhai, et al.
Published: (2026)
by: Zhou, Changhai, et al.
Published: (2026)
AMPLE: Event-Driven Accelerator for Mixed-Precision Inference of Graph Neural Networks
by: Gimenes, Pedro, et al.
Published: (2025)
by: Gimenes, Pedro, et al.
Published: (2025)
Efficient Mixed Precision Quantization in Graph Neural Networks
by: Moustafa, Samir, et al.
Published: (2025)
by: Moustafa, Samir, et al.
Published: (2025)
MoPEQ: Mixture of Mixed Precision Quantized Experts
by: Chitty-Venkata, Krishna Teja, et al.
Published: (2025)
by: Chitty-Venkata, Krishna Teja, et al.
Published: (2025)
AMED: Automatic Mixed-Precision Quantization for Edge Devices
by: Kimhi, Moshe, et al.
Published: (2022)
by: Kimhi, Moshe, et al.
Published: (2022)
GR-RL: Going Dexterous and Precise for Long-Horizon Robotic Manipulation
by: Li, Yunfei, et al.
Published: (2025)
by: Li, Yunfei, et al.
Published: (2025)
MPQ-Diff: Mixed Precision Quantization for Diffusion Models
by: Maruzzelli, Rocco Manz, et al.
Published: (2024)
by: Maruzzelli, Rocco Manz, et al.
Published: (2024)
Enhancing Low-Precision Sampling via Stochastic Gradient Hamiltonian Monte Carlo
by: Wang, Ziyi, et al.
Published: (2023)
by: Wang, Ziyi, et al.
Published: (2023)
Neural Quantum States in Mixed Precision
by: Solinas, Massimo, et al.
Published: (2026)
by: Solinas, Massimo, et al.
Published: (2026)
Precision Adaptive Imputation Network : An Unified Technique for Mixed Datasets
by: Joshi, Harsh, et al.
Published: (2025)
by: Joshi, Harsh, et al.
Published: (2025)
Causal Discovery with Mixed Latent Confounding via Precision Decomposition
by: Asiaee, Amir, et al.
Published: (2025)
by: Asiaee, Amir, et al.
Published: (2025)
MARLIN: Mixed-Precision Auto-Regressive Parallel Inference on Large Language Models
by: Frantar, Elias, et al.
Published: (2024)
by: Frantar, Elias, et al.
Published: (2024)
Mixed-Precision Quantization for Language Models: Techniques and Prospects
by: Rakka, Mariam, et al.
Published: (2025)
by: Rakka, Mariam, et al.
Published: (2025)
MoR: Mixture Of Representations For Mixed-Precision Training
by: Su, Bor-Yiing, et al.
Published: (2025)
by: Su, Bor-Yiing, et al.
Published: (2025)
Guaranteed Approximation Bounds for Mixed-Precision Neural Operators
by: Tu, Renbo, et al.
Published: (2023)
by: Tu, Renbo, et al.
Published: (2023)
MixKVQ: Query-Aware Mixed-Precision KV Cache Quantization for Long-Context Reasoning
by: Zhang, Tao, et al.
Published: (2025)
by: Zhang, Tao, et al.
Published: (2025)
Addressing Performance Saturation for LLM RL via Precise Entropy Curve Control
by: Li, Bolian, et al.
Published: (2026)
by: Li, Bolian, et al.
Published: (2026)
KVmix: Gradient-Based Layer Importance-Aware Mixed-Precision Quantization for KV Cache
by: Li, Fei, et al.
Published: (2025)
by: Li, Fei, et al.
Published: (2025)
Flexible Mixed Precision Quantization for Learned Image Compression
by: Hossain, Md Adnan Faisal, et al.
Published: (2025)
by: Hossain, Md Adnan Faisal, et al.
Published: (2025)
InfoQ: Mixed-Precision Quantization via Global Information Flow
by: Akbulut, Mehmet Emre, et al.
Published: (2025)
by: Akbulut, Mehmet Emre, et al.
Published: (2025)
APreQEL: Adaptive Mixed Precision Quantization For Edge LLMs
by: Bouzouad, Meriem, et al.
Published: (2026)
by: Bouzouad, Meriem, et al.
Published: (2026)
Similar Items
-
Precision autotuning for linear solvers via contextual bandit-based RL
by: Carson, Erin, et al.
Published: (2026) -
Pychop: Emulating Low-Precision Arithmetic in Numerical Methods and Neural Networks
by: Carson, Erin, et al.
Published: (2025) -
A Metric Driven Approach to Mixed Precision Training
by: Rasquinha, Mitchelle, et al.
Published: (2024) -
Data-Free Quantization via Mixed-Precision Compensation without Fine-Tuning
by: Chen, Jun, et al.
Published: (2023) -
Balancing Fidelity and Plasticity: Aligning Mixed-Precision Fine-Tuning with Linguistic Hierarchies
by: Zhou, Changhai, et al.
Published: (2025)