:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Author:	Chen, Xinye
Format:	Preprint
Published:	2025
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2504.14268
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Precision autotuning for linear solvers via contextual bandit-based RL
by: Carson, Erin, et al.
Published: (2026)

Pychop: Emulating Low-Precision Arithmetic in Numerical Methods and Neural Networks
by: Carson, Erin, et al.
Published: (2025)

A Metric Driven Approach to Mixed Precision Training
by: Rasquinha, Mitchelle, et al.
Published: (2024)

Data-Free Quantization via Mixed-Precision Compensation without Fine-Tuning
by: Chen, Jun, et al.
Published: (2023)

Balancing Fidelity and Plasticity: Aligning Mixed-Precision Fine-Tuning with Linguistic Hierarchies
by: Zhou, Changhai, et al.
Published: (2025)

MPX: Mixed Precision Training for JAX
by: Gräfe, Alexander, et al.
Published: (2025)

OMPQ: Orthogonal Mixed Precision Quantization
by: Ma, Yuexiao, et al.
Published: (2021)

Learning Sparse Approximate Inverse Preconditioners for Conjugate Gradient Solvers on GPUs
by: Yang, Zherui, et al.
Published: (2025)

From Imitation to Refinement -- Residual RL for Precise Assembly
by: Ankile, Lars, et al.
Published: (2024)

Enhancing Trustworthiness with Mixed Precision: Benchmarks, Opportunities, and Challenges
by: Lu, Guanxi, et al.
Published: (2025)

Accelerating Conjugate Gradient Solvers for Homogenization Problems with Unitary Neural Operators
by: Herb, Julius, et al.
Published: (2025)

STaMP: Sequence Transformation and Mixed Precision for Low-Precision Activation Quantization
by: Federici, Marco, et al.
Published: (2025)

Mixed-Precision Federated Learning via Multi-Precision Over-The-Air Aggregation
by: Yuan, Jinsheng, et al.
Published: (2024)

Value-Driven Mixed-Precision Quantization for Patch-Based Inference on Microcontrollers
by: Tao, Wei, et al.
Published: (2024)

TMPDiff: Temporal Mixed-Precision for Diffusion Models
by: Lewandowski, Basile, et al.
Published: (2026)

OTARo: Once Tuning for All Precisions toward Robust On-Device LLMs
by: Chen, Shaoyuan, et al.
Published: (2025)

Deep Learning-Enhanced Preconditioning for Efficient Conjugate Gradient Solvers in Large-Scale PDE Systems
by: Li, Rui, et al.
Published: (2024)

Progressive Mixed-Precision Decoding for Efficient LLM Inference
by: Chen, Hao Mark, et al.
Published: (2024)

SliM-LLM: Salience-Driven Mixed-Precision Quantization for Large Language Models
by: Huang, Wei, et al.
Published: (2024)

AutoQRA: Joint Optimization of Mixed-Precision Quantization and Low-rank Adapters for Efficient LLM Fine-Tuning
by: Zhou, Changhai, et al.
Published: (2026)

AMPLE: Event-Driven Accelerator for Mixed-Precision Inference of Graph Neural Networks
by: Gimenes, Pedro, et al.
Published: (2025)

Efficient Mixed Precision Quantization in Graph Neural Networks
by: Moustafa, Samir, et al.
Published: (2025)

MoPEQ: Mixture of Mixed Precision Quantized Experts
by: Chitty-Venkata, Krishna Teja, et al.
Published: (2025)

AMED: Automatic Mixed-Precision Quantization for Edge Devices
by: Kimhi, Moshe, et al.
Published: (2022)

GR-RL: Going Dexterous and Precise for Long-Horizon Robotic Manipulation
by: Li, Yunfei, et al.
Published: (2025)

MPQ-Diff: Mixed Precision Quantization for Diffusion Models
by: Maruzzelli, Rocco Manz, et al.
Published: (2024)

Enhancing Low-Precision Sampling via Stochastic Gradient Hamiltonian Monte Carlo
by: Wang, Ziyi, et al.
Published: (2023)

Neural Quantum States in Mixed Precision
by: Solinas, Massimo, et al.
Published: (2026)

Precision Adaptive Imputation Network : An Unified Technique for Mixed Datasets
by: Joshi, Harsh, et al.
Published: (2025)

Causal Discovery with Mixed Latent Confounding via Precision Decomposition
by: Asiaee, Amir, et al.
Published: (2025)

MARLIN: Mixed-Precision Auto-Regressive Parallel Inference on Large Language Models
by: Frantar, Elias, et al.
Published: (2024)

Mixed-Precision Quantization for Language Models: Techniques and Prospects
by: Rakka, Mariam, et al.
Published: (2025)

MoR: Mixture Of Representations For Mixed-Precision Training
by: Su, Bor-Yiing, et al.
Published: (2025)

Guaranteed Approximation Bounds for Mixed-Precision Neural Operators
by: Tu, Renbo, et al.
Published: (2023)

MixKVQ: Query-Aware Mixed-Precision KV Cache Quantization for Long-Context Reasoning
by: Zhang, Tao, et al.
Published: (2025)

Addressing Performance Saturation for LLM RL via Precise Entropy Curve Control
by: Li, Bolian, et al.
Published: (2026)

KVmix: Gradient-Based Layer Importance-Aware Mixed-Precision Quantization for KV Cache
by: Li, Fei, et al.
Published: (2025)

Flexible Mixed Precision Quantization for Learned Image Compression
by: Hossain, Md Adnan Faisal, et al.
Published: (2025)

InfoQ: Mixed-Precision Quantization via Global Information Flow
by: Akbulut, Mehmet Emre, et al.
Published: (2025)

APreQEL: Adaptive Mixed Precision Quantization For Edge LLMs
by: Bouzouad, Meriem, et al.
Published: (2026)