:: Library Catalog

Bibliographic Details
Main Authors:	Song, Qingyu, Liu, Rui, Lin, Wei, Liao, Peiyu, Zhao, Wenqian, Wang, Yiwen, Hu, Shoubo, Jiang, Yining, Long, Mochun, Zhen, Hui-Ling, Jiang, Ning, Yuan, Mingxuan, Xiang, Qiao, Xu, Hong
Format:	Preprint
Published:	2025
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2505.15030

Similar Items

Innovation Discovery System for Networking Research
by: Zhang, Mengrui, et al.
Published: (2026)

ReThinker: Scientific Reasoning by Rethinking with Guided Reflection and Confidence Control
by: Tang, Zhentao, et al.
Published: (2026)

One-Token Rollout: Guiding Supervised Fine-Tuning of LLMs with Policy Gradient
by: Ming, Rui, et al.
Published: (2025)

The Regulation of Local Li⁺ Coordination Environment for High‐Performance Quasi‐Solid‐State Polymer Electrolyte
by: Fan Yang, et al.
Published: (2025)

Inducing Systematicity in Transformers by Attending to Structurally Quantized Embeddings
by: Jiang, Yichen, et al.
Published: (2024)

ERVQ: Enhanced Residual Vector Quantization with Intra-and-Inter-Codebook Optimization for Neural Audio Codecs
by: Zheng, Rui-Chen, et al.
Published: (2024)

MixPE: Quantization and Hardware Co-design for Efficient LLM Inference
by: Zhang, Yu, et al.
Published: (2024)

A Streamable Neural Audio Codec with Residual Scalar-Vector Quantization for Real-Time Communication
by: Jiang, Xiao-Hang, et al.
Published: (2025)

MOSS: Efficient and Accurate FP8 LLM Training with Microscaling and Automatic Scaling
by: Zhang, Yu, et al.
Published: (2025)

QFT: Quantized Full-parameter Tuning of LLMs with Affordable Resources
by: Li, Zhikai, et al.
Published: (2023)

Nanoconjugate Improves Cognitive Deficit and Limits the Pathogenic Tau Burden in Okadaic‐Acid‐Induced Alzheimer's Mice
by: Qiuju Liang, et al.
Published: (2025)

SVDq: 1.25-bit and 410x Key Cache Compression for LLM Attention
by: Yankun, Hong, et al.
Published: (2025)

Greening the Grid: Electricity Market Clearing with Consumer-Based Carbon Cost
by: Jiang, Wenqian, et al.
Published: (2025)

Intelligent Icing Detection Model of Wind Turbine Blades Based on SCADA data
by: Jiang, Wenqian, et al.
Published: (2021)

Resource-Efficient Teleportation of High-Dimensional Quantum Coherence via Initial Phase Engineering
by: Huang, Long, et al.
Published: (2026)

Hypercrosslinked porous and coordination polymer materials for electrolyte membranes in lithium‐metal batteries
by: Mochun Zhang, et al.
Published: (2024)

KVTuner: Sensitivity-Aware Layer-Wise Mixed-Precision KV Cache Quantization for Efficient and Nearly Lossless LLM Inference
by: Li, Xing, et al.
Published: (2025)

BetterV: Controlled Verilog Generation with Discriminative Guidance
by: Pei, Zehua, et al.
Published: (2024)

End-to-End On-Device Quantization-Aware Training for LLMs at Inference Cost
by: Tan, Qitao, et al.
Published: (2025)

Awakening of A Blazar at Redshift 2.7 Temporally Coincident with Arrival of Cospatial Neutrino Event IceCube-201221A
by: Jiang, Xiong, et al.
Published: (2024)

Certifying Language Model Robustness with Fuzzed Randomized Smoothing: An Efficient Defense Against Backdoor Attacks
by: He, Bowei, et al.
Published: (2025)

Sharp upper bounds on the $A_α$-spectral radius of graphs
by: Hong, Zhen-Mu, et al.
Published: (2026)

On the disjunctive domination numbers of the torus grid graphs
by: Qiao, Zhi, et al.
Published: (2026)

PQD: Post-training Quantization for Efficient Diffusion Models
by: Ye, Jiaojiao, et al.
Published: (2024)

Cell‐Specific Control of Mammalian Gene Expression Using DNA Repair Inducible Ribozyme Switches
by: Jieling Hong, et al.
Published: (2024)

Embers of Active Galactic Nuclei: Tidal Disruption Events and Quasiperiodic Eruptions
by: Jiang, Ning, et al.
Published: (2025)

Empirical Guidelines for Deploying LLMs onto Resource-constrained Edge Devices
by: Qin, Ruiyang, et al.
Published: (2024)

Quantization Meets dLLMs: A Systematic Study of Post-training Quantization for Diffusion LLMs
by: Lin, Haokun, et al.
Published: (2025)

Beyond Speedup -- Utilizing KV Cache for Sampling and Reasoning
by: Xing, Zeyu, et al.
Published: (2026)

MD-AirComp+: Adaptive Quantization for Blind Massive Digital Over-the-Air Computation
by: Qiao, Li, et al.
Published: (2026)

Uncovering Cross-Objective Interference in Multi-Objective Alignment
by: Lu, Yining, et al.
Published: (2026)

Systematic study of capture thresholds with time dependent Hartree-Fock theory
by: Yao, Hong, et al.
Published: (2024)

Analytical Heterogeneous Die-to-Die 3D Placement with Macros
by: Zhao, Yuxuan, et al.
Published: (2024)

Evolution of Optimization Algorithms for Global Placement via Large Language Models
by: Yao, Xufeng, et al.
Published: (2025)

Consumer-based Carbon Costs: Integrating Consumer Carbon Preferences in Electricity Markets
by: Jiang, Wenqian, et al.
Published: (2025)

DemoTuner: Automatic Performance Tuning for Database Management Systems Based on Demonstration Reinforcement Learning
by: Dou, Hui, et al.
Published: (2025)

PSD: Pushing the Pareto Frontier of Diffusion LLMs via Parallel Speculative Decoding
by: Sun, Shengyin, et al.
Published: (2026)

HBLLM: Wavelet-Enhanced High-Fidelity 1-Bit Quantization for LLMs
by: Chen, Ningning, et al.
Published: (2025)

What Matters For Safety Alignment?
by: Li, Xing, et al.
Published: (2026)