Saved in:
| Main Authors: | Song, Qingyu, Liu, Rui, Lin, Wei, Liao, Peiyu, Zhao, Wenqian, Wang, Yiwen, Hu, Shoubo, Jiang, Yining, Long, Mochun, Zhen, Hui-Ling, Jiang, Ning, Yuan, Mingxuan, Xiang, Qiao, Xu, Hong |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2505.15030 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Innovation Discovery System for Networking Research
by: Zhang, Mengrui, et al.
Published: (2026)
by: Zhang, Mengrui, et al.
Published: (2026)
ReThinker: Scientific Reasoning by Rethinking with Guided Reflection and Confidence Control
by: Tang, Zhentao, et al.
Published: (2026)
by: Tang, Zhentao, et al.
Published: (2026)
One-Token Rollout: Guiding Supervised Fine-Tuning of LLMs with Policy Gradient
by: Ming, Rui, et al.
Published: (2025)
by: Ming, Rui, et al.
Published: (2025)
The Regulation of Local Li + Coordination Environment for High‐Performance Quasi‐Solid‐State Polymer Electrolyte
by: Fan Yang, et al.
Published: (2025)
by: Fan Yang, et al.
Published: (2025)
Inducing Systematicity in Transformers by Attending to Structurally Quantized Embeddings
by: Jiang, Yichen, et al.
Published: (2024)
by: Jiang, Yichen, et al.
Published: (2024)
ERVQ: Enhanced Residual Vector Quantization with Intra-and-Inter-Codebook Optimization for Neural Audio Codecs
by: Zheng, Rui-Chen, et al.
Published: (2024)
by: Zheng, Rui-Chen, et al.
Published: (2024)
MixPE: Quantization and Hardware Co-design for Efficient LLM Inference
by: Zhang, Yu, et al.
Published: (2024)
by: Zhang, Yu, et al.
Published: (2024)
A Streamable Neural Audio Codec with Residual Scalar-Vector Quantization for Real-Time Communication
by: Jiang, Xiao-Hang, et al.
Published: (2025)
by: Jiang, Xiao-Hang, et al.
Published: (2025)
MOSS: Efficient and Accurate FP8 LLM Training with Microscaling and Automatic Scaling
by: Zhang, Yu, et al.
Published: (2025)
by: Zhang, Yu, et al.
Published: (2025)
QFT: Quantized Full-parameter Tuning of LLMs with Affordable Resources
by: Li, Zhikai, et al.
Published: (2023)
by: Li, Zhikai, et al.
Published: (2023)
Nanoconjugate Improves Cognitive Deficit and Limits the Pathogenic Tau Burden in Okadaic‐Acid‐Induced Alzheimer's Mice
by: Qiuju Liang, et al.
Published: (2025)
by: Qiuju Liang, et al.
Published: (2025)
SVDq: 1.25-bit and 410x Key Cache Compression for LLM Attention
by: Yankun, Hong, et al.
Published: (2025)
by: Yankun, Hong, et al.
Published: (2025)
Greening the Grid: Electricity Market Clearing with Consumer-Based Carbon Cost
by: Jiang, Wenqian, et al.
Published: (2025)
by: Jiang, Wenqian, et al.
Published: (2025)
Intelligent Icing Detection Model of Wind Turbine Blades Based on SCADA data
by: Jiang, Wenqian, et al.
Published: (2021)
by: Jiang, Wenqian, et al.
Published: (2021)
Resource-Efficient Teleportation of High-Dimensional Quantum Coherence via Initial Phase Engineering
by: Huang, Long, et al.
Published: (2026)
by: Huang, Long, et al.
Published: (2026)
Hypercrosslinked porous and coordination polymer materials for electrolyte membranes in lithium‐metal batteries
by: Mochun Zhang, et al.
Published: (2024)
by: Mochun Zhang, et al.
Published: (2024)
KVTuner: Sensitivity-Aware Layer-Wise Mixed-Precision KV Cache Quantization for Efficient and Nearly Lossless LLM Inference
by: Li, Xing, et al.
Published: (2025)
by: Li, Xing, et al.
Published: (2025)
BetterV: Controlled Verilog Generation with Discriminative Guidance
by: Pei, Zehua, et al.
Published: (2024)
by: Pei, Zehua, et al.
Published: (2024)
End-to-End On-Device Quantization-Aware Training for LLMs at Inference Cost
by: Tan, Qitao, et al.
Published: (2025)
by: Tan, Qitao, et al.
Published: (2025)
Awakening of A Blazar at Redshift 2.7 Temporally Coincident with Arrival of Cospatial Neutrino Event IceCube-201221A
by: Jiang, Xiong, et al.
Published: (2024)
by: Jiang, Xiong, et al.
Published: (2024)
Certifying Language Model Robustness with Fuzzed Randomized Smoothing: An Efficient Defense Against Backdoor Attacks
by: He, Bowei, et al.
Published: (2025)
by: He, Bowei, et al.
Published: (2025)
Sharp upper bounds on the $A_α$-spectral radius of graphs
by: Hong, Zhen-Mu, et al.
Published: (2026)
by: Hong, Zhen-Mu, et al.
Published: (2026)
On the disjunctive domination numbers of the torus grid graphs
by: Qiao, Zhi, et al.
Published: (2026)
by: Qiao, Zhi, et al.
Published: (2026)
PQD: Post-training Quantization for Efficient Diffusion Models
by: Ye, Jiaojiao, et al.
Published: (2024)
by: Ye, Jiaojiao, et al.
Published: (2024)
Cell‐Specific Control of Mammalian Gene Expression Using DNA Repair Inducible Ribozyme Switches
by: Jieling Hong, et al.
Published: (2024)
by: Jieling Hong, et al.
Published: (2024)
Cell‐Specific Control of Mammalian Gene Expression Using DNA Repair Inducible Ribozyme Switches
by: Jieling Hong, et al.
Published: (2024)
by: Jieling Hong, et al.
Published: (2024)
Embers of Active Galactic Nuclei: Tidal Disruption Events and Quasiperiodic Eruptions
by: Jiang, Ning, et al.
Published: (2025)
by: Jiang, Ning, et al.
Published: (2025)
Empirical Guidelines for Deploying LLMs onto Resource-constrained Edge Devices
by: Qin, Ruiyang, et al.
Published: (2024)
by: Qin, Ruiyang, et al.
Published: (2024)
Quantization Meets dLLMs: A Systematic Study of Post-training Quantization for Diffusion LLMs
by: Lin, Haokun, et al.
Published: (2025)
by: Lin, Haokun, et al.
Published: (2025)
Beyond Speedup -- Utilizing KV Cache for Sampling and Reasoning
by: Xing, Zeyu, et al.
Published: (2026)
by: Xing, Zeyu, et al.
Published: (2026)
MD-AirComp+: Adaptive Quantization for Blind Massive Digital Over-the-Air Computation
by: Qiao, Li, et al.
Published: (2026)
by: Qiao, Li, et al.
Published: (2026)
Uncovering Cross-Objective Interference in Multi-Objective Alignment
by: Lu, Yining, et al.
Published: (2026)
by: Lu, Yining, et al.
Published: (2026)
Systematic study of capture thresholds with time dependent Hartree-Fock theory
by: Yao, Hong, et al.
Published: (2024)
by: Yao, Hong, et al.
Published: (2024)
Analytical Heterogeneous Die-to-Die 3D Placement with Macros
by: Zhao, Yuxuan, et al.
Published: (2024)
by: Zhao, Yuxuan, et al.
Published: (2024)
Evolution of Optimization Algorithms for Global Placement via Large Language Models
by: Yao, Xufeng, et al.
Published: (2025)
by: Yao, Xufeng, et al.
Published: (2025)
Consumer-based Carbon Costs: Integrating Consumer Carbon Preferences in Electricity Markets
by: Jiang, Wenqian, et al.
Published: (2025)
by: Jiang, Wenqian, et al.
Published: (2025)
DemoTuner: Automatic Performance Tuning for Database Management Systems Based on Demonstration Reinforcement Learning
by: Dou, Hui, et al.
Published: (2025)
by: Dou, Hui, et al.
Published: (2025)
PSD: Pushing the Pareto Frontier of Diffusion LLMs via Parallel Speculative Decoding
by: Sun, Shengyin, et al.
Published: (2026)
by: Sun, Shengyin, et al.
Published: (2026)
HBLLM: Wavelet-Enhanced High-Fidelity 1-Bit Quantization for LLMs
by: Chen, Ningning, et al.
Published: (2025)
by: Chen, Ningning, et al.
Published: (2025)
What Matters For Safety Alignment?
by: Li, Xing, et al.
Published: (2026)
by: Li, Xing, et al.
Published: (2026)
Similar Items
-
Innovation Discovery System for Networking Research
by: Zhang, Mengrui, et al.
Published: (2026) -
ReThinker: Scientific Reasoning by Rethinking with Guided Reflection and Confidence Control
by: Tang, Zhentao, et al.
Published: (2026) -
One-Token Rollout: Guiding Supervised Fine-Tuning of LLMs with Policy Gradient
by: Ming, Rui, et al.
Published: (2025) -
The Regulation of Local Li + Coordination Environment for High‐Performance Quasi‐Solid‐State Polymer Electrolyte
by: Fan Yang, et al.
Published: (2025) -
Inducing Systematicity in Transformers by Attending to Structurally Quantized Embeddings
by: Jiang, Yichen, et al.
Published: (2024)