Saved in:
| Main Authors: | Kumar, Medha, Xu, Zifei, Wang, Xin, Webb, Tristan |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2511.15694 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Scaling Laws for Post Training Quantized Large Language Models
by: Xu, Zifei, et al.
Published: (2024)
by: Xu, Zifei, et al.
Published: (2024)
Understanding the Difficulty of Low-Precision Post-Training Quantization for LLMs
by: Xu, Zifei, et al.
Published: (2024)
by: Xu, Zifei, et al.
Published: (2024)
Post Training Quantization of Large Language Models with Microscaling Formats
by: Sharify, Sayeh, et al.
Published: (2024)
by: Sharify, Sayeh, et al.
Published: (2024)
Early Attentive Sparsification Accelerates Neural Speech Transcription
by: Xu, Zifei, et al.
Published: (2025)
by: Xu, Zifei, et al.
Published: (2025)
MF-QAT: Multi-Format Quantization-Aware Training for Elastic Inference
by: Xu, Zifei, et al.
Published: (2026)
by: Xu, Zifei, et al.
Published: (2026)
The Impact of Quantization and Pruning on Deep Reinforcement Learning Models
by: Lu, Heng, et al.
Published: (2024)
by: Lu, Heng, et al.
Published: (2024)
From Narratives to Probabilistic Reasoning: Predicting and Interpreting Drivers' Hazardous Actions in Crashes Using Large Language Model
by: Chen, Boyou, et al.
Published: (2025)
by: Chen, Boyou, et al.
Published: (2025)
QuRL: Efficient Reinforcement Learning with Quantized Rollout
by: Li, Yuhang, et al.
Published: (2026)
by: Li, Yuhang, et al.
Published: (2026)
Teaching Large Language Models to Reason with Reinforcement Learning
by: Havrilla, Alex, et al.
Published: (2024)
by: Havrilla, Alex, et al.
Published: (2024)
Mechanistic Interpretability of Reinforcement Learning Agents
by: Trim, Tristan, et al.
Published: (2024)
by: Trim, Tristan, et al.
Published: (2024)
A Survey of Reinforcement Learning for Large Reasoning Models
by: Zhang, Kaiyan, et al.
Published: (2025)
by: Zhang, Kaiyan, et al.
Published: (2025)
Mitigating Overthinking in Large Reasoning Models via Difficulty-aware Reinforcement Learning
by: Wan, Qian, et al.
Published: (2026)
by: Wan, Qian, et al.
Published: (2026)
SRPO: A Cross-Domain Implementation of Large-Scale Reinforcement Learning on LLM
by: Zhang, Xiaojiang, et al.
Published: (2025)
by: Zhang, Xiaojiang, et al.
Published: (2025)
An ADHD Diagnostic Interface Based on EEG Spectrograms and Deep Learning Techniques
by: Pappula, Medha, et al.
Published: (2024)
by: Pappula, Medha, et al.
Published: (2024)
QuantLRM: Quantization of Large Reasoning Models via Fine-Tuning Signals
by: Zhang, Nan, et al.
Published: (2026)
by: Zhang, Nan, et al.
Published: (2026)
Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning
by: Xi, Zhiheng, et al.
Published: (2024)
by: Xi, Zhiheng, et al.
Published: (2024)
Revisiting Entropy in Reinforcement Learning for Large Reasoning Models
by: Jin, Renren, et al.
Published: (2025)
by: Jin, Renren, et al.
Published: (2025)
Reinforcement Learning for Reasoning in Large Language Models with One Training Example
by: Wang, Yiping, et al.
Published: (2025)
by: Wang, Yiping, et al.
Published: (2025)
Sinkhorn Treatment Effects: A Causal Optimal Transport Measure
by: Agarwal, Medha, et al.
Published: (2026)
by: Agarwal, Medha, et al.
Published: (2026)
CRoPE: Efficient Parametrization of Rotary Positional Embedding
by: Lou, Beicheng, et al.
Published: (2026)
by: Lou, Beicheng, et al.
Published: (2026)
On Predictability of Reinforcement Learning Dynamics for Large Language Models
by: Cai, Yuchen, et al.
Published: (2025)
by: Cai, Yuchen, et al.
Published: (2025)
Integrating Large Language Models and Reinforcement Learning for Non-Linear Reasoning
by: Alon, Yoav, et al.
Published: (2024)
by: Alon, Yoav, et al.
Published: (2024)
Quantizing Small-Scale State-Space Models for Edge AI
by: Zhao, Leo, et al.
Published: (2025)
by: Zhao, Leo, et al.
Published: (2025)
Reasoning Under 1 Billion: Memory-Augmented Reinforcement Learning for Large Language Models
by: Le, Hung, et al.
Published: (2025)
by: Le, Hung, et al.
Published: (2025)
CBQ: Cross-Block Quantization for Large Language Models
by: Ding, Xin, et al.
Published: (2023)
by: Ding, Xin, et al.
Published: (2023)
ResQ: Mixed-Precision Quantization of Large Language Models with Low-Rank Residuals
by: Saxena, Utkarsh, et al.
Published: (2024)
by: Saxena, Utkarsh, et al.
Published: (2024)
AR$^2$: Adversarial Reinforcement Learning for Abstract Reasoning in Large Language Models
by: Yeh, Cheng-Kai, et al.
Published: (2025)
by: Yeh, Cheng-Kai, et al.
Published: (2025)
Dynamic Adversarial Reinforcement Learning for Robust Multimodal Large Language Models
by: Bao, Yicheng, et al.
Published: (2026)
by: Bao, Yicheng, et al.
Published: (2026)
ReasonCache: Accelerating Large Reasoning Model Serving through KV Cache Sharing
by: Chen, Kaiwen, et al.
Published: (2025)
by: Chen, Kaiwen, et al.
Published: (2025)
B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners
by: Zeng, Weihao, et al.
Published: (2024)
by: Zeng, Weihao, et al.
Published: (2024)
Distributed Area Coverage with High Altitude Balloons Using Multi-Agent Reinforcement Learning
by: Haroon, Adam, et al.
Published: (2025)
by: Haroon, Adam, et al.
Published: (2025)
LAQuant: A Simple Overhead-free Large Reasoning Model Quantization by Layer-wise Lookahead Loss
by: Choi, Euntae, et al.
Published: (2026)
by: Choi, Euntae, et al.
Published: (2026)
AReaL: A Large-Scale Asynchronous Reinforcement Learning System for Language Reasoning
by: Fu, Wei, et al.
Published: (2025)
by: Fu, Wei, et al.
Published: (2025)
IV-ICL: Bounding Causal Effects with Instrumental Variables via In-Context Learning
by: Balazadeh, Vahid, et al.
Published: (2026)
by: Balazadeh, Vahid, et al.
Published: (2026)
STQuant: Spatio-Temporal Adaptive Framework for Optimizer Quantization in Large Multimodal Model Training
by: Liu, Minglu, et al.
Published: (2026)
by: Liu, Minglu, et al.
Published: (2026)
d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning
by: Zhao, Siyan, et al.
Published: (2025)
by: Zhao, Siyan, et al.
Published: (2025)
A Carbon Tracking Model for Federated Learning: Impact of Quantization and Sparsification
by: Barbieri, Luca, et al.
Published: (2023)
by: Barbieri, Luca, et al.
Published: (2023)
Art and Science of Quantizing Large-Scale Models: A Comprehensive Overview
by: Wang, Yanshu, et al.
Published: (2024)
by: Wang, Yanshu, et al.
Published: (2024)
Kalman Filter Enhanced GRPO for Reinforcement Learning-Based Language Model Reasoning
by: Wang, Hu, et al.
Published: (2025)
by: Wang, Hu, et al.
Published: (2025)
Efficient Reinforcement Learning with Large Language Model Priors
by: Yan, Xue, et al.
Published: (2024)
by: Yan, Xue, et al.
Published: (2024)
Similar Items
-
Scaling Laws for Post Training Quantized Large Language Models
by: Xu, Zifei, et al.
Published: (2024) -
Understanding the Difficulty of Low-Precision Post-Training Quantization for LLMs
by: Xu, Zifei, et al.
Published: (2024) -
Post Training Quantization of Large Language Models with Microscaling Formats
by: Sharify, Sayeh, et al.
Published: (2024) -
Early Attentive Sparsification Accelerates Neural Speech Transcription
by: Xu, Zifei, et al.
Published: (2025) -
MF-QAT: Multi-Format Quantization-Aware Training for Elastic Inference
by: Xu, Zifei, et al.
Published: (2026)