Saved in:
| Main Authors: | Ozkara, Kaan, Yu, Tao, Park, Youngsuk |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2502.20566 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
MuonBP: Faster Muon via Block-Periodic Orthogonalization
by: Khaled, Ahmed, et al.
Published: (2025)
by: Khaled, Ahmed, et al.
Published: (2025)
Directional Alignment Mitigates Reward Hacking in Reinforcement Learning for Language Models
by: Deng, Wenlong, et al.
Published: (2026)
by: Deng, Wenlong, et al.
Published: (2026)
Training LLMs with MXFP4
by: Tseng, Albert, et al.
Published: (2025)
by: Tseng, Albert, et al.
Published: (2025)
SPIRE: Conditional Personalization for Federated Diffusion Generative Models
by: Ozkara, Kaan, et al.
Published: (2025)
by: Ozkara, Kaan, et al.
Published: (2025)
ADEPT: Hierarchical Bayes Approach to Personalized Federated Unsupervised Learning
by: Ozkara, Kaan, et al.
Published: (2024)
by: Ozkara, Kaan, et al.
Published: (2024)
Collage: Light-Weight Low-Precision Strategy for LLM Training
by: Yu, Tao, et al.
Published: (2024)
by: Yu, Tao, et al.
Published: (2024)
Scaling Laws Meet Model Architecture: Toward Inference-Efficient LLMs
by: Bian, Song, et al.
Published: (2025)
by: Bian, Song, et al.
Published: (2025)
MADA: Meta-Adaptive Optimizers through hyper-gradient Descent
by: Ozkara, Kaan, et al.
Published: (2024)
by: Ozkara, Kaan, et al.
Published: (2024)
Direct Quantized Training of Language Models with Stochastic Rounding
by: Zhao, Kaiyan, et al.
Published: (2024)
by: Zhao, Kaiyan, et al.
Published: (2024)
TritonRL: Training LLMs to Think and Code Triton Without Cheating
by: Woo, Jiin, et al.
Published: (2025)
by: Woo, Jiin, et al.
Published: (2025)
Training with Fewer Bits: Unlocking Edge LLMs Training with Stochastic Rounding
by: Liu, Taowen, et al.
Published: (2025)
by: Liu, Taowen, et al.
Published: (2025)
MEL: Multi-level Ensemble Learning for Resource-Constrained Environments
by: Gudipaty, Krishna Praneet, et al.
Published: (2025)
by: Gudipaty, Krishna Praneet, et al.
Published: (2025)
Not-a-Bandit: Provably No-Regret Drafter Selection in Speculative Decoding for LLMs
by: Liu, Hongyi, et al.
Published: (2025)
by: Liu, Hongyi, et al.
Published: (2025)
Stochastic Rounding Increases Small Singular Values
by: Ma, Linkai, et al.
Published: (2026)
by: Ma, Linkai, et al.
Published: (2026)
Online Posterior Sampling with a Diffusion Prior
by: Kveton, Branislav, et al.
Published: (2024)
by: Kveton, Branislav, et al.
Published: (2024)
Theoretical Guarantees of Learning Ensembling Strategies with Applications to Time Series Forecasting
by: Hasson, Hilaf, et al.
Published: (2023)
by: Hasson, Hilaf, et al.
Published: (2023)
Variance-reduced Zeroth-Order Methods for Fine-Tuning Language Models
by: Gautam, Tanmay, et al.
Published: (2024)
by: Gautam, Tanmay, et al.
Published: (2024)
On Stochastic Rounding with Few Random Bits
by: Fitzgibbon, Andrew, et al.
Published: (2025)
by: Fitzgibbon, Andrew, et al.
Published: (2025)
FlexRound: Learnable Rounding based on Element-wise Division for Post-Training Quantization
by: Lee, Jung Hyun, et al.
Published: (2023)
by: Lee, Jung Hyun, et al.
Published: (2023)
ProxSparse: Regularized Learning of Semi-Structured Sparsity Masks for Pretrained LLMs
by: Liu, Hongyi, et al.
Published: (2025)
by: Liu, Hongyi, et al.
Published: (2025)
Beyond Fixed Rounds: Data-Free Early Stopping for Practical Federated Learning
by: Lee, Youngjoon, et al.
Published: (2026)
by: Lee, Youngjoon, et al.
Published: (2026)
Stochastic Deep Graph Clustering for Practical Group Formation
by: Park, Junhyung, et al.
Published: (2025)
by: Park, Junhyung, et al.
Published: (2025)
Revisiting Adaptive Rounding with Vectorized Reparameterization for LLM Quantization
by: Zhou, Yuli, et al.
Published: (2026)
by: Zhou, Yuli, et al.
Published: (2026)
Round-trip Reinforcement Learning: Self-Consistent Training for Better Chemical LLMs
by: Kong, Lecheng, et al.
Published: (2025)
by: Kong, Lecheng, et al.
Published: (2025)
Superpositional Gradient Descent: Harnessing Quantum Principles for Model Training
by: Pamuk, Ahmet Erdem, et al.
Published: (2025)
by: Pamuk, Ahmet Erdem, et al.
Published: (2025)
RoSTE: An Efficient Quantization-Aware Supervised Fine-Tuning Approach for Large Language Models
by: Wei, Quan, et al.
Published: (2025)
by: Wei, Quan, et al.
Published: (2025)
Verifier-free Test-Time Sampling for Vision Language Action Models
by: Jang, Suhyeok, et al.
Published: (2025)
by: Jang, Suhyeok, et al.
Published: (2025)
Elucidating Subspace Perturbation in Zeroth-Order Optimization: Theory and Practice at Scale
by: Park, Sihwan, et al.
Published: (2025)
by: Park, Sihwan, et al.
Published: (2025)
Efficient Training on Multiple Consumer GPUs with RoundPipe
by: Luo, Yibin, et al.
Published: (2026)
by: Luo, Yibin, et al.
Published: (2026)
Optimal Scheduling Algorithms for LLM Inference: Theory and Practice
by: Bari, Agrim, et al.
Published: (2025)
by: Bari, Agrim, et al.
Published: (2025)
A Universal Banach--Bregman Framework for Stochastic Iterations: Unifying Stochastic Mirror Descent, Learning and LLM Training
by: Zhang, Johnny R., et al.
Published: (2025)
by: Zhang, Johnny R., et al.
Published: (2025)
Bridging Theory and Practice: A Stochastic Learning-Optimization Model for Resilient Automotive Supply Chains
by: Shahnawaz, Muhammad, et al.
Published: (2025)
by: Shahnawaz, Muhammad, et al.
Published: (2025)
QuChaTeR: A Hybrid Quantum-Chaotic Temporal Framework for Earthquake Prediction
by: Özdemir, Emir Kaan
Published: (2026)
by: Özdemir, Emir Kaan
Published: (2026)
Random Matrix Theory for Stochastic Gradient Descent
by: Park, Chanju, et al.
Published: (2024)
by: Park, Chanju, et al.
Published: (2024)
Round and Round We Go! What makes Rotary Positional Encodings useful?
by: Barbero, Federico, et al.
Published: (2024)
by: Barbero, Federico, et al.
Published: (2024)
Stochastic Forward-Backward Deconvolution: Training Diffusion Models with Finite Noisy Datasets
by: Lu, Haoye, et al.
Published: (2025)
by: Lu, Haoye, et al.
Published: (2025)
A Reduction Algorithm for Markovian Contextual Linear Bandits
by: Buyukkalayci, Kaan, et al.
Published: (2026)
by: Buyukkalayci, Kaan, et al.
Published: (2026)
Inference Optimization of Foundation Models on AI Accelerators
by: Park, Youngsuk, et al.
Published: (2024)
by: Park, Youngsuk, et al.
Published: (2024)
Memory-Efficient LLM Training with Dynamic Sparsity: From Stability to Practical Scaling
by: Xiao, Qiao, et al.
Published: (2026)
by: Xiao, Qiao, et al.
Published: (2026)
Towards Best Practices for Open Datasets for LLM Training
by: Baack, Stefan, et al.
Published: (2025)
by: Baack, Stefan, et al.
Published: (2025)
Similar Items
-
MuonBP: Faster Muon via Block-Periodic Orthogonalization
by: Khaled, Ahmed, et al.
Published: (2025) -
Directional Alignment Mitigates Reward Hacking in Reinforcement Learning for Language Models
by: Deng, Wenlong, et al.
Published: (2026) -
Training LLMs with MXFP4
by: Tseng, Albert, et al.
Published: (2025) -
SPIRE: Conditional Personalization for Federated Diffusion Generative Models
by: Ozkara, Kaan, et al.
Published: (2025) -
ADEPT: Hierarchical Bayes Approach to Personalized Federated Unsupervised Learning
by: Ozkara, Kaan, et al.
Published: (2024)