:: Library Catalog

Bibliographic Details
Main Authors:	Ozkara, Kaan; Yu, Tao; Park, Youngsuk
Format:	Preprint
Published:	2025
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2502.20566

Similar Items

MuonBP: Faster Muon via Block-Periodic Orthogonalization
by: Khaled, Ahmed, et al.
Published: (2025)

Directional Alignment Mitigates Reward Hacking in Reinforcement Learning for Language Models
by: Deng, Wenlong, et al.
Published: (2026)

Training LLMs with MXFP4
by: Tseng, Albert, et al.
Published: (2025)

SPIRE: Conditional Personalization for Federated Diffusion Generative Models
by: Ozkara, Kaan, et al.
Published: (2025)

ADEPT: Hierarchical Bayes Approach to Personalized Federated Unsupervised Learning
by: Ozkara, Kaan, et al.
Published: (2024)

Collage: Light-Weight Low-Precision Strategy for LLM Training
by: Yu, Tao, et al.
Published: (2024)

Scaling Laws Meet Model Architecture: Toward Inference-Efficient LLMs
by: Bian, Song, et al.
Published: (2025)

MADA: Meta-Adaptive Optimizers through hyper-gradient Descent
by: Ozkara, Kaan, et al.
Published: (2024)

Direct Quantized Training of Language Models with Stochastic Rounding
by: Zhao, Kaiyan, et al.
Published: (2024)

TritonRL: Training LLMs to Think and Code Triton Without Cheating
by: Woo, Jiin, et al.
Published: (2025)

Training with Fewer Bits: Unlocking Edge LLMs Training with Stochastic Rounding
by: Liu, Taowen, et al.
Published: (2025)

MEL: Multi-level Ensemble Learning for Resource-Constrained Environments
by: Gudipaty, Krishna Praneet, et al.
Published: (2025)

Not-a-Bandit: Provably No-Regret Drafter Selection in Speculative Decoding for LLMs
by: Liu, Hongyi, et al.
Published: (2025)

Stochastic Rounding Increases Small Singular Values
by: Ma, Linkai, et al.
Published: (2026)

Online Posterior Sampling with a Diffusion Prior
by: Kveton, Branislav, et al.
Published: (2024)

Theoretical Guarantees of Learning Ensembling Strategies with Applications to Time Series Forecasting
by: Hasson, Hilaf, et al.
Published: (2023)

Variance-reduced Zeroth-Order Methods for Fine-Tuning Language Models
by: Gautam, Tanmay, et al.
Published: (2024)

On Stochastic Rounding with Few Random Bits
by: Fitzgibbon, Andrew, et al.
Published: (2025)

FlexRound: Learnable Rounding based on Element-wise Division for Post-Training Quantization
by: Lee, Jung Hyun, et al.
Published: (2023)

ProxSparse: Regularized Learning of Semi-Structured Sparsity Masks for Pretrained LLMs
by: Liu, Hongyi, et al.
Published: (2025)

Beyond Fixed Rounds: Data-Free Early Stopping for Practical Federated Learning
by: Lee, Youngjoon, et al.
Published: (2026)

Stochastic Deep Graph Clustering for Practical Group Formation
by: Park, Junhyung, et al.
Published: (2025)

Revisiting Adaptive Rounding with Vectorized Reparameterization for LLM Quantization
by: Zhou, Yuli, et al.
Published: (2026)

Round-trip Reinforcement Learning: Self-Consistent Training for Better Chemical LLMs
by: Kong, Lecheng, et al.
Published: (2025)

Superpositional Gradient Descent: Harnessing Quantum Principles for Model Training
by: Pamuk, Ahmet Erdem, et al.
Published: (2025)

RoSTE: An Efficient Quantization-Aware Supervised Fine-Tuning Approach for Large Language Models
by: Wei, Quan, et al.
Published: (2025)

Verifier-free Test-Time Sampling for Vision Language Action Models
by: Jang, Suhyeok, et al.
Published: (2025)

Elucidating Subspace Perturbation in Zeroth-Order Optimization: Theory and Practice at Scale
by: Park, Sihwan, et al.
Published: (2025)

Efficient Training on Multiple Consumer GPUs with RoundPipe
by: Luo, Yibin, et al.
Published: (2026)

Optimal Scheduling Algorithms for LLM Inference: Theory and Practice
by: Bari, Agrim, et al.
Published: (2025)

A Universal Banach--Bregman Framework for Stochastic Iterations: Unifying Stochastic Mirror Descent, Learning and LLM Training
by: Zhang, Johnny R., et al.
Published: (2025)

Bridging Theory and Practice: A Stochastic Learning-Optimization Model for Resilient Automotive Supply Chains
by: Shahnawaz, Muhammad, et al.
Published: (2025)

QuChaTeR: A Hybrid Quantum-Chaotic Temporal Framework for Earthquake Prediction
by: Özdemir, Emir Kaan
Published: (2026)

Random Matrix Theory for Stochastic Gradient Descent
by: Park, Chanju, et al.
Published: (2024)

Round and Round We Go! What makes Rotary Positional Encodings useful?
by: Barbero, Federico, et al.
Published: (2024)

Stochastic Forward-Backward Deconvolution: Training Diffusion Models with Finite Noisy Datasets
by: Lu, Haoye, et al.
Published: (2025)

A Reduction Algorithm for Markovian Contextual Linear Bandits
by: Buyukkalayci, Kaan, et al.
Published: (2026)

Inference Optimization of Foundation Models on AI Accelerators
by: Park, Youngsuk, et al.
Published: (2024)

Memory-Efficient LLM Training with Dynamic Sparsity: From Stability to Practical Scaling
by: Xiao, Qiao, et al.
Published: (2026)

Towards Best Practices for Open Datasets for LLM Training
by: Baack, Stefan, et al.
Published: (2025)