| Main Authors: | Kolehmainen, Jari, Blagoev, Nikolay, Donaghy, John, Ersoy, Oğuzhan, Nies, Christopher |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2506.10911 |
Similar Items
SkipPipe: Partial and Reordered Pipelining Framework for Training LLMs in Heterogeneous Networks
by: Blagoev, Nikolay, et al.
Published: (2025)
Hail to the Thief: Exploring Attacks and Defenses in Decentralised GRPO
by: Blagoev, Nikolay, et al.
Published: (2025)
All is Not Lost: LLM Recovery without Checkpoints
by: Blagoev, Nikolay, et al.
Published: (2025)
HDEE: Heterogeneous Domain Expert Ensemble
by: Ersoy, Oğuzhan, et al.
Published: (2025)
F-TIS: Harnessing Diverse Models in Collaborative GRPO
by: Blagoev, Nikolay, et al.
Published: (2026)
Backdoor Attacks on Decentralised Post-Training
by: Ersoy, Oğuzhan, et al.
Published: (2026)
Training-Free Dynamic Upcycling of Expert Language Models
by: Fanì, Eros, et al.
Published: (2026)
Go With The Flow: Churn-Tolerant Decentralized Training of Large Language Models
by: Blagoev, Nikolay, et al.
Published: (2025)
Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing
by: Amico, Jeffrey, et al.
Published: (2025)
LoCo: Low-Bit Communication Adaptor for Large-scale Model Training
by: Xie, Xingyu, et al.
Published: (2024)
DiLoCo: Distributed Low-Communication Training of Language Models
by: Douillard, Arthur, et al.
Published: (2023)
DEI: Diversity in Evolutionary Inference for Quality-Diversity Search
by: Donaghy, John, et al.
Published: (2026)
DiLoCoX: A Low-Communication Large-Scale Training Framework for Decentralized Cluster
by: Qi, Ji, et al.
Published: (2025)
Sneaky Spikes: Uncovering Stealthy Backdoor Attacks in Spiking Neural Networks with Neuromorphic Data
by: Abad, Gorka, et al.
Published: (2023)
OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training
by: Jaghouar, Sami, et al.
Published: (2024)
Investigating Training Strategies and Model Robustness of Low-Rank Adaptation for Language Modeling in Speech Recognition
by: Yu, Yu, et al.
Published: (2024)
Robust Inference Methods for Latent Group Panel Models under Possible Group Non-Separation
by: Akgun, Oguzhan, et al.
Published: (2025)
LoLCATs: On Low-Rank Linearizing of Large Language Models
by: Zhang, Michael, et al.
Published: (2024)
AdLoCo: adaptive batching significantly improves communications efficiency and convergence for Large Language Models
by: Kutuzov, Nikolay, et al.
Published: (2025)
BiCoLoR: Communication-Efficient Optimization with Bidirectional Compression and Local Training
by: Condat, Laurent, et al.
Published: (2026)
TalkLoRA: Communication-Aware Mixture of Low-Rank Adaptation for Large Language Models
by: Mu, Lin, et al.
Published: (2026)
Towards a Small Language Model Lifecycle Framework
by: Miraghaei, Parsa, et al.
Published: (2025)
Low-Precision Training of Large Language Models: Methods, Challenges, and Opportunities
by: Hao, Zhiwei, et al.
Published: (2025)
LoCoDL: Communication-Efficient Distributed Learning with Local Training and Compression
by: Condat, Laurent, et al.
Published: (2024)
Communication Efficient LLM Pre-training with SparseLoCo
by: Sarfi, Amir, et al.
Published: (2025)
Communication-Efficient Language Model Training Scales Reliably and Robustly: Scaling Laws for DiLoCo
by: Charles, Zachary, et al.
Published: (2025)
Verde: Verification via Refereed Delegation for Machine Learning Programs
by: Arun, Arasu, et al.
Published: (2025)
LoRETTA: Low-Rank Economic Tensor-Train Adaptation for Ultra-Low-Parameter Fine-Tuning of Large Language Models
by: Yang, Yifan, et al.
Published: (2024)
Train Small, Infer Large: Memory-Efficient LoRA Training for Large Language Models
by: Zhang, Jun, et al.
Published: (2025)
LoFT-LLM: Low-Frequency Time-Series Forecasting with Large Language Models
by: You, Jiacheng, et al.
Published: (2025)
LoKO: Low-Rank Kalman Optimizer for Online Fine-Tuning of Large Models
by: Abdi, Hossein, et al.
Published: (2024)
LoRA+: Efficient Low Rank Adaptation of Large Models
by: Hayou, Soufiane, et al.
Published: (2024)
LoRIF: Low-Rank Influence Functions for Scalable Training Data Attribution
by: Li, Shuangqi, et al.
Published: (2026)
Smoothing DiLoCo with Primal Averaging for Faster Training of LLMs
by: Defazio, Aaron, et al.
Published: (2025)
LoRDO: Distributed Low-Rank Optimization with Infrequent Communication
by: Jovanović, Andrej, et al.
Published: (2026)
MuLoCo: Muon is a practical inner optimizer for DiLoCo
by: Thérien, Benjamin, et al.
Published: (2025)
BaLoRA: Bayesian Low-Rank Adaptation of Large Scale Models
by: Coscia, Dario, et al.
Published: (2026)
Sensitivity-LoRA: Low-Load Sensitivity-Based Fine-Tuning for Large Language Models
by: Zhang, Hao, et al.
Published: (2025)
C-LoRA: Contextual Low-Rank Adaptation for Uncertainty Estimation in Large Language Models
by: Rahmati, Amir Hossein, et al.
Published: (2025)
RefLoRA: Refactored Low-Rank Adaptation for Efficient Fine-Tuning of Large Models
by: Zhang, Yilang, et al.
Published: (2025)