Saved in:
| Main Authors: | Mishra, Asit, Stosic, Dusan, Layton, Simon, Micikevicius, Paulius |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2506.08027 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
The Future of Large Language Model Pre-training is Federated
by: Sani, Lorenzo, et al.
Published: (2024)
by: Sani, Lorenzo, et al.
Published: (2024)
Improving training time and GPU utilization in geo-distributed language model training
by: Palak, et al.
Published: (2024)
by: Palak, et al.
Published: (2024)
Data movement limits to frontier model training
by: Erdil, Ege, et al.
Published: (2024)
by: Erdil, Ege, et al.
Published: (2024)
Pre-Deployment Complexity Estimation for Federated Perception Systems
by: Solaiman, KMA, et al.
Published: (2026)
by: Solaiman, KMA, et al.
Published: (2026)
FLStore: Efficient Federated Learning Storage for non-training workloads
by: Khan, Ahmad Faraz, et al.
Published: (2025)
by: Khan, Ahmad Faraz, et al.
Published: (2025)
LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs
by: Park, Chansung, et al.
Published: (2024)
by: Park, Chansung, et al.
Published: (2024)
Marconi: Prefix Caching for the Era of Hybrid LLMs
by: Pan, Rui, et al.
Published: (2024)
by: Pan, Rui, et al.
Published: (2024)
MinT: Managed Infrastructure for Training and Serving Millions of LLMs
by: Lab, Mind, et al.
Published: (2026)
by: Lab, Mind, et al.
Published: (2026)
LoRAFusion: Efficient LoRA Fine-Tuning for LLMs
by: Zhu, Zhanda, et al.
Published: (2025)
by: Zhu, Zhanda, et al.
Published: (2025)
SFPrompt: Communication-Efficient Split Federated Fine-Tuning for Large Pre-Trained Models over Resource-Limited Devices
by: Cao, Linxiao, et al.
Published: (2024)
by: Cao, Linxiao, et al.
Published: (2024)
DISTFLASHATTN: Distributed Memory-efficient Attention for Long-context LLMs Training
by: Li, Dacheng, et al.
Published: (2023)
by: Li, Dacheng, et al.
Published: (2023)
Fail Fast, Win Big: Rethinking the Drafting Strategy in Speculative Decoding via Diffusion LLMs
by: Pan, Rui, et al.
Published: (2025)
by: Pan, Rui, et al.
Published: (2025)
Towards the Next Frontier of LLMs, Training on Private Data: A Cross-Domain Benchmark for Federated Fine-Tuning
by: Jimenez-Gutierrez, Daniel M., et al.
Published: (2026)
by: Jimenez-Gutierrez, Daniel M., et al.
Published: (2026)
LiquidGEMM: Hardware-Efficient W4A8 GEMM Kernel for High-Performance LLM Serving
by: Hu, Huanqi, et al.
Published: (2025)
by: Hu, Huanqi, et al.
Published: (2025)
TorchTitan: One-stop PyTorch native solution for production ready LLM pre-training
by: Liang, Wanchao, et al.
Published: (2024)
by: Liang, Wanchao, et al.
Published: (2024)
Bitnet.cpp: Efficient Edge Inference for Ternary LLMs
by: Wang, Jinheng, et al.
Published: (2025)
by: Wang, Jinheng, et al.
Published: (2025)
Vertical Federated Learning Hybrid Local Pre-training
by: Li, Wenguo, et al.
Published: (2024)
by: Li, Wenguo, et al.
Published: (2024)
Poison Once, Refuse Forever: Weaponizing Alignment for Injecting Bias in LLMs
by: Mamun, Md Abdullah Al, et al.
Published: (2025)
by: Mamun, Md Abdullah Al, et al.
Published: (2025)
Low-Rank GEMM: Efficient Matrix Multiplication via Low-Rank Approximation with FP8 Acceleration
by: Metere, Alfredo
Published: (2025)
by: Metere, Alfredo
Published: (2025)
A Structure-Agnostic Co-Tuning Framework for LLMs and SLMs in Cloud-Edge Systems
by: Liu, Yuze, et al.
Published: (2025)
by: Liu, Yuze, et al.
Published: (2025)
SimulCost: A Cost-Aware Benchmark and Toolkit for Automating Physics Simulations with LLMs
by: Cao, Yadi, et al.
Published: (2026)
by: Cao, Yadi, et al.
Published: (2026)
GPT-FL: Generative Pre-trained Model-Assisted Federated Learning
by: Zhang, Tuo, et al.
Published: (2023)
by: Zhang, Tuo, et al.
Published: (2023)
A Unified Convergence Analysis for Semi-Decentralized Learning: Sampled-to-Sampled vs. Sampled-to-All Communication
by: Rodio, Angelo, et al.
Published: (2025)
by: Rodio, Angelo, et al.
Published: (2025)
Synera: Synergistic LLM Serving across Device and Cloud at Scale
by: Wang, Genglin, et al.
Published: (2025)
by: Wang, Genglin, et al.
Published: (2025)
PGT-I: Scaling Spatiotemporal GNNs with Memory-Efficient Distributed Training
by: Ockerman, Seth, et al.
Published: (2025)
by: Ockerman, Seth, et al.
Published: (2025)
A-3PO: Accelerating Asynchronous LLM Training with Staleness-aware Proximal Policy Approximation
by: Li, Xiaocan, et al.
Published: (2025)
by: Li, Xiaocan, et al.
Published: (2025)
Topology-Aware Knowledge Propagation in Decentralized Learning
by: Sakarvadia, Mansi, et al.
Published: (2025)
by: Sakarvadia, Mansi, et al.
Published: (2025)
FedHERO: A Federated Learning Approach for Node Classification Task on Heterophilic Graphs
by: Chen, Zihan, et al.
Published: (2025)
by: Chen, Zihan, et al.
Published: (2025)
FedCGD: Collective Gradient Divergence Optimized Scheduling for Wireless Federated Learning
by: Chen, Tan, et al.
Published: (2025)
by: Chen, Tan, et al.
Published: (2025)
Research on Edge Computing and Cloud Collaborative Resource Scheduling Optimization Based on Deep Reinforcement Learning
by: Wang, Yuqing, et al.
Published: (2025)
by: Wang, Yuqing, et al.
Published: (2025)
HSplitLoRA: A Heterogeneous Split Parameter-Efficient Fine-Tuning Framework for Large Language Models
by: Lin, Zheng, et al.
Published: (2025)
by: Lin, Zheng, et al.
Published: (2025)
Learning Like Humans: Resource-Efficient Federated Fine-Tuning through Cognitive Developmental Stages
by: Wu, Yebo, et al.
Published: (2025)
by: Wu, Yebo, et al.
Published: (2025)
PipeOffload: Improving Scalability of Pipeline Parallelism with Memory Optimization
by: Wan, Xinyi, et al.
Published: (2025)
by: Wan, Xinyi, et al.
Published: (2025)
Accelerating Privacy-Preserving Federated Learning in Large-Scale LEO Satellite Systems
by: Guo, Binquan, et al.
Published: (2025)
by: Guo, Binquan, et al.
Published: (2025)
On Using Large-Batches in Federated Learning
by: Tyagi, Sahil
Published: (2025)
by: Tyagi, Sahil
Published: (2025)
Adaptive Approach to Enhance Machine Learning Scheduling Algorithms During Runtime Using Reinforcement Learning in Metascheduling Applications
by: Alshaer, Samer, et al.
Published: (2025)
by: Alshaer, Samer, et al.
Published: (2025)
Federated Attention: A Distributed Paradigm for Collaborative LLM Inference over Edge Networks
by: Deng, Xiumei, et al.
Published: (2025)
by: Deng, Xiumei, et al.
Published: (2025)
Adaptive Graph Pruning with Sudden-Events Evaluation for Traffic Prediction using Online Semi-Decentralized ST-GNNs
by: Kralj, Ivan, et al.
Published: (2025)
by: Kralj, Ivan, et al.
Published: (2025)
Task-Agnostic Federation over Decentralized Data: Research Landscape and Visions
by: Wu, Wentai, et al.
Published: (2025)
by: Wu, Wentai, et al.
Published: (2025)
DPQuant: Efficient and Differentially-Private Model Training via Dynamic Quantization Scheduling
by: Gao, Yubo, et al.
Published: (2025)
by: Gao, Yubo, et al.
Published: (2025)
Similar Items
-
The Future of Large Language Model Pre-training is Federated
by: Sani, Lorenzo, et al.
Published: (2024) -
Improving training time and GPU utilization in geo-distributed language model training
by: Palak, et al.
Published: (2024) -
Data movement limits to frontier model training
by: Erdil, Ege, et al.
Published: (2024) -
Pre-Deployment Complexity Estimation for Federated Perception Systems
by: Solaiman, KMA, et al.
Published: (2026) -
FLStore: Efficient Federated Learning Storage for non-training workloads
by: Khan, Ahmad Faraz, et al.
Published: (2025)