Bibliographic Details
Main Authors:	Mishra, Asit; Stosic, Dusan; Layton, Simon; Micikevicius, Paulius
Format:	Preprint
Published:	2025
Subjects:	Machine Learning; Artificial Intelligence; Distributed, Parallel, and Cluster Computing
Online Access:	https://arxiv.org/abs/2506.08027

Similar Items

The Future of Large Language Model Pre-training is Federated
by: Sani, Lorenzo, et al.
Published: (2024)

Improving training time and GPU utilization in geo-distributed language model training
by: Palak, et al.
Published: (2024)

Data movement limits to frontier model training
by: Erdil, Ege, et al.
Published: (2024)

Pre-Deployment Complexity Estimation for Federated Perception Systems
by: Solaiman, KMA, et al.
Published: (2026)

FLStore: Efficient Federated Learning Storage for non-training workloads
by: Khan, Ahmad Faraz, et al.
Published: (2025)

LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs
by: Park, Chansung, et al.
Published: (2024)

Marconi: Prefix Caching for the Era of Hybrid LLMs
by: Pan, Rui, et al.
Published: (2024)

MinT: Managed Infrastructure for Training and Serving Millions of LLMs
by: Lab, Mind, et al.
Published: (2026)

LoRAFusion: Efficient LoRA Fine-Tuning for LLMs
by: Zhu, Zhanda, et al.
Published: (2025)

SFPrompt: Communication-Efficient Split Federated Fine-Tuning for Large Pre-Trained Models over Resource-Limited Devices
by: Cao, Linxiao, et al.
Published: (2024)

DISTFLASHATTN: Distributed Memory-efficient Attention for Long-context LLMs Training
by: Li, Dacheng, et al.
Published: (2023)

Fail Fast, Win Big: Rethinking the Drafting Strategy in Speculative Decoding via Diffusion LLMs
by: Pan, Rui, et al.
Published: (2025)

Towards the Next Frontier of LLMs, Training on Private Data: A Cross-Domain Benchmark for Federated Fine-Tuning
by: Jimenez-Gutierrez, Daniel M., et al.
Published: (2026)

LiquidGEMM: Hardware-Efficient W4A8 GEMM Kernel for High-Performance LLM Serving
by: Hu, Huanqi, et al.
Published: (2025)

TorchTitan: One-stop PyTorch native solution for production ready LLM pre-training
by: Liang, Wanchao, et al.
Published: (2024)

Bitnet.cpp: Efficient Edge Inference for Ternary LLMs
by: Wang, Jinheng, et al.
Published: (2025)

Vertical Federated Learning Hybrid Local Pre-training
by: Li, Wenguo, et al.
Published: (2024)

Poison Once, Refuse Forever: Weaponizing Alignment for Injecting Bias in LLMs
by: Mamun, Md Abdullah Al, et al.
Published: (2025)

Low-Rank GEMM: Efficient Matrix Multiplication via Low-Rank Approximation with FP8 Acceleration
by: Metere, Alfredo
Published: (2025)

A Structure-Agnostic Co-Tuning Framework for LLMs and SLMs in Cloud-Edge Systems
by: Liu, Yuze, et al.
Published: (2025)

SimulCost: A Cost-Aware Benchmark and Toolkit for Automating Physics Simulations with LLMs
by: Cao, Yadi, et al.
Published: (2026)

GPT-FL: Generative Pre-trained Model-Assisted Federated Learning
by: Zhang, Tuo, et al.
Published: (2023)

A Unified Convergence Analysis for Semi-Decentralized Learning: Sampled-to-Sampled vs. Sampled-to-All Communication
by: Rodio, Angelo, et al.
Published: (2025)

Synera: Synergistic LLM Serving across Device and Cloud at Scale
by: Wang, Genglin, et al.
Published: (2025)

PGT-I: Scaling Spatiotemporal GNNs with Memory-Efficient Distributed Training
by: Ockerman, Seth, et al.
Published: (2025)

A-3PO: Accelerating Asynchronous LLM Training with Staleness-aware Proximal Policy Approximation
by: Li, Xiaocan, et al.
Published: (2025)

Topology-Aware Knowledge Propagation in Decentralized Learning
by: Sakarvadia, Mansi, et al.
Published: (2025)

FedHERO: A Federated Learning Approach for Node Classification Task on Heterophilic Graphs
by: Chen, Zihan, et al.
Published: (2025)

FedCGD: Collective Gradient Divergence Optimized Scheduling for Wireless Federated Learning
by: Chen, Tan, et al.
Published: (2025)

Research on Edge Computing and Cloud Collaborative Resource Scheduling Optimization Based on Deep Reinforcement Learning
by: Wang, Yuqing, et al.
Published: (2025)

HSplitLoRA: A Heterogeneous Split Parameter-Efficient Fine-Tuning Framework for Large Language Models
by: Lin, Zheng, et al.
Published: (2025)

Learning Like Humans: Resource-Efficient Federated Fine-Tuning through Cognitive Developmental Stages
by: Wu, Yebo, et al.
Published: (2025)

PipeOffload: Improving Scalability of Pipeline Parallelism with Memory Optimization
by: Wan, Xinyi, et al.
Published: (2025)

Accelerating Privacy-Preserving Federated Learning in Large-Scale LEO Satellite Systems
by: Guo, Binquan, et al.
Published: (2025)

On Using Large-Batches in Federated Learning
by: Tyagi, Sahil
Published: (2025)

Adaptive Approach to Enhance Machine Learning Scheduling Algorithms During Runtime Using Reinforcement Learning in Metascheduling Applications
by: Alshaer, Samer, et al.
Published: (2025)

Federated Attention: A Distributed Paradigm for Collaborative LLM Inference over Edge Networks
by: Deng, Xiumei, et al.
Published: (2025)

Adaptive Graph Pruning with Sudden-Events Evaluation for Traffic Prediction using Online Semi-Decentralized ST-GNNs
by: Kralj, Ivan, et al.
Published: (2025)

Task-Agnostic Federation over Decentralized Data: Research Landscape and Visions
by: Wu, Wentai, et al.
Published: (2025)

DPQuant: Efficient and Differentially-Private Model Training via Dynamic Quantization Scheduling
by: Gao, Yubo, et al.
Published: (2025)