:: Library Catalog

Bibliographic Details
Main Authors:	Akkas, Selahattin; Devarakonda, Aditya; Azad, Ariful
Format:	Preprint
Published:	2025
Subjects:	Machine Learning; Artificial Intelligence; Distributed, Parallel, and Cluster Computing
Online Access:	https://arxiv.org/abs/2506.22668

Similar Items

Shapley-Value-Based Graph Sparsification for GNN Inference
by: Akkas, Selahattin, et al.
Published: (2025)

GNNShap: Scalable and Accurate GNN Explanation using Shapley Values
by: Akkas, Selahattin, et al.
Published: (2024)

Communication-free Sampling and 4D Hybrid Parallelism for Scalable Mini-batch GNN Training
by: Wei, Cunyang, et al.
Published: (2026)

MQ-GNN: A Multi-Queue Pipelined Architecture for Scalable and Efficient GNN Training
by: Ullah, Irfan, et al.
Published: (2026)

Plexus: Taming Billion-edge Graphs with 3D Parallel Full-graph GNN Training
by: Ranjan, Aditya K., et al.
Published: (2025)

DistRL: An Asynchronous Distributed Reinforcement Learning Framework for On-Device Control Agents
by: Wang, Taiyi, et al.
Published: (2024)

D3-GNN: Dynamic Distributed Dataflow for Streaming Graph Neural Networks
by: Guliyev, Rustam, et al.
Published: (2024)

Democratizing AI: Open-source Scalable LLM Training on GPU-based Supercomputers
by: Singh, Siddharth, et al.
Published: (2025)

Scalable Artificial Intelligence for Science: Perspectives, Methods and Exemplars
by: Brewer, Wesley, et al.
Published: (2024)

FSD-Inference: Fully Serverless Distributed Inference with Scalable Cloud Communication
by: Oakley, Joe, et al.
Published: (2024)

iSpLib: A Library for Accelerating Graph Neural Networks using Auto-tuned Sparse Operations
by: Anik, Md Saidul Hoque, et al.
Published: (2024)

Efficient and Scalable Agentic AI with Heterogeneous Systems
by: Asgar, Zain, et al.
Published: (2025)

Context Parallelism for Scalable Million-Token Inference
by: Yang, Amy, et al.
Published: (2024)

Rudder: Steering Prefetching in Distributed GNN Training using LLM Agents
by: Sarkar, Aishwarya, et al.
Published: (2026)

Communication-Avoiding Linear Algebraic Kernel K-Means on GPUs
by: Bellavita, Julian, et al.
Published: (2026)

LLM-42: Enabling Determinism in LLM Inference with Verified Speculation
by: Gond, Raja, et al.
Published: (2026)

PipeOffload: Improving Scalability of Pipeline Parallelism with Memory Optimization
by: Wan, Xinyi, et al.
Published: (2025)

Laminar: A Scalable Asynchronous RL Post-Training Framework
by: Sheng, Guangming, et al.
Published: (2025)

Hubs and Spokes Learning: Efficient and Scalable Collaborative Machine Learning
by: Sharma, Atul, et al.
Published: (2025)

The Big Send-off: Scalable and Performant Collectives for Deep Learning
by: Singh, Siddharth, et al.
Published: (2025)

Scalable and Adaptive Parallel Training of Graph Transformer on Large Graphs
by: Lin, Jun-Liang, et al.
Published: (2026)

GPT Carry-On: Training Foundation Model for Customization Could Be Simple, Scalable and Affordable
by: Wangni, Jianqiao
Published: (2025)

Scalable Pretraining of Large Mixture of Experts Language Models on Aurora Super Computer
by: Vooturi, Dharma Teja, et al.
Published: (2026)

Guard: Scalable Straggler Detection and Node Health Management for Large-Scale Training
by: Liu, Guanliang, et al.
Published: (2026)

GraphPipe: Improving Performance and Scalability of DNN Training with Graph Pipeline Parallelism
by: Jeon, Byungsoo, et al.
Published: (2024)

FLoRIST: Singular Value Thresholding for Efficient and Accurate Federated Fine-Tuning of Large Language Models
by: Ramesh, Hariharan, et al.
Published: (2025)

Optimal Transport Aggregation for Distributed Mixture-of-Experts
by: Chamroukhi, Faïcel, et al.
Published: (2023)

Trustworthiness of Stochastic Gradient Descent in Distributed Learning
by: Li, Hongyang, et al.
Published: (2024)

On the Fragility of Data Attribution When Learning Is Distributed
by: Gao, Xian, et al.
Published: (2026)

Measuring Heterogeneity in Machine Learning with Distributed Energy Distance
by: Fan, Mengchen, et al.
Published: (2025)

Distributed Low-Communication Training with Decoupled Momentum Optimization
by: Nedelkoski, Sasho, et al.
Published: (2025)

Loss- and Reward-Weighting for Efficient Distributed Reinforcement Learning
by: Holen, Martin, et al.
Published: (2023)

Adaptive Consensus Gradients Aggregation for Scaled Distributed Training
by: Choukroun, Yoni, et al.
Published: (2024)

TrainVerify: Equivalence-Based Verification for Distributed LLM Training
by: Lu, Yunchi, et al.
Published: (2025)

Galvatron: An Automatic Distributed System for Efficient Foundation Model Training
by: Liu, Xinyi, et al.
Published: (2025)

Accelerating Sampling and Aggregation Operations in GNN Frameworks with GPU Initiated Direct Storage Accesses
by: Park, Jeongmin Brian, et al.
Published: (2023)

FedCore: Straggler-Free Federated Learning with Distributed Coresets
by: Guo, Hongpeng, et al.
Published: (2024)

Efficient Resource Scheduling for Distributed Infrastructures Using Negotiation Capabilities
by: Chu, Junjie, et al.
Published: (2024)

PGT-I: Scaling Spatiotemporal GNNs with Memory-Efficient Distributed Training
by: Ockerman, Seth, et al.
Published: (2025)

ATTENTION2D: Communication Efficient Distributed Self-Attention Mechanism
by: Elango, Venmugil
Published: (2025)