:: Library Catalog

Buchumschlag

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Wang, Yatong, Wang, Fali, Gu, Naibin, Lin, Zheng, Liu, Zhengxiao, Yao, Dingyu, Zhang, Zhiwei, Shi, Jianxin, Wang, Weiping
Format:	Preprint
Veröffentlicht:	2026
Schlagworte:	Distributed, Parallel, and Cluster Computing Computation and Language
Online-Zugang:	https://arxiv.org/abs/2605.23913
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Ähnliche Einträge

ALTO: Adaptive LoRA Tuning and Orchestration for Heterogeneous LoRA Training Workloads
von: Zuo, Jingwei, et al.
Veröffentlicht: (2026)

ServerlessLoRA: Minimizing Latency and Cost in Serverless Inference for LoRA-Based LLMs
von: Sui, Yifan, et al.
Veröffentlicht: (2025)

InfiniLoRA: Disaggregated Multi-LoRA Serving for Large Language Models
von: Chen, Hongyu, et al.
Veröffentlicht: (2026)

SHE-LoRA: Selective Homomorphic Encryption for Federated Tuning with Heterogeneous LoRA
von: Liu, Jianmin, et al.
Veröffentlicht: (2025)

S-LoRA: Serving Thousands of Concurrent LoRA Adapters
von: Sheng, Ying, et al.
Veröffentlicht: (2023)

pFedLoRA: Model-Heterogeneous Personalized Federated Learning with LoRA Tuning
von: Yi, Liping, et al.
Veröffentlicht: (2023)

TS-EoH: An Edge Server Task Scheduling Algorithm Based on Evolution of Heuristic
von: Yatong, Wang, et al.
Veröffentlicht: (2024)

Predictive-LoRA: A Proactive and Fragmentation-Aware Serverless Inference System for LLMs
von: Ni, Yinan, et al.
Veröffentlicht: (2025)

LoRA-C: Parameter-Efficient Fine-Tuning of Robust CNN for IoT Devices
von: Ding, Chuntao, et al.
Veröffentlicht: (2024)

Federated LoRA with Sparse Communication
von: Kuo, Kevin, et al.
Veröffentlicht: (2024)

LoRAFusion: Efficient LoRA Fine-Tuning for LLMs
von: Zhu, Zhanda, et al.
Veröffentlicht: (2025)

CaraServe: CPU-Assisted and Rank-Aware LoRA Serving for Generative LLM Inference
von: Li, Suyi, et al.
Veröffentlicht: (2024)

FDLoRA: Personalized Federated Learning of Large Language Model via Dual LoRA Tuning
von: QI, Jiaxing, et al.
Veröffentlicht: (2024)

Stabilizing Decentralized Federated Fine-Tuning via Topology-Aware Alternating LoRA
von: Wang, Xiaoyu, et al.
Veröffentlicht: (2026)

Co-LoRA: Collaborative Model Personalization on Heterogeneous Multi-Modal Clients
von: Seo, Minhyuk, et al.
Veröffentlicht: (2025)

Unlocking the Edge deployment and ondevice acceleration of multi-LoRA enabled one-for-all foundational LLM
von: Kodavanti, Sravanth, et al.
Veröffentlicht: (2026)

ForkKV: Scaling Multi-LoRA Agent Serving via Copy-on-Write Disaggregated KV Cache
von: Wang, Shao, et al.
Veröffentlicht: (2026)

Heterogeneous LoRA for Federated Fine-tuning of On-Device Foundation Models
von: Cho, Yae Jee, et al.
Veröffentlicht: (2024)

FedQuad: Adaptive Layer-wise LoRA Deployment and Activation Quantization for Federated Fine-Tuning
von: Li, Rukuo, et al.
Veröffentlicht: (2025)

LoRA-based Parameter-Efficient LLMs for Continuous Learning in Edge-based Malware Detection
von: Rondanini, Christian, et al.
Veröffentlicht: (2026)

Improving LoRA in Privacy-preserving Federated Learning
von: Sun, Youbang, et al.
Veröffentlicht: (2024)

FedRPCA: Enhancing Federated LoRA Aggregation Using Robust PCA
von: Jhunjhunwala, Divyansh, et al.
Veröffentlicht: (2025)

Robust Federated Finetuning of Foundation Models via Alternating Minimization of LoRA
von: Chen, Shuangyi, et al.
Veröffentlicht: (2024)

AutoRank: MCDA Based Rank Personalization for LoRA-Enabled Distributed Learning
von: Chen, Shuaijun, et al.
Veröffentlicht: (2024)

RBLA: Rank-Based-LoRA-Aggregation for Fine-tuning Heterogeneous Models in FLaaS
von: Chen, Shuaijun, et al.
Veröffentlicht: (2024)

A Pipelined Collaborative Speculative Decoding Framework for Efficient Edge-Cloud LLM Inference
von: Zhang, Yida, et al.
Veröffentlicht: (2026)

ML-ECS: A Collaborative Multimodal Learning Framework for Edge-Cloud Synergies
von: Liu, Yuze, et al.
Veröffentlicht: (2026)

JORA: JAX Tensor-Parallel LoRA Library for Retrieval Augmented Fine-Tuning
von: Tahir, Anique, et al.
Veröffentlicht: (2024)

Fed-pilot: Optimizing LoRA Allocation for Efficient Federated Fine-Tuning with Heterogeneous Clients
von: Zhang, Zikai, et al.
Veröffentlicht: (2024)

Harmonic Decomposition in Data Sketches
von: Wang, Dingyu
Veröffentlicht: (2024)

Compress then Serve: Serving Thousands of LoRA Adapters with Little Overhead
von: Brüel-Gabrielsson, Rickard, et al.
Veröffentlicht: (2024)

EdgeLoRA: An Efficient Multi-Tenant LLM Serving System on Edge Devices
von: Shen, Zheyu, et al.
Veröffentlicht: (2025)

Serving Heterogeneous LoRA Adapters in Distributed LLM Inference Systems
von: Jaiswal, Shashwat, et al.
Veröffentlicht: (2025)

Fed-HeLLo: Efficient Federated Foundation Model Fine-Tuning with Heterogeneous LoRA Allocation
von: Zhang, Zikai, et al.
Veröffentlicht: (2025)

Efficient Multi-Adapter LLM Serving via Cross-Model KV-Cache Reuse with Activated LoRA
von: Li, Allison, et al.
Veröffentlicht: (2025)

EAT: QoS-Aware Edge-Collaborative AIGC Task Scheduling via Attention-Guided Diffusion Reinforcement Learning
von: Xu, Zhifei, et al.
Veröffentlicht: (2025)

HAFLQ: Heterogeneous Adaptive Federated LoRA Fine-tuned LLM with Quantization
von: Su, Yang, et al.
Veröffentlicht: (2024)

Propius: A Platform for Collaborative Machine Learning across the Edge and the Cloud
von: Ding, Eric
Veröffentlicht: (2025)

FedEx-LoRA: Exact Aggregation for Federated and Efficient Fine-Tuning of Foundation Models
von: Singhal, Raghav, et al.
Veröffentlicht: (2024)

Collaborative Inference and Learning between Edge SLMs and Cloud LLMs: A Survey of Algorithms, Execution, and Open Challenges
von: Li, Senyao, et al.
Veröffentlicht: (2025)