| Main Authors: | Xu, Shen; Zhuge, Xiangwen; Xu, Zhe; Hu, Yingkun; Yang, Zheng; Liu, Yunhao |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.05819 |
Similar Items
SpecOffload: Unlocking Latent GPU Capacity for LLM Inference on Resource-Constrained Devices
by: Zhuge, Xiangwen, et al.
Published: (2025)
Efficient ANN-SNN Conversion with Error Compensation Learning
by: Liu, Chang, et al.
Published: (2025)
Communication-Efficient Distributed Learning with Local Immediate Error Compensation
by: Cheng, Yifei, et al.
Published: (2024)
Rethinking Residual Errors in Compensation-based LLM Quantization
by: Li, Shuaiting, et al.
Published: (2026)
Efficient Sensor Fusion for Gesture Recognition on Resource-Constrained Devices
by: Bartoli, Pietro, et al.
Published: (2026)
Memory-Efficient Backpropagation for Fine-Tuning LLMs on Resource-Constrained Mobile Devices
by: Song, Congzheng, et al.
Published: (2025)
Edge Unlearning is Not "on Edge"! An Adaptive Exact Unlearning System on Resource-Constrained Devices
by: Xia, Xiaoyu, et al.
Published: (2024)
NLI:Non-uniform Linear Interpolation Approximation of Nonlinear Operations for Efficient LLMs Inference
by: Yu, Jiangyong, et al.
Published: (2026)
Dynamic Graph Unlearning: A General and Efficient Post-Processing Method via Gradient Transformation
by: Zhang, He, et al.
Published: (2024)
Efficient Deployment of Large Language Models on Resource-constrained Devices
by: Yao, Zhiwei, et al.
Published: (2025)
FL-NAS: Towards Fairness of NAS for Resource Constrained Devices via Large Language Models
by: Qin, Ruiyang, et al.
Published: (2024)
Efficient Zero-Order Federated Finetuning of Language Models for Resource-Constrained Devices
by: Ahmed, Mohamed Aboelenien, et al.
Published: (2025)
Efficient Federated Finetuning of Tiny Transformers with Resource-Constrained Devices
by: Pfeiffer, Kilian, et al.
Published: (2024)
FedMHO: Heterogeneous One-Shot Federated Learning Towards Resource-Constrained Edge Devices
by: Yao, Dezhong, et al.
Published: (2025)
ShadowNPU: System and Algorithm Co-design for NPU-Centric On-Device LLM Inference
by: Yin, Wangsong, et al.
Published: (2025)
Steady-State Error Compensation for Reinforcement Learning with Quadratic Rewards
by: Wang, Liyao, et al.
Published: (2024)
GCoDE: Efficient Device-Edge Co-Inference for GNNs via Architecture-Mapping Co-Search
by: Zhou, Ao, et al.
Published: (2025)
Efficient Resource-Constrained Training of Transformers via Subspace Optimization
by: Nguyen, Le-Trung, et al.
Published: (2025)
Contextual Bandits for Resource-Constrained Devices using Probabilistic Learning
by: Angioli, Marco, et al.
Published: (2026)
Hardware-Aware Neural Feature Extraction for Resource-Constrained Devices
by: Tosini, Francesco, et al.
Published: (2026)
Resource-Efficient Federated Fine-Tuning Large Language Models for Heterogeneous Data
by: Liu, Jun, et al.
Published: (2025)
Optimizing Language Models for Inference Time Objectives using Reinforcement Learning
by: Tang, Yunhao, et al.
Published: (2025)
Bandwidth-Efficient Adaptive Mixture-of-Experts via Low-Rank Compensation
by: Liu, Zhenyu, et al.
Published: (2025)
RWKV-edge: Deeply Compressed RWKV for Resource-Constrained Devices
by: Choe, Wonkyo, et al.
Published: (2024)
A Novel Hat-Shaped Device-Cloud Collaborative Inference Framework for Large Language Models
by: Xie, Zuan, et al.
Published: (2025)
Efficient LLM Jailbreak via Adaptive Dense-to-sparse Constrained Optimization
by: Hu, Kai, et al.
Published: (2024)
AdaptiveFL: Adaptive Heterogeneous Federated Learning for Resource-Constrained AIoT Systems
by: Jia, Chentao, et al.
Published: (2023)
Boosted Trees on a Diet: Compact Models for Resource-Constrained Devices
by: Herrmann, Nina, et al.
Published: (2025)
Memory-Efficient Partitioned DNN Inference on Resource-Constrained Android Crowds
by: Manamperi, Lakshani, et al.
Published: (2026)
Continual Error Correction on Low-Resource Devices
by: Paramonov, Kirill, et al.
Published: (2025)
ESFL: Efficient Split Federated Learning over Resource-Constrained Heterogeneous Wireless Devices
by: Zhu, Guangyu, et al.
Published: (2024)
Adapter-Augmented Bandits for Online Multi-Constrained Multi-Modal Inference Scheduling
by: Zhang, Xianzhi, et al.
Published: (2026)
CoFormer: Collaborating with Heterogeneous Edge Devices for Scalable Transformer Inference
by: Xu, Guanyu, et al.
Published: (2025)
MiLo: Efficient Quantized MoE Inference with Mixture of Low-Rank Compensators
by: Huang, Beichen, et al.
Published: (2025)
Prada: Black-Box LLM Adaptation with Private Data on Resource-Constrained Devices
by: Wang, Ziyao, et al.
Published: (2025)
SacFL: Self-Adaptive Federated Continual Learning for Resource-Constrained End Devices
by: Zhong, Zhengyi, et al.
Published: (2025)
Accelerating Local LLMs on Resource-Constrained Edge Devices via Distributed Prompt Caching
by: Matsutani, Hiroki, et al.
Published: (2026)
Graph Neural Networks Automated Design and Deployment on Device-Edge Co-Inference Systems
by: Zhou, Ao, et al.
Published: (2024)
Communication-Computation Trade-Off in Resource-Constrained Edge Inference
by: Shao, Jiawei, et al.
Published: (2020)
Enabling On-Device Learning via Experience Replay with Efficient Dataset Condensation
by: Xu, Gelei, et al.
Published: (2024)