Saved in:
| Main Authors: | Xiao, Jie, Huang, Qianyi, Chen, Xu, Tian, Chen |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2410.03613 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
SpecMemo: Speculative Decoding is in Your Pocket
by: Yildirim, Selin, et al.
Published: (2025)
by: Yildirim, Selin, et al.
Published: (2025)
RoboPocket: Improve Robot Policies Instantly with Your Phone
by: Fang, Junjie, et al.
Published: (2026)
by: Fang, Junjie, et al.
Published: (2026)
PocketLLM: Ultimate Compression of Large Language Models via Meta Networks
by: Tian, Ye, et al.
Published: (2025)
by: Tian, Ye, et al.
Published: (2025)
Practical Performative Policy Learning with Strategic Agents
by: Chen, Qianyi, et al.
Published: (2024)
by: Chen, Qianyi, et al.
Published: (2024)
EdgeMoE: Empowering Sparse Large Language Models on Mobile Devices
by: Yi, Rongjie, et al.
Published: (2023)
by: Yi, Rongjie, et al.
Published: (2023)
PocketLLM: Enabling On-Device Fine-Tuning for Personalized LLMs
by: Peng, Dan, et al.
Published: (2024)
by: Peng, Dan, et al.
Published: (2024)
Lossless Acceleration of Large Language Model via Adaptive N-gram Parallel Decoding
by: Ou, Jie, et al.
Published: (2024)
by: Ou, Jie, et al.
Published: (2024)
Mobile Edge Intelligence for Large Language Models: A Contemporary Survey
by: Qu, Guanqiao, et al.
Published: (2024)
by: Qu, Guanqiao, et al.
Published: (2024)
Colorful Pinball: Density-Weighted Quantile Regression for Conditional Guarantee of Conformal Prediction
by: Chen, Qianyi, et al.
Published: (2025)
by: Chen, Qianyi, et al.
Published: (2025)
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases
by: Liu, Zechun, et al.
Published: (2024)
by: Liu, Zechun, et al.
Published: (2024)
Teaching Your Models to Understand Code via Focal Preference Alignment
by: Wu, Jie, et al.
Published: (2025)
by: Wu, Jie, et al.
Published: (2025)
MNN-LLM: A Generic Inference Engine for Fast Large Language Model Deployment on Mobile Devices
by: Wang, Zhaode, et al.
Published: (2025)
by: Wang, Zhaode, et al.
Published: (2025)
Online Resource Allocation for Edge Intelligence with Colocated Model Retraining and Inference
by: Cai, Huaiguang, et al.
Published: (2024)
by: Cai, Huaiguang, et al.
Published: (2024)
Can Large Language Models Help Experimental Design for Causal Discovery?
by: Li, Junyi, et al.
Published: (2025)
by: Li, Junyi, et al.
Published: (2025)
PLMM: Personal Large Language Models on Mobile Devices
by: Gong, Yuanhao
Published: (2023)
by: Gong, Yuanhao
Published: (2023)
Functional-Group-Based Diffusion for Pocket-Specific Molecule Generation and Elaboration
by: Lin, Haitao, et al.
Published: (2023)
by: Lin, Haitao, et al.
Published: (2023)
Efficient Temporal Tokenization for Mobility Prediction with Large Language Models
by: He, Haoyu, et al.
Published: (2025)
by: He, Haoyu, et al.
Published: (2025)
Preserving Diversity in Supervised Fine-Tuning of Large Language Models
by: Li, Ziniu, et al.
Published: (2024)
by: Li, Ziniu, et al.
Published: (2024)
PocketSR: The Super-Resolution Expert in Your Pocket Mobiles
by: Sun, Haoze, et al.
Published: (2025)
by: Sun, Haoze, et al.
Published: (2025)
Efficient Deployment of Vision-Language Models on Mobile Devices: A Case Study on OnePlus 13R
by: Guerrero, Pablo Robin, et al.
Published: (2025)
by: Guerrero, Pablo Robin, et al.
Published: (2025)
LinVT: Empower Your Image-level Large Language Model to Understand Videos
by: Gao, Lishuai, et al.
Published: (2024)
by: Gao, Lishuai, et al.
Published: (2024)
Robust and Scalable Model Editing for Large Language Models
by: Chen, Yingfa, et al.
Published: (2024)
by: Chen, Yingfa, et al.
Published: (2024)
Pocket2Mol: Efficient Molecular Sampling Based on 3D Protein Pockets
by: Peng, Xingang, et al.
Published: (2022)
by: Peng, Xingang, et al.
Published: (2022)
Multiverse: Your Language Models Secretly Decide How to Parallelize and Merge Generation
by: Yang, Xinyu, et al.
Published: (2025)
by: Yang, Xinyu, et al.
Published: (2025)
DECO: Sparse Mixture-of-Experts with Dense-Comparable Performance on End-Side Devices
by: Song, Chenyang, et al.
Published: (2026)
by: Song, Chenyang, et al.
Published: (2026)
Cut Your Losses in Large-Vocabulary Language Models
by: Wijmans, Erik, et al.
Published: (2024)
by: Wijmans, Erik, et al.
Published: (2024)
TernaryLLM: Ternarized Large Language Model
by: Chen, Tianqi, et al.
Published: (2024)
by: Chen, Tianqi, et al.
Published: (2024)
A Novel Hat-Shaped Device-Cloud Collaborative Inference Framework for Large Language Models
by: Xie, Zuan, et al.
Published: (2025)
by: Xie, Zuan, et al.
Published: (2025)
GUNDAM: Aligning Large Language Models with Graph Understanding
by: Ouyang, Sheng, et al.
Published: (2024)
by: Ouyang, Sheng, et al.
Published: (2024)
Off-Policy Value-Based Reinforcement Learning for Large Language Models
by: Wang, Peng-Yuan, et al.
Published: (2026)
by: Wang, Peng-Yuan, et al.
Published: (2026)
FedMCP: Parameter-Efficient Federated Learning with Model-Contrastive Personalization
by: Zhao, Qianyi, et al.
Published: (2024)
by: Zhao, Qianyi, et al.
Published: (2024)
ECGadv: Generating Adversarial Electrocardiogram to Misguide Arrhythmia Classification System
by: Chen, Huangxun, et al.
Published: (2019)
by: Chen, Huangxun, et al.
Published: (2019)
Large Language Models for Real-World IoT Device Identification
by: Mahmood, Rameen, et al.
Published: (2025)
by: Mahmood, Rameen, et al.
Published: (2025)
MobileMoE: Scaling On-Device Mixture of Experts
by: Chen, Yanbei, et al.
Published: (2026)
by: Chen, Yanbei, et al.
Published: (2026)
Towards a Mechanistic Understanding of Propositional Logical Reasoning in Large Language Models
by: Chen, Danchun, et al.
Published: (2026)
by: Chen, Danchun, et al.
Published: (2026)
P/D-Device: Disaggregated Large Language Model between Cloud and Devices
by: Jin, Yibo, et al.
Published: (2025)
by: Jin, Yibo, et al.
Published: (2025)
Unlocking Emergent Modularity in Large Language Models
by: Qiu, Zihan, et al.
Published: (2023)
by: Qiu, Zihan, et al.
Published: (2023)
Understanding Why Large Language Models Can Be Ineffective in Time Series Analysis: The Impact of Modality Alignment
by: Zheng, Liangwei Nathan, et al.
Published: (2024)
by: Zheng, Liangwei Nathan, et al.
Published: (2024)
Collaborative Performance Prediction for Large Language Models
by: Zhang, Qiyuan, et al.
Published: (2024)
by: Zhang, Qiyuan, et al.
Published: (2024)
Your Finetuned Large Language Model is Already a Powerful Out-of-distribution Detector
by: Zhang, Andi, et al.
Published: (2024)
by: Zhang, Andi, et al.
Published: (2024)
Similar Items
-
SpecMemo: Speculative Decoding is in Your Pocket
by: Yildirim, Selin, et al.
Published: (2025) -
RoboPocket: Improve Robot Policies Instantly with Your Phone
by: Fang, Junjie, et al.
Published: (2026) -
PocketLLM: Ultimate Compression of Large Language Models via Meta Networks
by: Tian, Ye, et al.
Published: (2025) -
Practical Performative Policy Learning with Strategic Agents
by: Chen, Qianyi, et al.
Published: (2024) -
EdgeMoE: Empowering Sparse Large Language Models on Mobile Devices
by: Yi, Rongjie, et al.
Published: (2023)