Guardado en:
| Autores principales: | Zou, Xingze, Wang, Jing, Zheng, Yuhua, Chen, Xueyi, Bai, Haolei, Kong, Lingcheng, Abu-Bakar, Syed A. R., Wang, Zhaode, Lv, Chengfei, Hu, Haoji, Wang, Huan |
|---|---|
| Formato: | Preprint |
| Publicado: |
2026
|
| Materias: | |
| Acceso en línea: | https://arxiv.org/abs/2603.11935 |
| Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
Ejemplares similares
DICE: Diffusion Large Language Models Excel at Generating CUDA Kernels
por: Bai, Haolei, et al.
Publicado: (2026)
por: Bai, Haolei, et al.
Publicado: (2026)
ConCuR: Conciseness Makes State-of-the-Art Kernel Generation
por: Kong, Lingcheng, et al.
Publicado: (2025)
por: Kong, Lingcheng, et al.
Publicado: (2025)
MNN-LLM: A Generic Inference Engine for Fast Large Language Model Deployment on Mobile Devices
por: Wang, Zhaode, et al.
Publicado: (2025)
por: Wang, Zhaode, et al.
Publicado: (2025)
KernelBench: Can LLMs Write Efficient GPU Kernels?
por: Ouyang, Anne, et al.
Publicado: (2025)
por: Ouyang, Anne, et al.
Publicado: (2025)
MNN-AECS: Energy Optimization for LLM Decoding on Mobile Devices via Adaptive Core Selection
por: Huang, Zhengxiang, et al.
Publicado: (2025)
por: Huang, Zhengxiang, et al.
Publicado: (2025)
RecGPT-Mobile: On-Device Large Language Models for User Intent Understanding in Taobao Feed Recommendation
por: Zhang, Bin, et al.
Publicado: (2026)
por: Zhang, Bin, et al.
Publicado: (2026)
CUDA-LLM: LLMs Can Write Efficient CUDA Kernels
por: Chen, Wentao, et al.
Publicado: (2025)
por: Chen, Wentao, et al.
Publicado: (2025)
D3S2: Diffusion-Guided Dataset Distillation for Semantic Segmentation
por: Zheng, Wenjie, et al.
Publicado: (2026)
por: Zheng, Wenjie, et al.
Publicado: (2026)
AccKV: Towards Efficient Audio-Video LLMs Inference via Adaptive-Focusing and Cross-Calibration KV Cache Optimization
por: Jiang, Zhonghua, et al.
Publicado: (2025)
por: Jiang, Zhonghua, et al.
Publicado: (2025)
FlowMM: Cross-Modal Information Flow Guided KV Cache Merging for Efficient Multimodal Context Inference
por: Li, Kunxi, et al.
Publicado: (2025)
por: Li, Kunxi, et al.
Publicado: (2025)
MadaKV: Adaptive Modality-Perception KV Cache Eviction for Efficient Multimodal Long-Context Inference
por: Li, Kunxi, et al.
Publicado: (2025)
por: Li, Kunxi, et al.
Publicado: (2025)
Towards Real-time Video Compressive Sensing on Mobile Devices
por: Cao, Miao, et al.
Publicado: (2024)
por: Cao, Miao, et al.
Publicado: (2024)
KernelBenchX: A Comprehensive Benchmark for Evaluating LLM-Generated GPU Kernels
por: Wang, Han, et al.
Publicado: (2026)
por: Wang, Han, et al.
Publicado: (2026)
SWE-Bench Mobile: Can Large Language Model Agents Develop Industry-Level Mobile Applications?
por: Tian, Muxin, et al.
Publicado: (2026)
por: Tian, Muxin, et al.
Publicado: (2026)
Training-Free Multi-View Extension of IC-Light for Textual Position-Aware Scene Relighting
por: Ye, Jiangnan, et al.
Publicado: (2025)
por: Ye, Jiangnan, et al.
Publicado: (2025)
PureKV: Plug-and-Play KV Cache Optimization with Spatial-Temporal Sparse Attention for Vision-Language Large Models
por: Jiang, Zhonghua, et al.
Publicado: (2025)
por: Jiang, Zhonghua, et al.
Publicado: (2025)
Identity-Preserving Pose-Guided Character Animation via Facial Landmarks Transformation
por: Mu, Lianrui, et al.
Publicado: (2024)
por: Mu, Lianrui, et al.
Publicado: (2024)
No Pixel Left Behind: A Detail-Preserving Architecture for Robust High-Resolution AI-Generated Image Detection
por: Mu, Lianrui, et al.
Publicado: (2025)
por: Mu, Lianrui, et al.
Publicado: (2025)
MobileSafetyBench: Evaluating Safety of Autonomous Agents in Mobile Device Control
por: Lee, Juyong, et al.
Publicado: (2024)
por: Lee, Juyong, et al.
Publicado: (2024)
Nutritional Retention and Sensory Properties of Novel Tropical Vegetable Juice Formulations Prepared via Non-Thermal Processing
por: Ibrahim Abu, Bakar
Publicado: (2026)
por: Ibrahim Abu, Bakar
Publicado: (2026)
HammerBench: Fine-Grained Function-Calling Evaluation in Real Mobile Device Scenarios
por: Wang, Jun, et al.
Publicado: (2024)
por: Wang, Jun, et al.
Publicado: (2024)
LVOmniBench: Pioneering Long Audio-Video Understanding Evaluation for Omnimodal LLMs
por: Tao, Keda, et al.
Publicado: (2026)
por: Tao, Keda, et al.
Publicado: (2026)
MobileAIBench: Benchmarking LLMs and LMMs for On-Device Use Cases
por: Murthy, Rithesh, et al.
Publicado: (2024)
por: Murthy, Rithesh, et al.
Publicado: (2024)
CORE: Reducing UI Exposure in Mobile Agents via Collaboration Between Cloud and Local LLMs
por: Fan, Gucongcong, et al.
Publicado: (2025)
por: Fan, Gucongcong, et al.
Publicado: (2025)
ERC-SVD: Error-Controlled SVD for Large Language Model Compression
por: Bai, Haolei, et al.
Publicado: (2025)
por: Bai, Haolei, et al.
Publicado: (2025)
ProactiveMobile: A Comprehensive Benchmark for Boosting Proactive Intelligence on Mobile Devices
por: Kong, Dezhi, et al.
Publicado: (2026)
por: Kong, Dezhi, et al.
Publicado: (2026)
CoDec: Prefix-Shared Decoding Kernel for LLMs
por: Wang, Zhibin, et al.
Publicado: (2025)
por: Wang, Zhibin, et al.
Publicado: (2025)
MobileAgentBench: An Efficient and User-Friendly Benchmark for Mobile LLM Agents
por: Wang, Luyuan, et al.
Publicado: (2024)
por: Wang, Luyuan, et al.
Publicado: (2024)
Mobile-GS: Real-time Gaussian Splatting for Mobile Devices
por: Du, Xiaobiao, et al.
Publicado: (2026)
por: Du, Xiaobiao, et al.
Publicado: (2026)
MobileH2R: Learning Generalizable Human to Mobile Robot Handover Exclusively from Scalable and Diverse Synthetic Data
por: Wang, Zifan, et al.
Publicado: (2025)
por: Wang, Zifan, et al.
Publicado: (2025)
Agent-Kernel: A MicroKernel Multi-Agent System Framework for Adaptive Social Simulation Powered by LLMs
por: Mao, Yuren, et al.
Publicado: (2025)
por: Mao, Yuren, et al.
Publicado: (2025)
MobileManiBench: Simplifying Model Verification for Mobile Manipulation
por: Wang, Wenbo, et al.
Publicado: (2026)
por: Wang, Wenbo, et al.
Publicado: (2026)
MultiKernelBench: A Multi-Platform Benchmark for Kernel Generation
por: Wen, Zhongzhen, et al.
Publicado: (2025)
por: Wen, Zhongzhen, et al.
Publicado: (2025)
MobileI2V: Fast and High-Resolution Image-to-Video on Mobile Devices
por: Zhang, Shuai, et al.
Publicado: (2025)
por: Zhang, Shuai, et al.
Publicado: (2025)
EdgeDAM: Real-time Object Tracking for Mobile Devices
por: Raza, Syed Muhammad, et al.
Publicado: (2026)
por: Raza, Syed Muhammad, et al.
Publicado: (2026)
Mobile-Bench: An Evaluation Benchmark for LLM-based Mobile Agents
por: Deng, Shihan, et al.
Publicado: (2024)
por: Deng, Shihan, et al.
Publicado: (2024)
MobileViCLIP: An Efficient Video-Text Model for Mobile Devices
por: Yang, Min, et al.
Publicado: (2025)
por: Yang, Min, et al.
Publicado: (2025)
Omniwise: Predicting GPU Kernels Performance with LLMs
por: Wang, Zixian, et al.
Publicado: (2025)
por: Wang, Zixian, et al.
Publicado: (2025)
Turbo-VAED: Fast and Stable Transfer of Video-VAEs to Mobile Devices
por: Zou, Ya, et al.
Publicado: (2025)
por: Zou, Ya, et al.
Publicado: (2025)
Volumetric Mapping with Panoptic Refinement via Kernel Density Estimation for Mobile Robots
por: Nguyen, Khang, et al.
Publicado: (2024)
por: Nguyen, Khang, et al.
Publicado: (2024)
Ejemplares similares
-
DICE: Diffusion Large Language Models Excel at Generating CUDA Kernels
por: Bai, Haolei, et al.
Publicado: (2026) -
ConCuR: Conciseness Makes State-of-the-Art Kernel Generation
por: Kong, Lingcheng, et al.
Publicado: (2025) -
MNN-LLM: A Generic Inference Engine for Fast Large Language Model Deployment on Mobile Devices
por: Wang, Zhaode, et al.
Publicado: (2025) -
KernelBench: Can LLMs Write Efficient GPU Kernels?
por: Ouyang, Anne, et al.
Publicado: (2025) -
MNN-AECS: Energy Optimization for LLM Decoding on Mobile Devices via Adaptive Core Selection
por: Huang, Zhengxiang, et al.
Publicado: (2025)