Library Catalog

Bibliographic Details
Main authors:	Zou, Xingze; Wang, Jing; Zheng, Yuhua; Chen, Xueyi; Bai, Haolei; Kong, Lingcheng; Abu-Bakar, Syed A. R.; Wang, Zhaode; Lv, Chengfei; Hu, Haoji; Wang, Huan
Format:	Preprint
Published:	2026
Subjects:	Machine Learning; Artificial Intelligence
Online access:	https://arxiv.org/abs/2603.11935

Similar items

DICE: Diffusion Large Language Models Excel at Generating CUDA Kernels
by: Bai, Haolei, et al.
Published: (2026)

ConCuR: Conciseness Makes State-of-the-Art Kernel Generation
by: Kong, Lingcheng, et al.
Published: (2025)

MNN-LLM: A Generic Inference Engine for Fast Large Language Model Deployment on Mobile Devices
by: Wang, Zhaode, et al.
Published: (2025)

KernelBench: Can LLMs Write Efficient GPU Kernels?
by: Ouyang, Anne, et al.
Published: (2025)

MNN-AECS: Energy Optimization for LLM Decoding on Mobile Devices via Adaptive Core Selection
by: Huang, Zhengxiang, et al.
Published: (2025)

RecGPT-Mobile: On-Device Large Language Models for User Intent Understanding in Taobao Feed Recommendation
by: Zhang, Bin, et al.
Published: (2026)

CUDA-LLM: LLMs Can Write Efficient CUDA Kernels
by: Chen, Wentao, et al.
Published: (2025)

D3S2: Diffusion-Guided Dataset Distillation for Semantic Segmentation
by: Zheng, Wenjie, et al.
Published: (2026)

AccKV: Towards Efficient Audio-Video LLMs Inference via Adaptive-Focusing and Cross-Calibration KV Cache Optimization
by: Jiang, Zhonghua, et al.
Published: (2025)

FlowMM: Cross-Modal Information Flow Guided KV Cache Merging for Efficient Multimodal Context Inference
by: Li, Kunxi, et al.
Published: (2025)

MadaKV: Adaptive Modality-Perception KV Cache Eviction for Efficient Multimodal Long-Context Inference
by: Li, Kunxi, et al.
Published: (2025)

Towards Real-time Video Compressive Sensing on Mobile Devices
by: Cao, Miao, et al.
Published: (2024)

KernelBenchX: A Comprehensive Benchmark for Evaluating LLM-Generated GPU Kernels
by: Wang, Han, et al.
Published: (2026)

SWE-Bench Mobile: Can Large Language Model Agents Develop Industry-Level Mobile Applications?
by: Tian, Muxin, et al.
Published: (2026)

Training-Free Multi-View Extension of IC-Light for Textual Position-Aware Scene Relighting
by: Ye, Jiangnan, et al.
Published: (2025)

PureKV: Plug-and-Play KV Cache Optimization with Spatial-Temporal Sparse Attention for Vision-Language Large Models
by: Jiang, Zhonghua, et al.
Published: (2025)

Identity-Preserving Pose-Guided Character Animation via Facial Landmarks Transformation
by: Mu, Lianrui, et al.
Published: (2024)

No Pixel Left Behind: A Detail-Preserving Architecture for Robust High-Resolution AI-Generated Image Detection
by: Mu, Lianrui, et al.
Published: (2025)

MobileSafetyBench: Evaluating Safety of Autonomous Agents in Mobile Device Control
by: Lee, Juyong, et al.
Published: (2024)

Nutritional Retention and Sensory Properties of Novel Tropical Vegetable Juice Formulations Prepared via Non-Thermal Processing
by: Ibrahim Abu, Bakar
Published: (2026)

HammerBench: Fine-Grained Function-Calling Evaluation in Real Mobile Device Scenarios
by: Wang, Jun, et al.
Published: (2024)

LVOmniBench: Pioneering Long Audio-Video Understanding Evaluation for Omnimodal LLMs
by: Tao, Keda, et al.
Published: (2026)

MobileAIBench: Benchmarking LLMs and LMMs for On-Device Use Cases
by: Murthy, Rithesh, et al.
Published: (2024)

CORE: Reducing UI Exposure in Mobile Agents via Collaboration Between Cloud and Local LLMs
by: Fan, Gucongcong, et al.
Published: (2025)

ERC-SVD: Error-Controlled SVD for Large Language Model Compression
by: Bai, Haolei, et al.
Published: (2025)

ProactiveMobile: A Comprehensive Benchmark for Boosting Proactive Intelligence on Mobile Devices
by: Kong, Dezhi, et al.
Published: (2026)

CoDec: Prefix-Shared Decoding Kernel for LLMs
by: Wang, Zhibin, et al.
Published: (2025)

MobileAgentBench: An Efficient and User-Friendly Benchmark for Mobile LLM Agents
by: Wang, Luyuan, et al.
Published: (2024)

Mobile-GS: Real-time Gaussian Splatting for Mobile Devices
by: Du, Xiaobiao, et al.
Published: (2026)

MobileH2R: Learning Generalizable Human to Mobile Robot Handover Exclusively from Scalable and Diverse Synthetic Data
by: Wang, Zifan, et al.
Published: (2025)

Agent-Kernel: A MicroKernel Multi-Agent System Framework for Adaptive Social Simulation Powered by LLMs
by: Mao, Yuren, et al.
Published: (2025)

MobileManiBench: Simplifying Model Verification for Mobile Manipulation
by: Wang, Wenbo, et al.
Published: (2026)

MultiKernelBench: A Multi-Platform Benchmark for Kernel Generation
by: Wen, Zhongzhen, et al.
Published: (2025)

MobileI2V: Fast and High-Resolution Image-to-Video on Mobile Devices
by: Zhang, Shuai, et al.
Published: (2025)

EdgeDAM: Real-time Object Tracking for Mobile Devices
by: Raza, Syed Muhammad, et al.
Published: (2026)

Mobile-Bench: An Evaluation Benchmark for LLM-based Mobile Agents
by: Deng, Shihan, et al.
Published: (2024)

MobileViCLIP: An Efficient Video-Text Model for Mobile Devices
by: Yang, Min, et al.
Published: (2025)

Omniwise: Predicting GPU Kernels Performance with LLMs
by: Wang, Zixian, et al.
Published: (2025)

Turbo-VAED: Fast and Stable Transfer of Video-VAEs to Mobile Devices
by: Zou, Ya, et al.
Published: (2025)

Volumetric Mapping with Panoptic Refinement via Kernel Density Estimation for Mobile Robots
by: Nguyen, Khang, et al.
Published: (2024)