Saved in:
| Main Authors: | Wang, Zhe, Wang, Zhen, Wu, Jianwen, Xiao, Wangzhong, Chen, Yidong, Feng, Zihua, Yang, Dian, Liu, Hongchen, Liang, Bo, Fu, Jiaojiao |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2409.03218 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Towards Real-Time Neural Volumetric Rendering on Mobile Devices: A Measurement Study
by: Wang, Zhe, et al.
Published: (2024)
by: Wang, Zhe, et al.
Published: (2024)
NeRFlex: Resource-aware Real-time High-quality Rendering of Complex Scenes on Mobile Devices
by: Wang, Zhe, et al.
Published: (2025)
by: Wang, Zhe, et al.
Published: (2025)
Non-Asymptotic Performance Analysis of DOA Estimation Based on Real-Valued Root-MUSIC
by: Liu, Junyang, et al.
Published: (2025)
by: Liu, Junyang, et al.
Published: (2025)
Profiling Apple Silicon Performance for ML Training
by: Feng, Dahua, et al.
Published: (2025)
by: Feng, Dahua, et al.
Published: (2025)
Pinching-Antenna Systems For Indoor Immersive Communications: A 3D-Modeling Based Performance Analysis
by: Wang, Yulei, et al.
Published: (2025)
by: Wang, Yulei, et al.
Published: (2025)
Characterize LSM-tree Compaction Performance via On-Device LLM Inference
by: Ding, Jiabiao, et al.
Published: (2026)
by: Ding, Jiabiao, et al.
Published: (2026)
Performance Characterization and Optimizations of Traditional ML Applications
by: Kumar, Harsh, et al.
Published: (2024)
by: Kumar, Harsh, et al.
Published: (2024)
Conformer-Based Speech Recognition On Extreme Edge-Computing Devices
by: Xu, Mingbin, et al.
Published: (2023)
by: Xu, Mingbin, et al.
Published: (2023)
Resource Allocation Influence on Application Performance in Sliced Testbeds
by: Moreira, Rodrigo, et al.
Published: (2024)
by: Moreira, Rodrigo, et al.
Published: (2024)
A Continuous Benchmarking Infrastructure for High-Performance Computing Applications
by: Alt, Christoph, et al.
Published: (2024)
by: Alt, Christoph, et al.
Published: (2024)
Waltz: Temperature-Aware Cooperative Compression for High-Performance Compression-Based CSDs
by: Yu, Dingcui, et al.
Published: (2025)
by: Yu, Dingcui, et al.
Published: (2025)
Impact of AI-Triage on Radiologist Report Turnaround Time: Real-World Time-Savings and Insights from Model Predictions
by: Thompson, Yee Lam Elim, et al.
Published: (2025)
by: Thompson, Yee Lam Elim, et al.
Published: (2025)
Accurate Performance Modeling And Uncertainty Analysis of Lossy Compression in Scientific Applications
by: Liu, Youyuan, et al.
Published: (2024)
by: Liu, Youyuan, et al.
Published: (2024)
Redundant Array Computation Elimination
by: Wang, Zixuan, et al.
Published: (2025)
by: Wang, Zixuan, et al.
Published: (2025)
Ecoscape: Fault Tolerance Benchmark for Adaptive Remediation Strategies in Real-Time Edge ML
by: Reiter, Hendrik, et al.
Published: (2025)
by: Reiter, Hendrik, et al.
Published: (2025)
Scaler: Efficient and Effective Cross Flow Analysis
by: Steven, et al.
Published: (2024)
by: Steven, et al.
Published: (2024)
AI Application Benchmarking: Power-Aware Performance Analysis for Vision and Language Models
by: Mayr, Martin, et al.
Published: (2026)
by: Mayr, Martin, et al.
Published: (2026)
Evaluating the Performance of the DeepSeek Model in Confidential Computing Environment
by: Dong, Ben, et al.
Published: (2025)
by: Dong, Ben, et al.
Published: (2025)
SysOM-AI: Continuous Cross-Layer Performance Diagnosis for Production AI Training
by: Zheng, Yusheng, et al.
Published: (2026)
by: Zheng, Yusheng, et al.
Published: (2026)
Are We There Yet? A Measurement Study of Efficiency for LLM Applications on Mobile Devices
by: Yan, Xiao, et al.
Published: (2025)
by: Yan, Xiao, et al.
Published: (2025)
Unleashing the Power of Preemptive Priority-based Scheduling for Real-Time GPU Tasks
by: Wang, Yidi, et al.
Published: (2024)
by: Wang, Yidi, et al.
Published: (2024)
Systematic Performance Evaluation Framework for LEO Mega-Constellation Satellite Networks
by: Wang, Yu, et al.
Published: (2024)
by: Wang, Yu, et al.
Published: (2024)
OSCAR-P and aMLLibrary: Profiling and Predicting the Performance of FaaS-based Applications in Computing Continua
by: Sala, Roberto, et al.
Published: (2024)
by: Sala, Roberto, et al.
Published: (2024)
lm-Meter: Unveiling Runtime Inference Latency for On-Device Language Models
by: Wang, Haoxin, et al.
Published: (2025)
by: Wang, Haoxin, et al.
Published: (2025)
A Microbenchmark Framework for Performance Evaluation of OpenMP Target Offloading
by: Atif, Mohammad, et al.
Published: (2025)
by: Atif, Mohammad, et al.
Published: (2025)
An Upper Bound on the M/M/k Queue With Deterministic Setup Times
by: Williams, Jalani, et al.
Published: (2025)
by: Williams, Jalani, et al.
Published: (2025)
Are We Scaling the Right Thing? A System Perspective on Test-Time Scaling
by: Zhao, Youpeng, et al.
Published: (2025)
by: Zhao, Youpeng, et al.
Published: (2025)
Spatiotemporal Non-Uniformity-Aware Online Task Scheduling in Collaborative Edge Computing for Industrial Internet of Things
by: Li, Yang, et al.
Published: (2025)
by: Li, Yang, et al.
Published: (2025)
Energy Efficiency Analysis of Active RIS-enhanced Wireless Network under Power-Sum Constraint
by: Xin, Jingdie, et al.
Published: (2025)
by: Xin, Jingdie, et al.
Published: (2025)
HD-MoE: Hybrid and Dynamic Parallelism for Mixture-of-Expert LLMs with 3D Near-Memory Processing
by: Huang, Haochen, et al.
Published: (2025)
by: Huang, Haochen, et al.
Published: (2025)
HPC Application Parameter Autotuning on Edge Devices: A Bandit Learning Approach
by: Hossain, Abrar, et al.
Published: (2025)
by: Hossain, Abrar, et al.
Published: (2025)
gigiProfiler: Diagnosing Performance Issues by Uncovering Application Resource Bottlenecks
by: Hu, Yigong, et al.
Published: (2025)
by: Hu, Yigong, et al.
Published: (2025)
A Structure-Aware Framework for Learning Device Placements on Computation Graphs
by: Duan, Shukai, et al.
Published: (2024)
by: Duan, Shukai, et al.
Published: (2024)
Affine Frequency Division Multiplexing Over Wideband Doubly-Dispersive Channels With Time-Scaling Effects
by: Li, Xiangxiang, et al.
Published: (2025)
by: Li, Xiangxiang, et al.
Published: (2025)
Cloud Computing Energy Consumption Prediction Based on Kernel Extreme Learning Machine Algorithm Improved by Vector Weighted Average Algorithm
by: Wang, Yuqing, et al.
Published: (2025)
by: Wang, Yuqing, et al.
Published: (2025)
Polymorph: Energy-Efficient Multi-Label Classification for Video Streams on Embedded Devices
by: Ghafouri, Saeid, et al.
Published: (2025)
by: Ghafouri, Saeid, et al.
Published: (2025)
FLEXIS: FLEXible Frequent Subgraph Mining using Maximal Independent Sets
by: Sharma, Akshit, et al.
Published: (2024)
by: Sharma, Akshit, et al.
Published: (2024)
Rethinking Temporal Models for TinyML: LSTM versus 1D-CNN in Resource-Constrained Devices
by: Saha, Bidyut, et al.
Published: (2026)
by: Saha, Bidyut, et al.
Published: (2026)
SparseX: Efficient Segment-Level KV Cache Sharing for Interleaved LLM Serving
by: Zhang, Quqing, et al.
Published: (2026)
by: Zhang, Quqing, et al.
Published: (2026)
ADS Performance Revisited
by: Weber, Alexander, et al.
Published: (2024)
by: Weber, Alexander, et al.
Published: (2024)
Similar Items
-
Towards Real-Time Neural Volumetric Rendering on Mobile Devices: A Measurement Study
by: Wang, Zhe, et al.
Published: (2024) -
NeRFlex: Resource-aware Real-time High-quality Rendering of Complex Scenes on Mobile Devices
by: Wang, Zhe, et al.
Published: (2025) -
Non-Asymptotic Performance Analysis of DOA Estimation Based on Real-Valued Root-MUSIC
by: Liu, Junyang, et al.
Published: (2025) -
Profiling Apple Silicon Performance for ML Training
by: Feng, Dahua, et al.
Published: (2025) -
Pinching-Antenna Systems For Indoor Immersive Communications: A 3D-Modeling Based Performance Analysis
by: Wang, Yulei, et al.
Published: (2025)