:: Library Catalog

Buchumschlag

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Zhao, Qidong, Wu, Hao, Hao, Yuming, Ye, Zilingfeng, Li, Jiajia, Liu, Xu, Zhou, Keren
Format:	Preprint
Veröffentlicht:	2024
Schlagworte:	Performance Artificial Intelligence
Online-Zugang:	https://arxiv.org/abs/2411.02797
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Ähnliche Einträge

Interpreting Performance Profiles with Deep Learning
von: Liu, Zhuoran
Veröffentlicht: (2025)

PASTA: A Modular Program Analysis Tool Framework for Accelerators
von: Lin, Mao, et al.
Veröffentlicht: (2026)

Enabling Heterogeneous Performance Analysis for Scientific Workloads
von: Graczyk, Maksymilian, et al.
Veröffentlicht: (2025)

DEER: Deep Runahead for Instruction Prefetching on Modern Mobile Workloads
von: Vahdatniya, Parmida, et al.
Veröffentlicht: (2025)

PerfSeer: An Efficient and Accurate Deep Learning Models Performance Predictor
von: Zhao, Xinlong, et al.
Veröffentlicht: (2025)

Scaler: Efficient and Effective Cross Flow Analysis
von: Steven, et al.
Veröffentlicht: (2024)

Obfuscation as an Effective Signal for Prioritizing Cross-Chain Smart Contract Audits: Large-Scale Measurement and Risk Profiling
von: Zhao, Yao, et al.
Veröffentlicht: (2026)

Understanding the Performance Horizon of the Latest ML Workloads with NonGEMM Workloads
von: Karami, Rachid, et al.
Veröffentlicht: (2024)

CUTHERMO: Understanding GPU Memory Inefficiencies with Heat Map Profiling
von: Zhao, Yanbo, et al.
Veröffentlicht: (2025)

Performance of Genetic Algorithms in the Context of Software Model Refactoring
von: Cortellessa, Vittorio, et al.
Veröffentlicht: (2023)

Chapter Cross-Genre Musicking in Individual and Collaborative Group Contexts: Lived Experience and Musical Identity
von: Ruth Herbert, Asha Parkinson
Veröffentlicht: (2026)

Chapter Cross-Genre Musicking in Individual and Collaborative Group Contexts: Lived Experience and Musical Identity
von: Ruth Herbert, Asha Parkinson
Veröffentlicht: (2026)

Forecasting GPU Performance for Deep Learning Training and Inference
von: Lee, Seonho, et al.
Veröffentlicht: (2024)

Versatile Cross-platform Compilation Toolchain for Schrödinger-style Quantum Circuit Simulation
von: Lu, Yuncheng, et al.
Veröffentlicht: (2025)

Profiling Apple Silicon Performance for ML Training
von: Feng, Dahua, et al.
Veröffentlicht: (2025)

Unikernels vs. Containers: A Runtime-Level Performance Comparison for Resource-Constrained Edge Workloads
von: Dinh-Tuan, Hai
Veröffentlicht: (2025)

oneDNN Graph Compiler: A Hybrid Approach for High-Performance Deep Learning Compilation
von: Li, Jianhui, et al.
Veröffentlicht: (2023)

xMem: A CPU-Based Approach for Accurate Estimation of GPU Memory in Deep Learning Training Workloads
von: Shi, Jiabo, et al.
Veröffentlicht: (2025)

Evaluating the Performance of the DeepSeek Model in Confidential Computing Environment
von: Dong, Ben, et al.
Veröffentlicht: (2025)

Profiling Concurrent Vision Inference Workloads on NVIDIA Jetson -- Extended
von: Chakraborty, Abhinaba, et al.
Veröffentlicht: (2025)

Mosaic: Cross-Modal Clustering for Efficient Video Understanding
von: Wang, Tuowei, et al.
Veröffentlicht: (2026)

SysOM-AI: Continuous Cross-Layer Performance Diagnosis for Production AI Training
von: Zheng, Yusheng, et al.
Veröffentlicht: (2026)

Characterizing Machine Learning Force Fields as Emerging Molecular Dynamics Workloads on Graphics Processing Units
von: De Alwis, Udari, et al.
Veröffentlicht: (2026)

Anatomizing Deep Learning Inference in Web Browsers
von: Wang, Qipeng, et al.
Veröffentlicht: (2024)

H2EAL: Hybrid-Bonding Architecture with Hybrid Sparse Attention for Efficient Long-Context LLM Inference
von: Fu, Zizhuo, et al.
Veröffentlicht: (2025)

Scalable GPU Performance Variability Analysis framework
von: Lahiry, Ankur, et al.
Veröffentlicht: (2025)

Inference performance evaluation for LLMs on edge devices with a novel benchmarking framework and metric
von: Chen, Hao, et al.
Veröffentlicht: (2025)

Dissecting RISC-V Performance: Practical PMU Profiling and Hardware-Agnostic Roofline Analysis on Emerging Platforms
von: Batashev, Alexander
Veröffentlicht: (2025)

Energy Efficiency Analysis of Active RIS-enhanced Wireless Network under Power-Sum Constraint
von: Xin, Jingdie, et al.
Veröffentlicht: (2025)

Greener Deep Reinforcement Learning: Analysis of Energy and Carbon Efficiency Across Atari Benchmarks
von: Gardner, Jason, et al.
Veröffentlicht: (2025)

Minos: Systematically Classifying Performance and Power Characteristics of GPU Workloads on HPC Clusters
von: Jain, Rutwik, et al.
Veröffentlicht: (2026)

Network Calculus Bounds for Time-Sensitive Networks: A Revisit
von: Jiang, Yuming
Veröffentlicht: (2024)

Memory Analysis on the Training Course of DeepSeek Models
von: Zhang, Ping, et al.
Veröffentlicht: (2025)

Tabular and Deep Reinforcement Learning for Gittins Index
von: Dhankhar, Harshit, et al.
Veröffentlicht: (2024)

Putting the Context back into Memory
von: Roberts, David A.
Veröffentlicht: (2025)

QPART: Adaptive Model Quantization and Dynamic Workload Balancing for Accuracy-aware Edge Inference
von: Li, Xiangchen, et al.
Veröffentlicht: (2025)

The Use of Digital Financial Services and Business Performance Satisfaction in the Context of Female Entrepreneurship
von: Fernanda Francielle de Oliveira Malaquias
Veröffentlicht: (2022)

The Price of Interoperability: Exploring Cross-Chain Bridges and Their Economic Consequences
von: Cao, Yiyue, et al.
Veröffentlicht: (2026)

XTC, A Research Platform for Optimizing AI Workload Operators
von: Hugo, Pompougnac, et al.
Veröffentlicht: (2025)

Adaptive Workload Distribution for Accuracy-aware DNN Inference on Collaborative Edge Platforms
von: Taufique, Zain, et al.
Veröffentlicht: (2023)