Saved in:
| Main Authors: | Beltre, Angel, Zaman, Shehtab, Chiu, Kenneth, Pamidighantam, Sudhakar, Qiao, Xingye, Govindaraju, Madhusudhan |
|---|---|
| Format: | Preprint |
| Published: |
2019
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/1906.04286 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Score-based Idempotent Distillation of Diffusion Models
by: Zaman, Shehtab, et al.
Published: (2025)
by: Zaman, Shehtab, et al.
Published: (2025)
Towards an Integrated Performance Framework for Fire Science and Management Workflows
by: Ahmed, H., et al.
Published: (2024)
by: Ahmed, H., et al.
Published: (2024)
Fake Runs, Real Fixes -- Analyzing xPU Performance Through Simulation
by: Zarkadas, Ioannis, et al.
Published: (2025)
by: Zarkadas, Ioannis, et al.
Published: (2025)
Phantora: Maximizing Code Reuse in Simulation-based Machine Learning System Performance Estimation
by: Qin, Jianxing, et al.
Published: (2025)
by: Qin, Jianxing, et al.
Published: (2025)
Plug-and-Play Performance Estimation for LLM Services without Relying on Labeled Data
by: Wang, Can, et al.
Published: (2024)
by: Wang, Can, et al.
Published: (2024)
CPINN-ABPI: Physics-Informed Neural Networks for Accurate Power Estimation in MPSoCs
by: Elshamy, Mohamed R., et al.
Published: (2025)
by: Elshamy, Mohamed R., et al.
Published: (2025)
Application Research On Real-Time Perception Of Device Performance Status
by: Wang, Zhe, et al.
Published: (2024)
by: Wang, Zhe, et al.
Published: (2024)
Towards Computational Performance Engineering for Unsupervised Concept Drift Detection -- Complexities, Benchmarking, Performance Analysis
by: Werner, Elias, et al.
Published: (2023)
by: Werner, Elias, et al.
Published: (2023)
SAfEPaTh: A System-Level Approach for Efficient Power and Thermal Estimation of Convolutional Neural Network Accelerator
by: Chen, Yukai, et al.
Published: (2024)
by: Chen, Yukai, et al.
Published: (2024)
PrETi: Predicting Execution Time in Early Stage with LLVM and Machine Learning
by: Xu, Risheng, et al.
Published: (2025)
by: Xu, Risheng, et al.
Published: (2025)
Feature Optimization for Time Series Forecasting via Novel Randomized Uphill Climbing
by: Van Thanh, Nguyen
Published: (2025)
by: Van Thanh, Nguyen
Published: (2025)
A Kernel-Based Approach for Accurate Steady-State Detection in Performance Time Series
by: Beseda, Martin, et al.
Published: (2025)
by: Beseda, Martin, et al.
Published: (2025)
FlashOmni: A Unified Sparse Attention Engine for Diffusion Transformers
by: Qiao, Liang, et al.
Published: (2025)
by: Qiao, Liang, et al.
Published: (2025)
FALCON: Feedback-driven Adaptive Long/short-term memory reinforced Coding Optimization system
by: Li, Zeyuan, et al.
Published: (2024)
by: Li, Zeyuan, et al.
Published: (2024)
Rapid Augmentations for Time Series (RATS): A High-Performance Library for Time Series Augmentation
by: Skaf, Wadie, et al.
Published: (2026)
by: Skaf, Wadie, et al.
Published: (2026)
PtychoFormer: A Transformer-based Model for Ptychographic Phase Retrieval
by: Nakahata, Ryuma, et al.
Published: (2024)
by: Nakahata, Ryuma, et al.
Published: (2024)
Counting Without Running: Evaluating LLMs' Reasoning About Code Complexity
by: Bolet, Gregory, et al.
Published: (2025)
by: Bolet, Gregory, et al.
Published: (2025)
Toward A Formalized Approach for Spike Sorting Algorithms and Hardware Evaluation
by: Zhang, Tim, et al.
Published: (2022)
by: Zhang, Tim, et al.
Published: (2022)
Learning Performance-Improving Code Edits
by: Shypula, Alexander, et al.
Published: (2023)
by: Shypula, Alexander, et al.
Published: (2023)
Towards Generalized Parameter Tuning in Coherent Ising Machines: A Portfolio-Based Approach
by: Hanyu, Tatsuro, et al.
Published: (2025)
by: Hanyu, Tatsuro, et al.
Published: (2025)
Energy-Efficient Transformer Inference: Optimization Strategies for Time Series Classification
by: Kermani, Arshia, et al.
Published: (2025)
by: Kermani, Arshia, et al.
Published: (2025)
A Review of the Long Horizon Forecasting Problem in Time Series Analysis
by: Krupakar, Hans, et al.
Published: (2025)
by: Krupakar, Hans, et al.
Published: (2025)
TESSERACT: Eliminating Experimental Bias in Malware Classification across Space and Time (Extended Version)
by: Kan, Zeliang, et al.
Published: (2024)
by: Kan, Zeliang, et al.
Published: (2024)
Toward Smart Scheduling in Tapis
by: Stubbs, Joe, et al.
Published: (2024)
by: Stubbs, Joe, et al.
Published: (2024)
VecTrans: Enhancing Compiler Auto-Vectorization through LLM-Assisted Code Transformations
by: Zheng, Zhongchun, et al.
Published: (2025)
by: Zheng, Zhongchun, et al.
Published: (2025)
Regression Language Models for Code
by: Akhauri, Yash, et al.
Published: (2025)
by: Akhauri, Yash, et al.
Published: (2025)
Time-Efficient Hybrid Hyperparameter Tuning Approach for Cardiovascular Disease Classification
by: Pathak, Abhay Kumar, et al.
Published: (2024)
by: Pathak, Abhay Kumar, et al.
Published: (2024)
Identifying Best Practice Melting Patterns in Induction Furnaces: A Data-Driven Approach Using Time Series KMeans Clustering and Multi-Criteria Decision Making
by: Howard, Daniel Anthony, et al.
Published: (2024)
by: Howard, Daniel Anthony, et al.
Published: (2024)
Towards a Higher Roofline for Matrix-Vector Multiplication in Matrix-Free HOSFEM
by: Cao, Zijian, et al.
Published: (2025)
by: Cao, Zijian, et al.
Published: (2025)
Towards A Flexible Accuracy-Oriented Deep Learning Module Inference Latency Prediction Framework for Adaptive Optimization Algorithms
by: Shen, Jingran, et al.
Published: (2023)
by: Shen, Jingran, et al.
Published: (2023)
ALERT: Accurate Learning for Energy and Timeliness
by: Wan, Chengcheng, et al.
Published: (2019)
by: Wan, Chengcheng, et al.
Published: (2019)
Opening the Black Box: Performance Estimation during Code Generation for GPUs
by: Ernst, Dominik, et al.
Published: (2021)
by: Ernst, Dominik, et al.
Published: (2021)
A Scalable k-Medoids Clustering via Whale Optimization Algorithm
by: Chenan, Huang, et al.
Published: (2024)
by: Chenan, Huang, et al.
Published: (2024)
GPU-Accelerated INT8 Quantization for KV Cache Compression in Large Language Models
by: Taneja, Maanas, et al.
Published: (2026)
by: Taneja, Maanas, et al.
Published: (2026)
DistZO2: High-Throughput and Memory-Efficient Zeroth-Order Fine-tuning LLMs with Distributed Parallel Computing
by: Wang, Liangyu, et al.
Published: (2025)
by: Wang, Liangyu, et al.
Published: (2025)
PARD: Accelerating LLM Inference with Low-Cost PARallel Draft Model Adaptation
by: An, Zihao, et al.
Published: (2025)
by: An, Zihao, et al.
Published: (2025)
Flashlight: PyTorch Compiler Extensions to Accelerate Attention Variants
by: You, Bozhi, et al.
Published: (2025)
by: You, Bozhi, et al.
Published: (2025)
MarginGate: Sparse Margin-Triggered Verification for Batch-Invariant LLM Inference
by: Chu, Kexin, et al.
Published: (2026)
by: Chu, Kexin, et al.
Published: (2026)
Parallel Implementations Assessment of a Spatial-Spectral Classifier for Hyperspectral Clinical Applications
by: Lazcano, Raquel, et al.
Published: (2024)
by: Lazcano, Raquel, et al.
Published: (2024)
A Structure-Aware Framework for Learning Device Placements on Computation Graphs
by: Duan, Shukai, et al.
Published: (2024)
by: Duan, Shukai, et al.
Published: (2024)
Similar Items
-
Score-based Idempotent Distillation of Diffusion Models
by: Zaman, Shehtab, et al.
Published: (2025) -
Towards an Integrated Performance Framework for Fire Science and Management Workflows
by: Ahmed, H., et al.
Published: (2024) -
Fake Runs, Real Fixes -- Analyzing xPU Performance Through Simulation
by: Zarkadas, Ioannis, et al.
Published: (2025) -
Phantora: Maximizing Code Reuse in Simulation-based Machine Learning System Performance Estimation
by: Qin, Jianxing, et al.
Published: (2025) -
Plug-and-Play Performance Estimation for LLM Services without Relying on Labeled Data
by: Wang, Can, et al.
Published: (2024)