| Main Authors: | Price, Daniel, Vellaisamy, Prabhu, Gonzalez, Patricia, Michelogiannakis, George, Shen, John P., Wu, Di |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.04847 |
Similar Items
TaxBreak: Unmasking the Hidden Costs of LLM Inference Through Overhead Decomposition
by: Vellaisamy, Prabhu, et al.
Published: (2026)
Characterizing and Optimizing LLM Inference Workloads on CPU-GPU Coupled Architectures
by: Vellaisamy, Prabhu, et al.
Published: (2025)
Mugi: Value Level Parallelism For Efficient LLMs
by: Price, Daniel, et al.
Published: (2026)
GraphMini: Accelerating Graph Pattern Matching Using Auxiliary Graphs
by: Liu, Juelin, et al.
Published: (2024)
Robust Recursive Query Parallelism in Graph Database Management Systems
by: Chakraborty, Anurag, et al.
Published: (2025)
One-Hop Sub-Query Result Caches for Graph Database Systems
by: Nguyen, Hieu, et al.
Published: (2024)
A Modular Graph-Native Query Optimization Framework
by: Lyu, Bingqing, et al.
Published: (2024)
Performance Characterization of AutoNUMA Memory Tiering on Graph Analytics
by: Moura, Diego, et al.
Published: (2022)
COMPASS: A Unified Decision-Intelligence System for Navigating Performance Trade-off in HPC
by: Lahiry, Ankur, et al.
Published: (2026)
Optimizing Cloud-native Services with SAGA: A Service Affinity Graph-based Approach
by: Dinh-Tuan, Hai, et al.
Published: (2025)
Graph-Based Product Form
by: Comte, Céline, et al.
Published: (2025)
LightningSimV2: Faster and Scalable Simulation for High-Level Synthesis via Graph Compilation and Optimization
by: Sarkar, Rishov, et al.
Published: (2024)
Are We Scaling the Right Thing? A System Perspective on Test-Time Scaling
by: Zhao, Youpeng, et al.
Published: (2025)
A Structure-Aware Framework for Learning Device Placements on Computation Graphs
by: Duan, Shukai, et al.
Published: (2024)
Inspection of I/O Operations from System Call Traces using Directly-Follows-Graph
by: Sankaran, Aravind, et al.
Published: (2024)
GCL-Sampler: Discovering Kernel Similarity for Sampled GPU Simulation via Graph Contrastive Learning
by: Wang, Jiaqi, et al.
Published: (2026)
Shortest-Path FFT: Optimal SIMD Instruction Scheduling via Graph Search
by: Bergach, Mohamed Amine
Published: (2026)
Analysis of Stable Vertex Values: Fast Query Evaluation Over An Evolving Graph
by: Afarin, Mahbod, et al.
Published: (2025)
XRFlux: Virtual Reality Benchmark for Edge Caching Systems
by: Alfares, Nader, et al.
Published: (2024)
A relação entre a «performance» social e a «performance» económico-financeira
by: Daniel Taborda
Published: (2007)
Can Graph Reordering Speed Up Graph Neural Network Training? An Experimental Study
by: Merkel, Nikolai, et al.
Published: (2024)
On-Demand JSON: A Better Way to Parse Documents?
by: Keiser, John, et al.
Published: (2023)
Energy-Efficient Software Development: A Multi-dimensional Empirical Analysis of Stack Overflow
by: Jin, Bihui, et al.
Published: (2024)
A Controlled Study of Memory Hierarchy Transitions in Quantum Circuit Simulation on Apple M4 Pro Unified Memory Architecture
by: Pratipat, Gyan
Published: (2026)
KG-EDAS: A Meta-Metric Framework for Evaluating Knowledge Graph Completion Models
by: Gul, Haji, et al.
Published: (2025)
Vector-Centric Machine Learning Systems: A Cross-Stack Approach
by: Jiang, Wenqi
Published: (2025)
oneDNN Graph Compiler: A Hybrid Approach for High-Performance Deep Learning Compilation
by: Li, Jianhui, et al.
Published: (2023)
Efficient Graph Knowledge Distillation from GNNs to Kolmogorov-Arnold Networks via Self-Attention Dynamic Sampling
by: Cui, Can, et al.
Published: (2025)
DRAGON (Differentiable Graph Execution): A suite of Hardware Simulation and Optimization tools for Modern AI/Non-AI Workloads
by: Sethi, Khushal
Published: (2022)
Comparison of Vectorization Capabilities of Different Compilers for X86 and ARM CPUs
by: Sakib, Nazmus, et al.
Published: (2025)
Graph-Based Vector Search: An Experimental Evaluation of the State-of-the-Art
by: Azizi, Ilias, et al.
Published: (2025)
Hardware Acceleration for Knowledge Graph Processing: Challenges & Recent Developments
by: Besta, Maciej, et al.
Published: (2024)
A Model-driven Approach for Continuous Performance Engineering in Microservice-based Systems
by: Cortellessa, Vittorio, et al.
Published: (2023)
ACALSim: A Scalable Parallel Simulation Framework for High-Performance System Design Space Exploration
by: Lin, Wei-Fen, et al.
Published: (2026)
Rule-Based Graph Programs Matching the Time Complexity of Imperative Algorithms
by: Alaoui, Ziad Ismaili, et al.
Published: (2025)
Meta-Metrics and Best Practices for System-Level Inference Performance Benchmarking
by: Salaria, Shweta, et al.
Published: (2025)
CAPSim: A Fast CPU Performance Simulator Using Attention-based Predictor
by: Xu, Buqing, et al.
Published: (2025)
Accurate and Fast Approximate Graph Pattern Mining at Scale
by: Arpaci-Dusseau, Anna, et al.
Published: (2024)
DF-GNN: Dynamic Fusion Framework for Attention Graph Neural Networks on GPUs
by: Liu, Jiahui, et al.
Published: (2024)
Columbo: Low Level End-to-End System Traces through Modular Full-System Simulation
by: Görgen, Jakob, et al.
Published: (2024)