:: Library Catalog

Bibliographic Details
Main Authors:	Cao, Yiyue; Zheng, Mingzhe; Cong, Lin William; Li, Siguang; Wang, Xuechao
Format:	Preprint
Published:	2026
Subjects:	Performance
Online Access:	https://arxiv.org/abs/2604.03083

Similar Items

Obfuscation as an Effective Signal for Prioritizing Cross-Chain Smart Contract Audits: Large-Scale Measurement and Risk Profiling
by: Zhao, Yao, et al.
Published: (2026)

Inference performance evaluation for LLMs on edge devices with a novel benchmarking framework and metric
by: Chen, Hao, et al.
Published: (2025)

Mosaic: Cross-Modal Clustering for Efficient Video Understanding
by: Wang, Tuowei, et al.
Published: (2026)

SysOM-AI: Continuous Cross-Layer Performance Diagnosis for Production AI Training
by: Zheng, Yusheng, et al.
Published: (2026)

Scaler: Efficient and Effective Cross Flow Analysis
by: Steven, et al.
Published: (2024)

OmniSim: Simulating Hardware with C Speed and RTL Accuracy for High-Level Synthesis Designs
by: Sarkar, Rishov, et al.
Published: (2025)

HD-MoE: Hybrid and Dynamic Parallelism for Mixture-of-Expert LLMs with 3D Near-Memory Processing
by: Huang, Haochen, et al.
Published: (2025)

LightningSimV2: Faster and Scalable Simulation for High-Level Synthesis via Graph Compilation and Optimization
by: Sarkar, Rishov, et al.
Published: (2024)

The Adoption of Innovations over Time: Structural Determinants and Consequences in Library Organizations.
by: Damanpour, Fariborz, et al.
Published: (1992)

Anatomizing Deep Learning Inference in Web Browsers
by: Wang, Qipeng, et al.
Published: (2024)

DeepContext: A Context-aware, Cross-platform, and Cross-framework Tool for Performance Profiling and Analysis of Deep Learning Workloads
by: Zhao, Qidong, et al.
Published: (2024)

SPEC CPU2026: Characterization, Representativeness, and Cross-Suite Comparison
by: Li, Ruihao, et al.
Published: (2026)

SimLens for Early Exit in Large Language Models: Eliciting Accurate Latent Predictions with One More Token
by: Ma, Ming, et al.
Published: (2025)

Profiling Apple Silicon Performance for ML Training
by: Feng, Dahua, et al.
Published: (2025)

CipherBank: Exploring the Boundary of LLM Reasoning Capabilities through Cryptography Challenges
by: Li, Yu, et al.
Published: (2025)

A Zoned Storage Optimized Flash Cache on ZNS SSDs
by: Yang, Chongzhuo, et al.
Published: (2024)

Demystifying Serverless Costs on Public Platforms: Bridging Billing, Architecture, and OS Scheduling
by: Lin, Changyuan, et al.
Published: (2025)

A Microbenchmark Framework for Performance Evaluation of OpenMP Target Offloading
by: Atif, Mohammad, et al.
Published: (2025)

Unleashing the Power of Preemptive Priority-based Scheduling for Real-Time GPU Tasks
by: Wang, Yidi, et al.
Published: (2024)

Atys: An Efficient Profiling Framework for Identifying Hotspot Functions in Large-scale Cloud Microservices
by: Sun, Jiaqi, et al.
Published: (2025)

Accelerating Transistor-Level Simulation of Integrated Circuits via Equivalence of RC Long-Chain Structures
by: Tang, Ruibai, et al.
Published: (2025)

Towards a Higher Roofline for Matrix-Vector Multiplication in Matrix-Free HOSFEM
by: Cao, Zijian, et al.
Published: (2025)

Profiling Large Language Model Inference on Apple Silicon: A Quantization Perspective
by: Benazir, Afsara, et al.
Published: (2025)

SparseX: Efficient Segment-Level KV Cache Sharing for Interleaved LLM Serving
by: Zhang, Quqing, et al.
Published: (2026)

A Stochastic Geometry Based Techno-Economic Analysis of RIS-Assisted Cellular Networks
by: Sun, Guodong, et al.
Published: (2025)

DSO: A GPU Energy Efficiency Optimizer by Fusing Dynamic and Static Information
by: Wang, Qiang, et al.
Published: (2024)

Exploring Topologies in Quantum Annealing: A Hardware-Aware Perspective
by: Bifulco, Mario, et al.
Published: (2025)

Noise Injection for Performance Bottleneck Analysis
by: Delval, Aurélien, et al.
Published: (2025)

Two Criteria for Performance Analysis of Optimization Algorithms
by: Jing, Yunpeng, et al.
Published: (2024)

StiffGIPC: Advancing GPU IPC for stiff affine-deformable simulation
by: Huang, Kemeng, et al.
Published: (2024)

Two-Timescale Dynamic Service Deployment and Task Scheduling with Spatiotemporal Collaboration in Mobile Edge Networks
by: Li, Yang, et al.
Published: (2025)

Systematic Performance Evaluation Framework for LEO Mega-Constellation Satellite Networks
by: Wang, Yu, et al.
Published: (2024)

Spatiotemporal Non-Uniformity-Aware Online Task Scheduling in Collaborative Edge Computing for Industrial Internet of Things
by: Li, Yang, et al.
Published: (2025)

Attributing the System's Overall Effect to its Components
by: Wang, Chenxi, et al.
Published: (2026)

Harnessing Chain-of-Thought Metadata for Task Routing and Adversarial Prompt Detection
by: Marinelli, Ryan, et al.
Published: (2025)

PANDA: Noise-Resilient Antagonist Identification in Production Datacenters
by: Zhou, Sixiang, et al.
Published: (2025)

Redundant Array Computation Elimination
by: Wang, Zixuan, et al.
Published: (2025)

A relação entre a «performance» social e a «performance» económico-financeira
by: Daniel Taborda
Published: (2007)

Green AI: Exploring Carbon Footprints, Mitigation Strategies, and Trade Offs in Large Language Model Training
by: Liu, Vivian, et al.
Published: (2024)

DRIM-ANN: An Approximate Nearest Neighbor Search Engine based on Commercial DRAM-PIMs
by: Chen, Mingkai, et al.
Published: (2024)