:: Library Catalog

Copertina

Salvato in:

Dettagli Bibliografici
Autori principali:	Lansiaux, Edouard, Azzouz, Ramy, Chazard, Emmanuel, Vromant, Amélie, Wiel, Eric
Natura:	Preprint
Pubblicazione:	2025
Soggetti:	Machine Learning Performance
Accesso online:	https://arxiv.org/abs/2507.01080
Tags:	Aggiungi Tag Nessun Tag, puoi essere il primo ad aggiungerne!!

Documenti analoghi

SwiftEmbed: Ultra-Fast Text Embeddings via Static Token Lookup for Real-Time Applications
di: Lansiaux, Edouard, et al.
Pubblicazione: (2025)

Emergency Department Patient Flow Optimization with an Alternative Care Threshold Policy
di: Baniasadi, Sahba, et al.
Pubblicazione: (2026)

Zero-Knowledge Federated Learning with Lattice-Based Hybrid Encryption for Quantum-Resilient Medical AI
di: Lansiaux, Edouard
Pubblicazione: (2026)

Building an Accelerated OpenFOAM Proof-of-Concept Application using Modern C++
di: Malenza, Giulio, et al.
Pubblicazione: (2025)

Artificial Intelligence and its Impact on Academic Performance of students with Disabilities in Nasarawa State University, Keffi
di: Osita, Emmanuel Izuchukwu, et al.
Pubblicazione: (2026)

Impact of AI-Triage on Radiologist Report Turnaround Time: Real-World Time-Savings and Insights from Model Predictions
di: Thompson, Yee Lam Elim, et al.
Pubblicazione: (2025)

Understanding and Benchmarking Artificial Intelligence: OpenAI's o3 Is Not AGI
di: Pfister, Rolf, et al.
Pubblicazione: (2025)

Impact of Data-Oriented and Object-Oriented Design on Performance and Cache Utilization with Artificial Intelligence Algorithms in Multi-Threaded CPUs
di: Arantes, Gabriel M., et al.
Pubblicazione: (2025)

Support Systems of Clinical Decisions in the Triage of the Emergency Department Using Artificial Intelligence: The Efficiency to Support Triage
di: Eleni Karlafti
Pubblicazione: (2023)

OrthoAI v2: From Single-Agent Segmentation to Dual-Agent Treatment Planning for Clear Aligners
di: Edouard, Lansiaux, et al.
Pubblicazione: (2026)

Achieving Consistent and Comparable CPU Evaluation
di: Wang, Chenxi, et al.
Pubblicazione: (2024)

GreenLLM: SLO-Aware Dynamic Frequency Scaling for Energy-Efficient LLM Serving
di: Liu, Qunyou, et al.
Pubblicazione: (2025)

Characterizing WebGPU Dispatch Overhead for LLM Inference Across Four GPU Vendors, Three Backends, and Three Browsers
di: Maczan, Jędrzej
Pubblicazione: (2026)

msf-CNN: Patch-based Multi-Stage Fusion with Convolutional Neural Networks for TinyML
di: Huang, Zhaolan, et al.
Pubblicazione: (2025)

COMPASS: A Unified Decision-Intelligence System for Navigating Performance Trade-off in HPC
di: Lahiry, Ankur, et al.
Pubblicazione: (2026)

Statistical Modeling and Uncertainty Estimation of LLM Inference Systems
di: Ray, Kaustabha, et al.
Pubblicazione: (2025)

BurstGPT: A Real-world Workload Dataset to Optimize LLM Serving Systems
di: Wang, Yuxin, et al.
Pubblicazione: (2024)

High-Performance Portable GPU Primitives for Arbitrary Types and Operators in Julia
di: Pilliat, Emmanuel
Pubblicazione: (2026)

Characterize LSM-tree Compaction Performance via On-Device LLM Inference
di: Ding, Jiabiao, et al.
Pubblicazione: (2026)

Towards Computational Performance Engineering for Unsupervised Concept Drift Detection -- Complexities, Benchmarking, Performance Analysis
di: Werner, Elias, et al.
Pubblicazione: (2023)

SparseInfer: Training-free Prediction of Activation Sparsity for Fast LLM Inference
di: Shin, Jiho, et al.
Pubblicazione: (2024)

SparseX: Efficient Segment-Level KV Cache Sharing for Interleaved LLM Serving
di: Zhang, Quqing, et al.
Pubblicazione: (2026)

Architectural Trade-offs in the Energy-Efficient Era: A Comparative Study of power-capping NVIDIA H100 and H200
di: Ujeniya, Aditya, et al.
Pubblicazione: (2026)

A Tale of Three Location Trackers: AirTag, SmartTag, and Tile
di: Jang, HyunSeok Daniel, et al.
Pubblicazione: (2025)

U-TOE: Universal TinyML On-board Evaluation Toolkit for Low-Power IoT
di: Huang, Zhaolan, et al.
Pubblicazione: (2023)

H2EAL: Hybrid-Bonding Architecture with Hybrid Sparse Attention for Efficient Long-Context LLM Inference
di: Fu, Zizhuo, et al.
Pubblicazione: (2025)

Examem: Low-Overhead Memory Instrumentation for Intelligent Memory Systems
di: Poduval, Ashwin, et al.
Pubblicazione: (2024)

Optimas: An Intelligent Analytics-Informed Generative AI Framework for Performance Optimization
di: Zaeed, Mohammad, et al.
Pubblicazione: (2026)

A Comparative Study and Implementation of Key Derivation Functions Standardized by NIST and IEEE
di: Chen, Abel C. H.
Pubblicazione: (2025)

Growth performance of Trichogaster pectoralis Regan in intensively cultivated rice fields / N. Vromant
di: Vromant, N
Pubblicazione: (2001)

An Interpretable Latency Model for Speculative Decoding in LLM Serving
di: Kong, Linghao, et al.
Pubblicazione: (2026)

An Inquiry into Datacenter TCO for LLM Inference with FP8
di: Kim, Jiwoo, et al.
Pubblicazione: (2025)

Intelligent Green Efficiency for Intrusion Detection
di: Pereira, Pedro, et al.
Pubblicazione: (2024)

When Scanners Lie: Evaluator Instability in LLM Red-Teaming
di: Erez, Lidor, et al.
Pubblicazione: (2026)

CEBench: A Benchmarking Toolkit for the Cost-Effectiveness of LLM Pipelines
di: Sun, Wenbo, et al.
Pubblicazione: (2024)

FACT: Compositional Kernel Synthesis with a Three-Stage Agentic Workflow
di: Heidari, Sina, et al.
Pubblicazione: (2026)

SweetSpot: An Analytical Model for Predicting Energy Efficiency of LLM Inference
di: Cavagna, Hiari Pizzini, et al.
Pubblicazione: (2026)

TakuNet: an Energy-Efficient CNN for Real-Time Inference on Embedded UAV systems in Emergency Response Scenarios
di: Rossi, Daniel, et al.
Pubblicazione: (2025)

A relação entre a «performance» social e a «performance» económico-financeira
di: Daniel Taborda
Pubblicazione: (2007)

ZKProphet: Understanding Performance of Zero-Knowledge Proofs on GPUs
di: Verma, Tarunesh, et al.
Pubblicazione: (2025)