| Main Authors: | Benfenati, Luca, Risso, Matteo, Vannozzi, Andrea, Yüzügüler, Ahmet Caner, Cavigelli, Lukas, Macii, Enrico, Pagliari, Daniele Jahier, Burrello, Alessio |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Online Access: | https://arxiv.org/abs/2601.21686 |
Similar Items
PRESERVE: Prefetching Model Weights and KV-Cache in Distributed LLM Serving
by: Yüzügüler, Ahmet Caner, et al.
Published: (2025)
OnDA: On-device Channel Pruning for Efficient Personalized Keyword Spotting
by: Risso, Matteo, et al.
Published: (2026)
Optimizing DNN Inference on Multi-Accelerator SoCs at Training-time
by: Risso, Matteo, et al.
Published: (2024)
Joint Pruning and Channel-wise Mixed-Precision Quantization for Efficient Deep Neural Networks
by: Motetti, Beatrice Alessandra, et al.
Published: (2024)
Before Parc Fermé: RL-Time Pruning for Efficient Embodied LLMs in Autonomous Driving
by: Benfenati, Luca, et al.
Published: (2026)
Accelerating Depthwise Separable Convolutions on Ultra-Low-Power Devices
by: Daghero, Francesco, et al.
Published: (2024)
Optimized Deployment of Deep Neural Networks for Visual Pose Estimation on Nano-drones
by: Risso, Matteo, et al.
Published: (2024)
Coupling Neural Networks and Physics Equations For Li-Ion Battery State-of-Charge Prediction
by: Pollo, Giovanni, et al.
Published: (2024)
HW-SW Optimization of DNNs for Privacy-preserving People Counting on Low-resolution Infrared Arrays
by: Risso, Matteo, et al.
Published: (2024)
BlankSkip: Early-exit Object Detection onboard Nano-drones
by: Marra, Carlo, et al.
Published: (2026)
EnhancePPG: Improving PPG-based Heart Rate Estimation with Self-Supervision and Augmentation
by: Benfenati, Luca, et al.
Published: (2024)
Integrating SystemC-AMS Power Modeling with a RISC-V ISS for Virtual Prototyping of Battery-operated Embedded Devices
by: Hamdi, Mohamed Amine, et al.
Published: (2024)
Building Damage Assessment in Conflict Zones: A Deep Learning Approach Using Geospatial Sub-Meter Resolution Data
by: Risso, Matteo, et al.
Published: (2024)
TyphoonMLA: A Mixed Naive-Absorb MLA Kernel For Shared Prefix
by: Yüzügüler, Ahmet Caner, et al.
Published: (2025)
Foundation Models for Structural Health Monitoring
by: Benfenati, Luca, et al.
Published: (2024)
Late Breaking Results: CHESSY: Coupled Hybrid Emulation with SystemC-FPGA Synchronization
by: Ruotolo, Lorenzo, et al.
Published: (2026)
Optimization and Deployment of Deep Neural Networks for PPG-based Blood Pressure Estimation Targeting Low-power Wearables
by: Burrello, Alessio, et al.
Published: (2024)
BISeizuRe: BERT-Inspired Seizure Data Representation to Improve Epilepsy Monitoring
by: Benfenati, Luca, et al.
Published: (2024)
MEbots: Integrating a RISC-V Virtual Platform with a Robotic Simulator for Energy-aware Design
by: Pollo, Giovanni, et al.
Published: (2025)
Improving Continual Learning for Gaussian Splatting based Environments Reconstruction on Commercial Off-the-Shelf Edge Devices
by: Zaino, Ivan, et al.
Published: (2026)
End-to-end Automated Deep Neural Network Optimization for PPG-based Blood Pressure Estimation on Wearables
by: Carlucci, Francesco, et al.
Published: (2026)
Deep Recommender Models Inference: Automatic Asymmetric Data Flow Optimization
by: Ruggeri, Giuseppe, et al.
Published: (2025)
Adaptive Deep Learning for Efficient Visual Pose Estimation aboard Ultra-low-power Nano-drones
by: Motetti, Beatrice Alessandra, et al.
Published: (2024)
Lightweight Software Kernels and Hardware Extensions for Efficient Sparse Deep Neural Networks on Microcontrollers
by: Daghero, Francesco, et al.
Published: (2025)
V-Seek: Accelerating LLM Reasoning on Open-hardware Server-class RISC-V Platforms
by: Rodrigo, Javier J. Poveda, et al.
Published: (2025)
Optimizing Foundation Model Inference on a Many-tiny-core Open-source RISC-V Platform
by: Potocnik, Viviane, et al.
Published: (2024)
Performance evaluation of acceleration of convolutional layers on OpenEdgeCGRA
by: Carpentieri, Nicolò, et al.
Published: (2024)
MATCHA: Efficient Deployment of Deep Neural Networks on Multi-Accelerator Heterogeneous Edge SoCs
by: Russo, Enrico, et al.
Published: (2026)
SilentWear: an Ultra-Low Power Wearable System for EMG-based Silent Speech Recognition
by: Spacone, Giusy, et al.
Published: (2026)
Model-Driven Dataset Generation for Data-Driven Battery SOH Models
by: Alamin, Khaled Sidahmed Sidahmed, et al.
Published: (2024)
HTVM: Efficient Neural Network Deployment On Heterogeneous TinyML Platforms
by: Van Delm, Josse, et al.
Published: (2024)
Integrating SystemC TLM into FMI 3.0 Co-Simulations with an Open-Source Approach
by: Albu, Andrei Mihai, et al.
Published: (2025)
Automatic integration of SystemC in the FMI standard for Software-defined Vehicle design
by: Pollo, Giovanni, et al.
Published: (2025)
Hierarchical Training of Deep Neural Networks Using Early Exiting
by: Sepehri, Yamin, et al.
Published: (2023)
Don't Waste Bits! Adaptive KV-Cache Quantization for Lightweight On-Device LLMs
by: Boroujeni, Sayed Pedram Haeri, et al.
Published: (2026)
MATCH: Model-Aware TVM-based Compilation for Heterogeneous Edge Devices
by: Hamdi, Mohamed Amine, et al.
Published: (2024)
SINQ: Sinkhorn-Normalized Quantization for Calibration-Free Low-Precision LLM Weights
by: Müller, Lorenz K., et al.
Published: (2025)
Don't Break the Cache: An Evaluation of Prompt Caching for Long-Horizon Agentic Tasks
by: Lumer, Elias, et al.
Published: (2026)
Don't be so negative! Score-based Generative Modeling with Oracle-assisted Guidance
by: Naderiparizi, Saeid, et al.
Published: (2023)
Fused-Tiled Layers: Minimizing Data Movement on RISC-V SoCs with Software-Managed Caches
by: Jung, Victor J. B., et al.
Published: (2025)