Saved in:
| Main Authors: | Yeganeh, Yavar Taheri, Jafari, Mohsen, Matta, Andrea |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2406.09322 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Deep Active Inference Agents for Delayed and Long-Horizon Environments
by: Yeganeh, Yavar Taheri, et al.
Published: (2025)
by: Yeganeh, Yavar Taheri, et al.
Published: (2025)
Active Inference for Energy Control and Planning in Smart Buildings and Communities
by: Nazemi, Seyyed Danial, et al.
Published: (2025)
by: Nazemi, Seyyed Danial, et al.
Published: (2025)
ICaRus: Identical Cache Reuse for Efficient Multi Model Inference
by: Woo, Sunghyeon, et al.
Published: (2026)
by: Woo, Sunghyeon, et al.
Published: (2026)
A Parallel Alternative for Energy-Efficient Neural Network Training and Inferencing
by: Seal, Sudip K., et al.
Published: (2025)
by: Seal, Sudip K., et al.
Published: (2025)
Scaling Laws Meet Model Architecture: Toward Inference-Efficient LLMs
by: Bian, Song, et al.
Published: (2025)
by: Bian, Song, et al.
Published: (2025)
Adaptive-lambda Subtracted Importance Sampled Scores in Machine Unlearning for DDPMs and VAEs
by: Dini, MohammadParsa, et al.
Published: (2025)
by: Dini, MohammadParsa, et al.
Published: (2025)
Generalization and Membership Inference Attack a Practical Perspective
by: Rahmani, Fateme, et al.
Published: (2026)
by: Rahmani, Fateme, et al.
Published: (2026)
TPLA: Tensor Parallel Latent Attention for Efficient Disaggregated Prefill and Decode Inference
by: Tang, Xiaojuan, et al.
Published: (2025)
by: Tang, Xiaojuan, et al.
Published: (2025)
Bayesian Inverse Problems Meet Flow Matching: Efficient and Flexible Inference via Transformers
by: Sherki, Daniil, et al.
Published: (2025)
by: Sherki, Daniil, et al.
Published: (2025)
Generation Meets Verification: Accelerating Large Language Model Inference with Smart Parallel Auto-Correct Decoding
by: Yi, Hanling, et al.
Published: (2024)
by: Yi, Hanling, et al.
Published: (2024)
DepCap: Adaptive Block-Wise Parallel Decoding for Efficient Diffusion LM Inference
by: Xia, Xiang, et al.
Published: (2026)
by: Xia, Xiang, et al.
Published: (2026)
SPARQ: Spiking Early-Exit Neural Networks for Energy-Efficient Edge AI
by: Patne, Parth, et al.
Published: (2026)
by: Patne, Parth, et al.
Published: (2026)
An Efficient Hybrid Sparse Attention with CPU-GPU Parallelism for Long-Context Inference
by: Yao, Feiyu, et al.
Published: (2026)
by: Yao, Feiyu, et al.
Published: (2026)
Bit-Identical Medical Deep Learning via Structured Orthogonal Initialization
by: Shkolnikov, Yakov Pyotr
Published: (2026)
by: Shkolnikov, Yakov Pyotr
Published: (2026)
Accelerating Transformer Inference for Translation via Parallel Decoding
by: Santilli, Andrea, et al.
Published: (2023)
by: Santilli, Andrea, et al.
Published: (2023)
Limitations of Using Identical Distributions for Training and Testing When Learning Boolean Functions
by: Pérez-Guijarro, Jordi
Published: (2025)
by: Pérez-Guijarro, Jordi
Published: (2025)
Energy Consumption in Parallel Neural Network Training
by: Huber, Philipp, et al.
Published: (2025)
by: Huber, Philipp, et al.
Published: (2025)
Towards Low-bit Communication for Tensor Parallel LLM Inference
by: Dong, Harry, et al.
Published: (2024)
by: Dong, Harry, et al.
Published: (2024)
Non-Identical Diffusion Models in MIMO-OFDM Channel Generation
by: Yang, Yuzhi, et al.
Published: (2025)
by: Yang, Yuzhi, et al.
Published: (2025)
Pathway-based Progressive Inference (PaPI) for Energy-Efficient Continual Learning
by: Gaurav, Suyash, et al.
Published: (2025)
by: Gaurav, Suyash, et al.
Published: (2025)
Learning An Active Inference Model of Driver Perception and Control: Application to Vehicle Car-Following
by: Wei, Ran, et al.
Published: (2023)
by: Wei, Ran, et al.
Published: (2023)
Energy-Efficient Wireless LLM Inference via Uncertainty and Importance-Aware Speculative Decoding
by: Park, Jihoon, et al.
Published: (2025)
by: Park, Jihoon, et al.
Published: (2025)
Energy-Efficient Transformer Inference: Optimization Strategies for Time Series Classification
by: Kermani, Arshia, et al.
Published: (2025)
by: Kermani, Arshia, et al.
Published: (2025)
Sensitivity-Guided Framework for Pruned and Quantized Reservoir Computing Accelerators
by: Jafari, Atousa, et al.
Published: (2026)
by: Jafari, Atousa, et al.
Published: (2026)
Agentic Unlearning: When LLM Agent Meets Machine Unlearning
by: Wang, Bin, et al.
Published: (2026)
by: Wang, Bin, et al.
Published: (2026)
Communication Compression for Tensor Parallel LLM Inference
by: Hansen-Palmus, Jan, et al.
Published: (2024)
by: Hansen-Palmus, Jan, et al.
Published: (2024)
Value of Information and Reward Specification in Active Inference and POMDPs
by: Wei, Ran
Published: (2024)
by: Wei, Ran
Published: (2024)
Active Inference with Reusable State-Dependent Value Profiles
by: Poschl, Jacob
Published: (2025)
by: Poschl, Jacob
Published: (2025)
Real-World Robot Control by Deep Active Inference With a Temporally Hierarchical World Model
by: Fujii, Kentaro, et al.
Published: (2025)
by: Fujii, Kentaro, et al.
Published: (2025)
Efficient Triple Modular Redundancy for Reliability Enhancement of DNNs Using Explainable AI
by: Soroush, Kimia, et al.
Published: (2025)
by: Soroush, Kimia, et al.
Published: (2025)
Recursive Inference Machines for Neural Reasoning
by: Komisarczyk, Mieszko, et al.
Published: (2026)
by: Komisarczyk, Mieszko, et al.
Published: (2026)
Evaluating the Energy Efficiency of NPU-Accelerated Machine Learning Inference on Embedded Microcontrollers
by: Fanariotis, Anastasios, et al.
Published: (2025)
by: Fanariotis, Anastasios, et al.
Published: (2025)
Paged Attention Meets FlexAttention: Unlocking Long-Context Efficiency in Deployed Inference
by: Joshi, Thomas, et al.
Published: (2025)
by: Joshi, Thomas, et al.
Published: (2025)
COVID-19 Probability Prediction Using Machine Learning: An Infectious Approach
by: Ilani, Mohsen Asghari, et al.
Published: (2024)
by: Ilani, Mohsen Asghari, et al.
Published: (2024)
ActiveDPO: Active Direct Preference Optimization for Sample-Efficient Alignment
by: Lin, Xiaoqiang, et al.
Published: (2025)
by: Lin, Xiaoqiang, et al.
Published: (2025)
Talaria: Interactively Optimizing Machine Learning Models for Efficient Inference
by: Hohman, Fred, et al.
Published: (2024)
by: Hohman, Fred, et al.
Published: (2024)
Sample-Efficient Expert Query Control in Active Imitation Learning via Conformal Prediction
by: Firouzkouhi, Arad, et al.
Published: (2025)
by: Firouzkouhi, Arad, et al.
Published: (2025)
Instance-Adaptive Parametrization for Amortized Variational Inference
by: Pollastro, Andrea, et al.
Published: (2026)
by: Pollastro, Andrea, et al.
Published: (2026)
Sensi: Learn One Thing at a Time -- Curriculum-Based Test-Time Learning for LLM Game Agents
by: Arjmandi, Mohsen
Published: (2026)
by: Arjmandi, Mohsen
Published: (2026)
Towards Human-Like Driving: Active Inference in Autonomous Vehicle Control
by: Delavari, Elahe, et al.
Published: (2024)
by: Delavari, Elahe, et al.
Published: (2024)
Similar Items
-
Deep Active Inference Agents for Delayed and Long-Horizon Environments
by: Yeganeh, Yavar Taheri, et al.
Published: (2025) -
Active Inference for Energy Control and Planning in Smart Buildings and Communities
by: Nazemi, Seyyed Danial, et al.
Published: (2025) -
ICaRus: Identical Cache Reuse for Efficient Multi Model Inference
by: Woo, Sunghyeon, et al.
Published: (2026) -
A Parallel Alternative for Energy-Efficient Neural Network Training and Inferencing
by: Seal, Sudip K., et al.
Published: (2025) -
Scaling Laws Meet Model Architecture: Toward Inference-Efficient LLMs
by: Bian, Song, et al.
Published: (2025)