Bibliographic Details
Main Authors:	Yeganeh, Yavar Taheri; Jafari, Mohsen; Matta, Andrea
Format:	Preprint
Published:	2024
Subjects:	Machine Learning; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2406.09322

Similar Items

Deep Active Inference Agents for Delayed and Long-Horizon Environments
by: Yeganeh, Yavar Taheri, et al.
Published: (2025)

Active Inference for Energy Control and Planning in Smart Buildings and Communities
by: Nazemi, Seyyed Danial, et al.
Published: (2025)

ICaRus: Identical Cache Reuse for Efficient Multi Model Inference
by: Woo, Sunghyeon, et al.
Published: (2026)

A Parallel Alternative for Energy-Efficient Neural Network Training and Inferencing
by: Seal, Sudip K., et al.
Published: (2025)

Scaling Laws Meet Model Architecture: Toward Inference-Efficient LLMs
by: Bian, Song, et al.
Published: (2025)

Adaptive-lambda Subtracted Importance Sampled Scores in Machine Unlearning for DDPMs and VAEs
by: Dini, MohammadParsa, et al.
Published: (2025)

Generalization and Membership Inference Attack a Practical Perspective
by: Rahmani, Fateme, et al.
Published: (2026)

TPLA: Tensor Parallel Latent Attention for Efficient Disaggregated Prefill and Decode Inference
by: Tang, Xiaojuan, et al.
Published: (2025)

Bayesian Inverse Problems Meet Flow Matching: Efficient and Flexible Inference via Transformers
by: Sherki, Daniil, et al.
Published: (2025)

Generation Meets Verification: Accelerating Large Language Model Inference with Smart Parallel Auto-Correct Decoding
by: Yi, Hanling, et al.
Published: (2024)

DepCap: Adaptive Block-Wise Parallel Decoding for Efficient Diffusion LM Inference
by: Xia, Xiang, et al.
Published: (2026)

SPARQ: Spiking Early-Exit Neural Networks for Energy-Efficient Edge AI
by: Patne, Parth, et al.
Published: (2026)

An Efficient Hybrid Sparse Attention with CPU-GPU Parallelism for Long-Context Inference
by: Yao, Feiyu, et al.
Published: (2026)

Bit-Identical Medical Deep Learning via Structured Orthogonal Initialization
by: Shkolnikov, Yakov Pyotr
Published: (2026)

Accelerating Transformer Inference for Translation via Parallel Decoding
by: Santilli, Andrea, et al.
Published: (2023)

Limitations of Using Identical Distributions for Training and Testing When Learning Boolean Functions
by: Pérez-Guijarro, Jordi
Published: (2025)

Energy Consumption in Parallel Neural Network Training
by: Huber, Philipp, et al.
Published: (2025)

Towards Low-bit Communication for Tensor Parallel LLM Inference
by: Dong, Harry, et al.
Published: (2024)

Non-Identical Diffusion Models in MIMO-OFDM Channel Generation
by: Yang, Yuzhi, et al.
Published: (2025)

Pathway-based Progressive Inference (PaPI) for Energy-Efficient Continual Learning
by: Gaurav, Suyash, et al.
Published: (2025)

Learning An Active Inference Model of Driver Perception and Control: Application to Vehicle Car-Following
by: Wei, Ran, et al.
Published: (2023)

Energy-Efficient Wireless LLM Inference via Uncertainty and Importance-Aware Speculative Decoding
by: Park, Jihoon, et al.
Published: (2025)

Energy-Efficient Transformer Inference: Optimization Strategies for Time Series Classification
by: Kermani, Arshia, et al.
Published: (2025)

Sensitivity-Guided Framework for Pruned and Quantized Reservoir Computing Accelerators
by: Jafari, Atousa, et al.
Published: (2026)

Agentic Unlearning: When LLM Agent Meets Machine Unlearning
by: Wang, Bin, et al.
Published: (2026)

Communication Compression for Tensor Parallel LLM Inference
by: Hansen-Palmus, Jan, et al.
Published: (2024)

Value of Information and Reward Specification in Active Inference and POMDPs
by: Wei, Ran
Published: (2024)

Active Inference with Reusable State-Dependent Value Profiles
by: Poschl, Jacob
Published: (2025)

Real-World Robot Control by Deep Active Inference With a Temporally Hierarchical World Model
by: Fujii, Kentaro, et al.
Published: (2025)

Efficient Triple Modular Redundancy for Reliability Enhancement of DNNs Using Explainable AI
by: Soroush, Kimia, et al.
Published: (2025)

Recursive Inference Machines for Neural Reasoning
by: Komisarczyk, Mieszko, et al.
Published: (2026)

Evaluating the Energy Efficiency of NPU-Accelerated Machine Learning Inference on Embedded Microcontrollers
by: Fanariotis, Anastasios, et al.
Published: (2025)

Paged Attention Meets FlexAttention: Unlocking Long-Context Efficiency in Deployed Inference
by: Joshi, Thomas, et al.
Published: (2025)

COVID-19 Probability Prediction Using Machine Learning: An Infectious Approach
by: Ilani, Mohsen Asghari, et al.
Published: (2024)

ActiveDPO: Active Direct Preference Optimization for Sample-Efficient Alignment
by: Lin, Xiaoqiang, et al.
Published: (2025)

Talaria: Interactively Optimizing Machine Learning Models for Efficient Inference
by: Hohman, Fred, et al.
Published: (2024)

Sample-Efficient Expert Query Control in Active Imitation Learning via Conformal Prediction
by: Firouzkouhi, Arad, et al.
Published: (2025)

Instance-Adaptive Parametrization for Amortized Variational Inference
by: Pollastro, Andrea, et al.
Published: (2026)

Sensi: Learn One Thing at a Time -- Curriculum-Based Test-Time Learning for LLM Game Agents
by: Arjmandi, Mohsen
Published: (2026)

Towards Human-Like Driving: Active Inference in Autonomous Vehicle Control
by: Delavari, Elahe, et al.
Published: (2024)