Saved in:
Bibliographic Details
Main Authors: He, Ruimin, Lin, Shaowei
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2510.26672
Tags: Add Tag
No Tags, Be the first to tag this record!
Table of Contents:
  • At the heart of reinforcement learning are actions -- decisions made in response to observations of the environment. Actions are equally fundamental in the modeling of stochastic processes, as they trigger discontinuous state transitions and enable the flow of information through large, complex systems. In this paper, we unify the perspectives of stochastic processes and reinforcement learning through action-driven processes, and illustrate their application to spiking neural networks. Leveraging ideas from control-as-inference, we show that minimizing the Kullback-Leibler divergence between a policy-driven true distribution and a reward-driven model distribution for a suitably defined action-driven process is equivalent to maximum entropy reinforcement learning.