Saved in:
Bibliographic Details
Main Authors: Pan, Yuhao, Wang, Xiucheng, Cheng, Nan, Qiu, Qi
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2406.13568
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866910495299076096
author Pan, Yuhao
Wang, Xiucheng
Cheng, Nan
Qiu, Qi
author_facet Pan, Yuhao
Wang, Xiucheng
Cheng, Nan
Qiu, Qi
contents With the rapid development of artificial intelligence technology, the field of reinforcement learning has continuously achieved breakthroughs in both theory and practice. However, traditional reinforcement learning algorithms often entail high energy consumption during interactions with the environment. Spiking Neural Network (SNN), with their low energy consumption characteristics and performance comparable to deep neural networks, have garnered widespread attention. To reduce the energy consumption of practical applications of reinforcement learning, researchers have successively proposed the Pop-SAN and MDC-SAN algorithms. Nonetheless, these algorithms use rectangular functions to approximate the spike network during the training process, resulting in low sensitivity, thus indicating room for improvement in the training effectiveness of SNN. Based on this, we propose a trapezoidal approximation gradient method to replace the spike network, which not only preserves the original stable learning state but also enhances the model's adaptability and response sensitivity under various signal dynamics. Simulation results show that the improved algorithm, using the trapezoidal approximation gradient to replace the spike network, achieves better convergence speed and performance compared to the original algorithm and demonstrates good training stability.
format Preprint
id arxiv_https___arxiv_org_abs_2406_13568
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Trapezoidal Gradient Descent for Effective Reinforcement Learning in Spiking Networks
Pan, Yuhao
Wang, Xiucheng
Cheng, Nan
Qiu, Qi
Artificial Intelligence
With the rapid development of artificial intelligence technology, the field of reinforcement learning has continuously achieved breakthroughs in both theory and practice. However, traditional reinforcement learning algorithms often entail high energy consumption during interactions with the environment. Spiking Neural Network (SNN), with their low energy consumption characteristics and performance comparable to deep neural networks, have garnered widespread attention. To reduce the energy consumption of practical applications of reinforcement learning, researchers have successively proposed the Pop-SAN and MDC-SAN algorithms. Nonetheless, these algorithms use rectangular functions to approximate the spike network during the training process, resulting in low sensitivity, thus indicating room for improvement in the training effectiveness of SNN. Based on this, we propose a trapezoidal approximation gradient method to replace the spike network, which not only preserves the original stable learning state but also enhances the model's adaptability and response sensitivity under various signal dynamics. Simulation results show that the improved algorithm, using the trapezoidal approximation gradient to replace the spike network, achieves better convergence speed and performance compared to the original algorithm and demonstrates good training stability.
title Trapezoidal Gradient Descent for Effective Reinforcement Learning in Spiking Networks
topic Artificial Intelligence
url https://arxiv.org/abs/2406.13568