Saved in:
| Main Authors: | , |
|---|---|
| Format: | Recurso digital |
| Language: | |
| Published: |
Zenodo
2025
|
| Online Access: | https://doi.org/10.5281/zenodo.17829591 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Table of Contents:
- This paper explores the crucial role of activation functions in neural networks, moving beyond the limitations of traditional rectified linear units (ReLU) and sigmoid functions. We propose a reassessment of activation function geometry, focusing on novel designs that enhance network performance through improved gradient flow, increased expressiveness, and better handling of vanishing/exploding gradients. Our investigation encompasses a comprehensive analysis of various activation function properties, including smoothness, monotonicity, and boundedness, and their impact on network training dynamics. We introduce and evaluate several new activation function architectures, demonstrating their superior performance on benchmark datasets compared to standard activation functions. Furthermore, we provide theoretical insights into the relationship between activation function geometry and network optimization, shedding light on the conditions necessary for stable and efficient training. Our findings offer a fresh perspective on activation function design, paving the way for more powerful and robust neural network models.