| Main Author: | Masalskikh, Aleksandr |
|---|---|
| Format: | Digital resource |
| Language: | English |
| Published: | Zenodo, 2026 |
| Subjects: | |
| Online Access: | https://doi.org/10.5281/zenodo.19232218 |
Similar Items
Weight Sparsity Complements Activity Sparsity in Neuromorphic Language Models
by: Mukherji, Rishav, et al.
Published: (2024)
Explore Activation Sparsity in Recurrent LLMs for Energy-Efficient Neuromorphic Computing
by: Knunyants, Ivan, et al.
Published: (2025)
Sparsity Moves Computation: How FFN Architecture Reshapes Attention in Small Transformers
by: Smithline, Gabriel, et al.
Published: (2026)
Predictive Coding Graphs are a Superset of Feedforward Neural Networks
by: van Zwol, Björn
Published: (2026)
Approximation Theory for Neural Networks: Old and New
by: Mukherjee, Soumendu Sundar, et al.
Published: (2026)
Preisach Attention: A Hysteretic Model of Sequential Memory
by: Frydrych, Piotr
Published: (2026)
Nature-Inspired Local Propagation
by: Betti, Alessandro, et al.
Published: (2024)
Predictive Coding Networks and Inference Learning: Tutorial and Survey
by: van Zwol, Björn, et al.
Published: (2024)
Learning Successor Features with Distributed Hebbian Temporal Memory
by: Dzhivelikian, Evgenii, et al.
Published: (2023)
CLASSP: a Biologically-Inspired Approach to Continual Learning through Adjustment Suppression and Sparsity Promotion
by: Ludwig, Oswaldo
Published: (2024)
Sup3r: A Semi-Supervised Algorithm for increasing Sparsity, Stability, and Separability in Hierarchy Of Time-Surfaces architectures
by: Rasetto, Marco, et al.
Published: (2024)
Gate-level boolean evolutionary geometric attention neural networks
by: Shi, Xianshuai, et al.
Published: (2025)
CosineGate: Semantic Dynamic Routing via Cosine Incompatibility in Residual Networks
by: Thota, Yogeswar Reddy
Published: (2025)
Frequency and Generalisation of Periodic Activation Functions in Reinforcement Learning
by: Mavor-Parker, Augustine N., et al.
Published: (2024)
Error-margin Analysis for Hidden Neuron Activation Labels
by: Dalal, Abhilekha, et al.
Published: (2024)
A Generative Neural Annealer for Black-Box Combinatorial Optimization
by: Zhang, Yuan-Hang, et al.
Published: (2025)
Differential learning kinetics govern the transition from memorization to generalization during in-context learning
by: Nguyen, Alex, et al.
Published: (2024)
Synaptic Activation and Dual Liquid Dynamics for Interpretable Bio-Inspired Models
by: Farsang, Mónika, et al.
Published: (2026)
LANCE: Low Rank Activation Compression for Efficient On-Device Continual Learning
by: Apolinario, Marco Paul E., et al.
Published: (2025)
Breaking the Conventional Forward-Backward Tie in Neural Networks: Activation Functions
by: Troiano, Luigi, et al.
Published: (2025)
Parallel Algorithms for Exact Enumeration of Deep Neural Network Activation Regions
by: Drammis, Sabrina, et al.
Published: (2024)
Neuro-Symbolic Activation Discovery: Transferring Mathematical Structures from Physics to Ecology for Parameter-Efficient Neural Networks
by: Hajbi, Anas
Published: (2026)
GRSN: Gated Recurrent Spiking Neurons for POMDPs and MARL
by: Qin, Lang, et al.
Published: (2024)
Cascaded Transformer for Robust and Scalable SLA Decomposition via Amortized Optimization
by: Hsu, Cyril Shih-Huan
Published: (2026)
Scaling Laws Do Not Scale
by: Diaz, Fernando, et al.
Published: (2023)
Demystifying MPNNs: Message Passing as Merely Efficient Matrix Multiplication
by: Jiang, Qin, et al.
Published: (2025)
Transformer-Empowered Actor-Critic Reinforcement Learning for Sequence-Aware Service Function Chain Partitioning
by: Hsu, Cyril Shih-Huan, et al.
Published: (2025)
Discover physical concepts and equations with machine learning
by: Li, Bao-Bing, et al.
Published: (2024)
SkyCharge: Deploying Unmanned Aerial Vehicles for Dynamic Load Optimization in Solar Small Cell 5G Networks
by: Dave, Daksh, et al.
Published: (2023)
Activations Through Extensions: A Framework To Boost Performance Of Neural Networks
by: Kamanchi, Chandramouli, et al.
Published: (2024)
A Scalable Measure of Loss Landscape Curvature for Analyzing the Training Dynamics of LLMs
by: Kalra, Dayal Singh, et al.
Published: (2026)
Quantifying Hyperparameter Transfer and the Importance of Embedding Layer Learning Rate
by: Kalra, Dayal Singh, et al.
Published: (2026)
Spectral Dynamics in Deep Networks: Feature Learning, Outlier Escape, and Learning Rate Transfer
by: Lauditi, Clarissa, et al.
Published: (2026)
On the origin of neural scaling laws: from random graphs to natural language
by: Barkeshli, Maissam, et al.
Published: (2026)
Learning Shrinks the Hard Tail: Training-Dependent Inference Scaling in a Solvable Linear Model
by: Levi, Noam
Published: (2026)
More Bang for the Buck: Improving the Inference of Large Language Models at a Fixed Budget using Reset and Discard (ReD)
by: Meir, Sagi, et al.
Published: (2026)
Scaling Laws and Spectra of Shallow Neural Networks in the Feature Learning Regime
by: Defilippis, Leonardo, et al.
Published: (2025)
Generalization through variance: how noise shapes inductive biases in diffusion models
by: Vastola, John J.
Published: (2025)
Identifying internal patterns in (1+1)-dimensional directed percolation using neural networks
by: Parkhomenko, Danil, et al.
Published: (2025)
Grokking vs. Learning: Same Features, Different Encodings
by: Manning-Coe, Dmitry, et al.
Published: (2025)