Saved in:
| Main Authors: | Jagtap, Rohan, Dhage, Sudhir N. |
|---|---|
| Format: | Preprint |
| Published: |
2020
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2004.04902 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Neural Information Organizing and Processing -- Neural Machines
by: Petrila, Iosif Iulian
Published: (2024)
by: Petrila, Iosif Iulian
Published: (2024)
Large Language Models for Tuning Evolution Strategies
by: Kramer, Oliver
Published: (2024)
by: Kramer, Oliver
Published: (2024)
EvoX: Meta-Evolution for Automated Discovery
by: Liu, Shu, et al.
Published: (2026)
by: Liu, Shu, et al.
Published: (2026)
Intelligent Neural Networks: From Layered Architectures to Graph-Organized Intelligence
by: Salomon, Antoine
Published: (2025)
by: Salomon, Antoine
Published: (2025)
SpikeGPT: Generative Pre-trained Language Model with Spiking Neural Networks
by: Zhu, Rui-Jie, et al.
Published: (2023)
by: Zhu, Rui-Jie, et al.
Published: (2023)
SpikeLLM: Scaling up Spiking Neural Network to Large Language Models via Saliency-based Spiking
by: Xing, Xingrun, et al.
Published: (2024)
by: Xing, Xingrun, et al.
Published: (2024)
Language Models and Cycle Consistency for Self-Reflective Machine Translation
by: Wangni, Jianqiao
Published: (2024)
by: Wangni, Jianqiao
Published: (2024)
The Evolution of Learning Algorithms for Artificial Neural Networks
by: Baxter, Jonathan
Published: (2025)
by: Baxter, Jonathan
Published: (2025)
Multiple Population Alternate Evolution Neural Architecture Search
by: Zou, Juan, et al.
Published: (2024)
by: Zou, Juan, et al.
Published: (2024)
Multi-Class Imbalanced Learning with Support Vector Machines via Differential Evolution
by: Zhang, Zhong-Liang, et al.
Published: (2025)
by: Zhang, Zhong-Liang, et al.
Published: (2025)
HAT: Hardware-Aware Transformers for Efficient Natural Language Processing
by: Wang, Hanrui, et al.
Published: (2020)
by: Wang, Hanrui, et al.
Published: (2020)
EvolKV: Evolutionary KV Cache Compression for LLM Inference
by: Yu, Bohan, et al.
Published: (2025)
by: Yu, Bohan, et al.
Published: (2025)
Decomposing Evolutionary Mixture-of-LoRA Architectures: The Routing Lever, the Lifecycle Penalty, and a Substrate-Conditional Boundary
by: Kumaresan, Ramchand
Published: (2026)
by: Kumaresan, Ramchand
Published: (2026)
Pruner-Zero: Evolving Symbolic Pruning Metric from scratch for Large Language Models
by: Dong, Peijie, et al.
Published: (2024)
by: Dong, Peijie, et al.
Published: (2024)
SpikeLM: Towards General Spike-Driven Language Modeling via Elastic Bi-Spiking Mechanisms
by: Xing, Xingrun, et al.
Published: (2024)
by: Xing, Xingrun, et al.
Published: (2024)
Hysteresis Activation Function for Efficient Inference
by: Kimhi, Moshe, et al.
Published: (2024)
by: Kimhi, Moshe, et al.
Published: (2024)
Large Language Models Suffer From Their Own Output: An Analysis of the Self-Consuming Training Loop
by: Briesch, Martin, et al.
Published: (2023)
by: Briesch, Martin, et al.
Published: (2023)
Genetic Instruct: Scaling up Synthetic Generation of Coding Instructions for Large Language Models
by: Majumdar, Somshubra, et al.
Published: (2024)
by: Majumdar, Somshubra, et al.
Published: (2024)
Pre-trained Language Models Learn Remarkably Accurate Representations of Numbers
by: Kadlčík, Marek, et al.
Published: (2025)
by: Kadlčík, Marek, et al.
Published: (2025)
AP-BMM: Approximating Capability-Cost Pareto Sets of LLMs via Asynchronous Prior-Guided Bayesian Model Merging
by: Chen, Kesheng, et al.
Published: (2025)
by: Chen, Kesheng, et al.
Published: (2025)
On the Power of Convolution Augmented Transformer
by: Li, Mingchen, et al.
Published: (2024)
by: Li, Mingchen, et al.
Published: (2024)
A Hormone-inspired Emotion Layer for Transformer language models (HELT)
by: Reda, Eslam, et al.
Published: (2026)
by: Reda, Eslam, et al.
Published: (2026)
SpikingSSMs: Learning Long Sequences with Sparse and Parallel Spiking State Space Models
by: Shen, Shuaijie, et al.
Published: (2024)
by: Shen, Shuaijie, et al.
Published: (2024)
SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention
by: Csordás, Róbert, et al.
Published: (2023)
by: Csordás, Róbert, et al.
Published: (2023)
Sorbet: A Neuromorphic Hardware-Compatible Transformer-Based Spiking Language Model
by: Tang, Kaiwen, et al.
Published: (2024)
by: Tang, Kaiwen, et al.
Published: (2024)
An enhanced Teaching-Learning-Based Optimization (TLBO) with Grey Wolf Optimizer (GWO) for text feature selection and clustering
by: Azarshab, Mahsa, et al.
Published: (2024)
by: Azarshab, Mahsa, et al.
Published: (2024)
BrainTransformers: SNN-LLM
by: Tang, Zhengzheng, et al.
Published: (2024)
by: Tang, Zhengzheng, et al.
Published: (2024)
ComplicaCode: Enhancing Disease Complication Detection in Electronic Health Records through ICD Path Generation
by: Zhou, Xiaofan
Published: (2023)
by: Zhou, Xiaofan
Published: (2023)
Why Prompt Optimization Works, and Why It Sometimes Doesn't: A Causal-Inspired Edit-Level Analysis
by: Gong, Shuzhi, et al.
Published: (2026)
by: Gong, Shuzhi, et al.
Published: (2026)
Improving Sequence-to-Sequence Models for Abstractive Text Summarization Using Meta Heuristic Approaches
by: Saxena, Aditya, et al.
Published: (2024)
by: Saxena, Aditya, et al.
Published: (2024)
Improving Language Plasticity via Pretraining with Active Forgetting
by: Chen, Yihong, et al.
Published: (2023)
by: Chen, Yihong, et al.
Published: (2023)
B'MOJO: Hybrid State Space Realizations of Foundation Models with Eidetic and Fading Memory
by: Zancato, Luca, et al.
Published: (2024)
by: Zancato, Luca, et al.
Published: (2024)
Learning Numeracy: Binary Arithmetic with Neural Turing Machines
by: Castellini, Jacopo
Published: (2019)
by: Castellini, Jacopo
Published: (2019)
A Transformer-based Neural Architecture Search Method
by: Wang, Shang, et al.
Published: (2025)
by: Wang, Shang, et al.
Published: (2025)
Towards Faster k-Nearest-Neighbor Machine Translation
by: Shi, Xiangyu, et al.
Published: (2023)
by: Shi, Xiangyu, et al.
Published: (2023)
A Gauge Theory of Superposition: Toward a Sheaf-Theoretic Atlas of Neural Representations
by: Javidnia, Hossein
Published: (2026)
by: Javidnia, Hossein
Published: (2026)
Differential Evolution Algorithm based Hyper-Parameters Selection of Transformer Neural Network Model for Load Forecasting
by: Sen, Anuvab, et al.
Published: (2023)
by: Sen, Anuvab, et al.
Published: (2023)
Rethinking Deep Learning: Non-backpropagation and Non-optimization Machine Learning Approach Using Hebbian Neural Networks
by: Itoh, Kei
Published: (2024)
by: Itoh, Kei
Published: (2024)
On The Expressivity of Recurrent Neural Cascades
by: Knorozova, Nadezda Alexandrovna, et al.
Published: (2023)
by: Knorozova, Nadezda Alexandrovna, et al.
Published: (2023)
The Stacked Autoencoder Evolution Hypothesis
by: Iizuka, Hiroyuki
Published: (2026)
by: Iizuka, Hiroyuki
Published: (2026)
Similar Items
-
Neural Information Organizing and Processing -- Neural Machines
by: Petrila, Iosif Iulian
Published: (2024) -
Large Language Models for Tuning Evolution Strategies
by: Kramer, Oliver
Published: (2024) -
EvoX: Meta-Evolution for Automated Discovery
by: Liu, Shu, et al.
Published: (2026) -
Intelligent Neural Networks: From Layered Architectures to Graph-Organized Intelligence
by: Salomon, Antoine
Published: (2025) -
SpikeGPT: Generative Pre-trained Language Model with Spiking Neural Networks
by: Zhu, Rui-Jie, et al.
Published: (2023)