Saved in:
| Main Author: | He, Yannis Y. |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2501.02007 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Echo State Transformer: Attention Over Finite Memories
by: Bendi-Ouis, Yannis, et al.
Published: (2025)
by: Bendi-Ouis, Yannis, et al.
Published: (2025)
Transformer-based Graph Neural Networks for Battery Range Prediction in AIoT Battery-Swap Services
by: Li, Zhao, et al.
Published: (2024)
by: Li, Zhao, et al.
Published: (2024)
Split Federated Learning Architectures for High-Accuracy and Low-Delay Model Training
by: Papageorgiou, Yiannis, et al.
Published: (2026)
by: Papageorgiou, Yiannis, et al.
Published: (2026)
How Transformers Learn to Plan via Multi-Token Prediction
by: Huang, Jianhao, et al.
Published: (2026)
by: Huang, Jianhao, et al.
Published: (2026)
Rethinking Tokenized Graph Transformers for Node Classification
by: Chen, Jinsong, et al.
Published: (2025)
by: Chen, Jinsong, et al.
Published: (2025)
Global-Lens Transformers: Adaptive Token Mixing for Dynamic Link Prediction
by: Zou, Tao, et al.
Published: (2025)
by: Zou, Tao, et al.
Published: (2025)
ONNX-Net: Towards Universal Representations and Instant Performance Prediction for Neural Architectures
by: Qin, Shiwen, et al.
Published: (2025)
by: Qin, Shiwen, et al.
Published: (2025)
Efficient Real-Time Aircraft ETA Prediction via Feature Tokenization Transformer
by: Huang, Liping, et al.
Published: (2025)
by: Huang, Liping, et al.
Published: (2025)
Transformers are Graph Neural Networks
by: Joshi, Chaitanya K.
Published: (2025)
by: Joshi, Chaitanya K.
Published: (2025)
Flexible Parallel Neural Network Architecture Model for Early Prediction of Lithium Battery Life
by: Jiang, Lidang, et al.
Published: (2024)
by: Jiang, Lidang, et al.
Published: (2024)
A Novel Deep Neural Network Architecture for Real-Time Water Demand Forecasting
by: Salloom, Tony, et al.
Published: (2025)
by: Salloom, Tony, et al.
Published: (2025)
Triple Attention Transformer Architecture for Time-Dependent Concrete Creep Prediction
by: Dokduea, Warayut, et al.
Published: (2025)
by: Dokduea, Warayut, et al.
Published: (2025)
Generative Modeling of Networked Time-Series via Transformer Architectures
by: Elnady, Yusuf
Published: (2025)
by: Elnady, Yusuf
Published: (2025)
A Transformer-based Neural Architecture Search Method
by: Wang, Shang, et al.
Published: (2025)
by: Wang, Shang, et al.
Published: (2025)
TA-RNN: an Attention-based Time-aware Recurrent Neural Network Architecture for Electronic Health Records
by: Olaimat, Mohammad Al, et al.
Published: (2024)
by: Olaimat, Mohammad Al, et al.
Published: (2024)
Graph Tokenization for Bridging Graphs and Transformers
by: Guo, Zeyuan, et al.
Published: (2026)
by: Guo, Zeyuan, et al.
Published: (2026)
Gradient Flow Convergence Guarantee for General Neural Network Architectures
by: Jakhmola, Yash
Published: (2025)
by: Jakhmola, Yash
Published: (2025)
On Limitations of the Transformer Architecture
by: Peng, Binghui, et al.
Published: (2024)
by: Peng, Binghui, et al.
Published: (2024)
TokenButler: Token Importance is Predictable
by: Akhauri, Yash, et al.
Published: (2025)
by: Akhauri, Yash, et al.
Published: (2025)
Enhanced Graph Transformer with Serialized Graph Tokens
by: Wang, Ruixiang, et al.
Published: (2026)
by: Wang, Ruixiang, et al.
Published: (2026)
Müntz-Szász Networks: Neural Architectures with Learnable Power-Law Bases
by: N'guessan, Gnankan Landry Regis
Published: (2025)
by: N'guessan, Gnankan Landry Regis
Published: (2025)
Clustering-based Multitasking Deep Neural Network for Solar Photovoltaics Power Generation Prediction
by: Song, Hui, et al.
Published: (2024)
by: Song, Hui, et al.
Published: (2024)
A Law of Next-Token Prediction in Large Language Models
by: He, Hangfeng, et al.
Published: (2024)
by: He, Hangfeng, et al.
Published: (2024)
Younger: The First Dataset for Artificial Intelligence-Generated Neural Network Architecture
by: Yang, Zhengxin, et al.
Published: (2024)
by: Yang, Zhengxin, et al.
Published: (2024)
Enhancing Price Prediction in Cryptocurrency Using Transformer Neural Network and Technical Indicators
by: Khaniki, Mohammad Ali Labbaf, et al.
Published: (2024)
by: Khaniki, Mohammad Ali Labbaf, et al.
Published: (2024)
Pre-Hoc Predictions in AutoML: Leveraging LLMs to Enhance Model Selection and Benchmarking for Tabular datasets
by: Belkhiter, Yannis, et al.
Published: (2025)
by: Belkhiter, Yannis, et al.
Published: (2025)
Efficient Neural Networks with Discrete Cosine Transform Activations
by: Martinez-Gost, Marc, et al.
Published: (2025)
by: Martinez-Gost, Marc, et al.
Published: (2025)
Artificial Intelligence-Driven Network-on-Chip Design Space Exploration: Neural Network Architectures for Design
by: N, Amogh Anshu, et al.
Published: (2025)
by: N, Amogh Anshu, et al.
Published: (2025)
Architecture Determines Observability of Transformers
by: Carmichael, Thomas
Published: (2026)
by: Carmichael, Thomas
Published: (2026)
FlowerFormer: Empowering Neural Architecture Encoding using a Flow-aware Graph Transformer
by: Hwang, Dongyeong, et al.
Published: (2024)
by: Hwang, Dongyeong, et al.
Published: (2024)
Conformalized Link Prediction on Graph Neural Networks
by: Zhao, Tianyi, et al.
Published: (2024)
by: Zhao, Tianyi, et al.
Published: (2024)
Stock Price Prediction using Multi-Faceted Information based on Deep Recurrent Neural Networks
by: Shahbandari, Lida, et al.
Published: (2024)
by: Shahbandari, Lida, et al.
Published: (2024)
YZS-model: A Predictive Model for Organic Drug Solubility Based on Graph Convolutional Networks and Transformer-Attention
by: Wang, Chenxu, et al.
Published: (2024)
by: Wang, Chenxu, et al.
Published: (2024)
Teasing Apart Architecture and Initial Weights as Sources of Inductive Bias in Neural Networks
by: Bencomo, Gianluca, et al.
Published: (2025)
by: Bencomo, Gianluca, et al.
Published: (2025)
Encodings for Prediction-based Neural Architecture Search
by: Akhauri, Yash, et al.
Published: (2024)
by: Akhauri, Yash, et al.
Published: (2024)
Multi-Microphone Speech Emotion Recognition using the Hierarchical Token-semantic Audio Transformer Architecture
by: Cohen, Ohad, et al.
Published: (2024)
by: Cohen, Ohad, et al.
Published: (2024)
Permutation-Invariant Transformer Neural Architectures for Set-Based Indoor Localization Using Learned RSSI Embeddings
by: Aristorenas, Aris J.
Published: (2025)
by: Aristorenas, Aris J.
Published: (2025)
Efficient and Interpretable Neural Networks Using Complex Lehmer Transform
by: Ataei, Masoud, et al.
Published: (2025)
by: Ataei, Masoud, et al.
Published: (2025)
Self-Explaining Hypergraph Neural Networks for Diagnosis Prediction
by: Yu, Leisheng, et al.
Published: (2025)
by: Yu, Leisheng, et al.
Published: (2025)
Contextual Quantum Neural Networks for Stock Price Prediction
by: Mourya, Sharan, et al.
Published: (2025)
by: Mourya, Sharan, et al.
Published: (2025)
Similar Items
-
Echo State Transformer: Attention Over Finite Memories
by: Bendi-Ouis, Yannis, et al.
Published: (2025) -
Transformer-based Graph Neural Networks for Battery Range Prediction in AIoT Battery-Swap Services
by: Li, Zhao, et al.
Published: (2024) -
Split Federated Learning Architectures for High-Accuracy and Low-Delay Model Training
by: Papageorgiou, Yiannis, et al.
Published: (2026) -
How Transformers Learn to Plan via Multi-Token Prediction
by: Huang, Jianhao, et al.
Published: (2026) -
Rethinking Tokenized Graph Transformers for Node Classification
by: Chen, Jinsong, et al.
Published: (2025)