:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Author:	He, Yannis Y.
Format:	Preprint
Published:	2025
Subjects:	Machine Learning Artificial Intelligence
Online Access:	https://arxiv.org/abs/2501.02007
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Echo State Transformer: Attention Over Finite Memories
by: Bendi-Ouis, Yannis, et al.
Published: (2025)

Transformer-based Graph Neural Networks for Battery Range Prediction in AIoT Battery-Swap Services
by: Li, Zhao, et al.
Published: (2024)

Split Federated Learning Architectures for High-Accuracy and Low-Delay Model Training
by: Papageorgiou, Yiannis, et al.
Published: (2026)

How Transformers Learn to Plan via Multi-Token Prediction
by: Huang, Jianhao, et al.
Published: (2026)

Rethinking Tokenized Graph Transformers for Node Classification
by: Chen, Jinsong, et al.
Published: (2025)

Global-Lens Transformers: Adaptive Token Mixing for Dynamic Link Prediction
by: Zou, Tao, et al.
Published: (2025)

ONNX-Net: Towards Universal Representations and Instant Performance Prediction for Neural Architectures
by: Qin, Shiwen, et al.
Published: (2025)

Efficient Real-Time Aircraft ETA Prediction via Feature Tokenization Transformer
by: Huang, Liping, et al.
Published: (2025)

Transformers are Graph Neural Networks
by: Joshi, Chaitanya K.
Published: (2025)

Flexible Parallel Neural Network Architecture Model for Early Prediction of Lithium Battery Life
by: Jiang, Lidang, et al.
Published: (2024)

A Novel Deep Neural Network Architecture for Real-Time Water Demand Forecasting
by: Salloom, Tony, et al.
Published: (2025)

Triple Attention Transformer Architecture for Time-Dependent Concrete Creep Prediction
by: Dokduea, Warayut, et al.
Published: (2025)

Generative Modeling of Networked Time-Series via Transformer Architectures
by: Elnady, Yusuf
Published: (2025)

A Transformer-based Neural Architecture Search Method
by: Wang, Shang, et al.
Published: (2025)

TA-RNN: an Attention-based Time-aware Recurrent Neural Network Architecture for Electronic Health Records
by: Olaimat, Mohammad Al, et al.
Published: (2024)

Graph Tokenization for Bridging Graphs and Transformers
by: Guo, Zeyuan, et al.
Published: (2026)

Gradient Flow Convergence Guarantee for General Neural Network Architectures
by: Jakhmola, Yash
Published: (2025)

On Limitations of the Transformer Architecture
by: Peng, Binghui, et al.
Published: (2024)

TokenButler: Token Importance is Predictable
by: Akhauri, Yash, et al.
Published: (2025)

Enhanced Graph Transformer with Serialized Graph Tokens
by: Wang, Ruixiang, et al.
Published: (2026)

Müntz-Szász Networks: Neural Architectures with Learnable Power-Law Bases
by: N'guessan, Gnankan Landry Regis
Published: (2025)

Clustering-based Multitasking Deep Neural Network for Solar Photovoltaics Power Generation Prediction
by: Song, Hui, et al.
Published: (2024)

A Law of Next-Token Prediction in Large Language Models
by: He, Hangfeng, et al.
Published: (2024)

Younger: The First Dataset for Artificial Intelligence-Generated Neural Network Architecture
by: Yang, Zhengxin, et al.
Published: (2024)

Enhancing Price Prediction in Cryptocurrency Using Transformer Neural Network and Technical Indicators
by: Khaniki, Mohammad Ali Labbaf, et al.
Published: (2024)

Pre-Hoc Predictions in AutoML: Leveraging LLMs to Enhance Model Selection and Benchmarking for Tabular datasets
by: Belkhiter, Yannis, et al.
Published: (2025)

Efficient Neural Networks with Discrete Cosine Transform Activations
by: Martinez-Gost, Marc, et al.
Published: (2025)

Artificial Intelligence-Driven Network-on-Chip Design Space Exploration: Neural Network Architectures for Design
by: N, Amogh Anshu, et al.
Published: (2025)

Architecture Determines Observability of Transformers
by: Carmichael, Thomas
Published: (2026)

FlowerFormer: Empowering Neural Architecture Encoding using a Flow-aware Graph Transformer
by: Hwang, Dongyeong, et al.
Published: (2024)

Conformalized Link Prediction on Graph Neural Networks
by: Zhao, Tianyi, et al.
Published: (2024)

Stock Price Prediction using Multi-Faceted Information based on Deep Recurrent Neural Networks
by: Shahbandari, Lida, et al.
Published: (2024)

YZS-model: A Predictive Model for Organic Drug Solubility Based on Graph Convolutional Networks and Transformer-Attention
by: Wang, Chenxu, et al.
Published: (2024)

Teasing Apart Architecture and Initial Weights as Sources of Inductive Bias in Neural Networks
by: Bencomo, Gianluca, et al.
Published: (2025)

Encodings for Prediction-based Neural Architecture Search
by: Akhauri, Yash, et al.
Published: (2024)

Multi-Microphone Speech Emotion Recognition using the Hierarchical Token-semantic Audio Transformer Architecture
by: Cohen, Ohad, et al.
Published: (2024)

Permutation-Invariant Transformer Neural Architectures for Set-Based Indoor Localization Using Learned RSSI Embeddings
by: Aristorenas, Aris J.
Published: (2025)

Efficient and Interpretable Neural Networks Using Complex Lehmer Transform
by: Ataei, Masoud, et al.
Published: (2025)

Self-Explaining Hypergraph Neural Networks for Diagnosis Prediction
by: Yu, Leisheng, et al.
Published: (2025)

Contextual Quantum Neural Networks for Stock Price Prediction
by: Mourya, Sharan, et al.
Published: (2025)