:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Yang, Yujiao, Lian, Jing, Li, Linhui
Format:	Preprint
Published:	2025
Subjects:	Machine Learning Artificial Intelligence Computation and Language 68T07 I.5.1; I.2.0
Online Access:	https://arxiv.org/abs/2503.02495
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Deceptive Diffusion: Generating Synthetic Adversarial Examples
by: Beerens, Lucas, et al.
Published: (2024)

The First MPDD Challenge: Multimodal Personality-aware Depression Detection
by: Fu, Changzeng, et al.
Published: (2025)

DecompKAN: Decomposed Patch-KAN for Long-Term Time Series Forecasting
by: Mysore, Naveen
Published: (2026)

Closing the Theory-Practice Gap in Spiking Transformers via Effective Dimension
by: Guo, Dongxin, et al.
Published: (2026)

JacNet: Learning Functions with Structured Jacobians
by: Lorraine, Jonathan, et al.
Published: (2024)

PolyGLU: State-Conditional Activation Routing in Transformer Feed-Forward Networks
by: Medeiros, Daniel Nobrega
Published: (2026)

Which Transformer to Favor: A Comparative Analysis of Efficiency in Vision Transformers
by: Nauen, Tobias Christian, et al.
Published: (2023)

AssistedDS: Benchmarking How External Domain Knowledge Assists LLMs in Automated Data Science
by: Luo, An, et al.
Published: (2025)

A Hybrid Inductive-Transductive Network for Traffic Flow Imputation on Unsampled Locations
by: Rahimiasl, Mohammadmahdi, et al.
Published: (2025)

A Boltzmann-machine-enhanced Transformer For DNA Sequence Classification
by: Cao, Zhixuan, et al.
Published: (2026)

Can Agentic AI Match the Performance of Human Data Scientists?
by: Luo, An, et al.
Published: (2025)

AgentDS Technical Report: Benchmarking the Future of Human-AI Collaboration in Domain-Specific Data Science
by: Luo, An, et al.
Published: (2026)

CVCM Track Circuits Pre-emptive Failure Diagnostics for Predictive Maintenance Using Deep Neural Networks
by: Mukherjee, Debdeep, et al.
Published: (2025)

multivariateGPT: a decoder-only transformer for multivariate categorical and numeric data
by: Loza, Andrew J., et al.
Published: (2025)

Scaling Laws in the Tiny Regime: How Small Models Change Their Mistakes
by: Alnemari, Mohammed, et al.
Published: (2026)

Neural Conditional Transport Maps
by: Rodriguez-Pardo, Carlos, et al.
Published: (2025)

GRALIS: A Unified Canonical Framework for Linear Attribution Methods via Riesz Representation
by: Fanale, Raimondo
Published: (2026)

CellARC: Measuring Intelligence with Cellular Automata
by: Lžičař, Miroslav
Published: (2025)

GradES: Significantly Faster Training in Transformers with Gradient-Based Early Stopping
by: Wen, Qifu, et al.
Published: (2025)

KAN vs LSTM Performance in Time Series Forecasting
by: Rather, Tabish Ali, et al.
Published: (2025)

Efficient Morphology-Control Co-Design via Stackelberg Proximal Policy Optimization
by: Dai, Yanning, et al.
Published: (2026)

InhibiDistilbert: Knowledge Distillation for a ReLU and Addition-based Transformer
by: Zhang, Tony, et al.
Published: (2025)

TaylorShift: Shifting the Complexity of Self-Attention from Squared to Linear (and Back) using Taylor-Softmax
by: Nauen, Tobias Christian, et al.
Published: (2024)

Scalable, Technology-Agnostic Diagnosis and Predictive Maintenance for Point Machine using Deep Learning
by: Di Santi, Eduardo, et al.
Published: (2025)

Transfer learning with generative models for object detection on limited datasets
by: Paiano, Matteo, et al.
Published: (2024)

mlx-snn: Spiking Neural Networks on Apple Silicon via MLX
by: Qin, Jiahao
Published: (2026)

Explicit Dropout: Deterministic Regularization for Transformer Architectures
by: Agrawal, Vidhi, et al.
Published: (2026)

Complex-Valued Phase-Coherent Transformer
by: Hioki, Leona
Published: (2026)

Few-Class Arena: A Benchmark for Efficient Selection of Vision Models and Dataset Difficulty Measurement
by: Cao, Bryan Bo, et al.
Published: (2024)

H-Model: Dynamic Neural Architectures for Adaptive Processing
by: Hospodarchuk, Dmytro
Published: (2025)

Revisiting GAN with Bayes-Optimal Discrimination
by: Naeini, Mohammadreza Tavasoli, et al.
Published: (2025)

Revisiting Non-separable Binary Classification and its Applications in Anomaly Detection
by: Lau, Matthew, et al.
Published: (2023)

Optimizing Inference in Transformer-Based Models: A Multi-Method Benchmark
by: Ho, Siu Hang, et al.
Published: (2025)

PH-VAE: A Polynomial Hierarchical Variational Autoencoder Towards Disentangled Representation Learning
by: Chen, Xi, et al.
Published: (2025)

Ice Cream Doesn't Cause Drowning: Benchmarking LLMs Against Statistical Pitfalls in Causal Inference
by: Du, Jin, et al.
Published: (2025)

SigGate-GT: Taming Over-Smoothing in Graph Transformers via Sigmoid-Gated Attention
by: Guo, Dongxin, et al.
Published: (2026)

Semantic Retention and Extreme Compression in LLMs: Can We Have Both?
by: Laborde, Stanislas, et al.
Published: (2025)

Rethinking Visual Intelligence: Insights from Video Pretraining
by: Acuaviva, Pablo, et al.
Published: (2025)

Pulse-Driven Neural Architecture: Learnable Oscillatory Dynamics for Robust Continuous-Time Sequence Processing
by: Sharma, Paras
Published: (2026)

GraphNNK -- Graph Classification and Interpretability
by: Bolevic, Zeljko, et al.
Published: (2026)