Saved in:
| Main Authors: | Yang, Yujiao, Lian, Jing, Li, Linhui |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2503.02495 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Deceptive Diffusion: Generating Synthetic Adversarial Examples
by: Beerens, Lucas, et al.
Published: (2024)
by: Beerens, Lucas, et al.
Published: (2024)
The First MPDD Challenge: Multimodal Personality-aware Depression Detection
by: Fu, Changzeng, et al.
Published: (2025)
by: Fu, Changzeng, et al.
Published: (2025)
DecompKAN: Decomposed Patch-KAN for Long-Term Time Series Forecasting
by: Mysore, Naveen
Published: (2026)
by: Mysore, Naveen
Published: (2026)
Closing the Theory-Practice Gap in Spiking Transformers via Effective Dimension
by: Guo, Dongxin, et al.
Published: (2026)
by: Guo, Dongxin, et al.
Published: (2026)
JacNet: Learning Functions with Structured Jacobians
by: Lorraine, Jonathan, et al.
Published: (2024)
by: Lorraine, Jonathan, et al.
Published: (2024)
PolyGLU: State-Conditional Activation Routing in Transformer Feed-Forward Networks
by: Medeiros, Daniel Nobrega
Published: (2026)
by: Medeiros, Daniel Nobrega
Published: (2026)
Which Transformer to Favor: A Comparative Analysis of Efficiency in Vision Transformers
by: Nauen, Tobias Christian, et al.
Published: (2023)
by: Nauen, Tobias Christian, et al.
Published: (2023)
AssistedDS: Benchmarking How External Domain Knowledge Assists LLMs in Automated Data Science
by: Luo, An, et al.
Published: (2025)
by: Luo, An, et al.
Published: (2025)
A Hybrid Inductive-Transductive Network for Traffic Flow Imputation on Unsampled Locations
by: Rahimiasl, Mohammadmahdi, et al.
Published: (2025)
by: Rahimiasl, Mohammadmahdi, et al.
Published: (2025)
A Boltzmann-machine-enhanced Transformer For DNA Sequence Classification
by: Cao, Zhixuan, et al.
Published: (2026)
by: Cao, Zhixuan, et al.
Published: (2026)
Can Agentic AI Match the Performance of Human Data Scientists?
by: Luo, An, et al.
Published: (2025)
by: Luo, An, et al.
Published: (2025)
AgentDS Technical Report: Benchmarking the Future of Human-AI Collaboration in Domain-Specific Data Science
by: Luo, An, et al.
Published: (2026)
by: Luo, An, et al.
Published: (2026)
CVCM Track Circuits Pre-emptive Failure Diagnostics for Predictive Maintenance Using Deep Neural Networks
by: Mukherjee, Debdeep, et al.
Published: (2025)
by: Mukherjee, Debdeep, et al.
Published: (2025)
multivariateGPT: a decoder-only transformer for multivariate categorical and numeric data
by: Loza, Andrew J., et al.
Published: (2025)
by: Loza, Andrew J., et al.
Published: (2025)
Scaling Laws in the Tiny Regime: How Small Models Change Their Mistakes
by: Alnemari, Mohammed, et al.
Published: (2026)
by: Alnemari, Mohammed, et al.
Published: (2026)
Neural Conditional Transport Maps
by: Rodriguez-Pardo, Carlos, et al.
Published: (2025)
by: Rodriguez-Pardo, Carlos, et al.
Published: (2025)
GRALIS: A Unified Canonical Framework for Linear Attribution Methods via Riesz Representation
by: Fanale, Raimondo
Published: (2026)
by: Fanale, Raimondo
Published: (2026)
CellARC: Measuring Intelligence with Cellular Automata
by: Lžičař, Miroslav
Published: (2025)
by: Lžičař, Miroslav
Published: (2025)
GradES: Significantly Faster Training in Transformers with Gradient-Based Early Stopping
by: Wen, Qifu, et al.
Published: (2025)
by: Wen, Qifu, et al.
Published: (2025)
KAN vs LSTM Performance in Time Series Forecasting
by: Rather, Tabish Ali, et al.
Published: (2025)
by: Rather, Tabish Ali, et al.
Published: (2025)
Efficient Morphology-Control Co-Design via Stackelberg Proximal Policy Optimization
by: Dai, Yanning, et al.
Published: (2026)
by: Dai, Yanning, et al.
Published: (2026)
InhibiDistilbert: Knowledge Distillation for a ReLU and Addition-based Transformer
by: Zhang, Tony, et al.
Published: (2025)
by: Zhang, Tony, et al.
Published: (2025)
TaylorShift: Shifting the Complexity of Self-Attention from Squared to Linear (and Back) using Taylor-Softmax
by: Nauen, Tobias Christian, et al.
Published: (2024)
by: Nauen, Tobias Christian, et al.
Published: (2024)
Scalable, Technology-Agnostic Diagnosis and Predictive Maintenance for Point Machine using Deep Learning
by: Di Santi, Eduardo, et al.
Published: (2025)
by: Di Santi, Eduardo, et al.
Published: (2025)
Transfer learning with generative models for object detection on limited datasets
by: Paiano, Matteo, et al.
Published: (2024)
by: Paiano, Matteo, et al.
Published: (2024)
mlx-snn: Spiking Neural Networks on Apple Silicon via MLX
by: Qin, Jiahao
Published: (2026)
by: Qin, Jiahao
Published: (2026)
Explicit Dropout: Deterministic Regularization for Transformer Architectures
by: Agrawal, Vidhi, et al.
Published: (2026)
by: Agrawal, Vidhi, et al.
Published: (2026)
Complex-Valued Phase-Coherent Transformer
by: Hioki, Leona
Published: (2026)
by: Hioki, Leona
Published: (2026)
Few-Class Arena: A Benchmark for Efficient Selection of Vision Models and Dataset Difficulty Measurement
by: Cao, Bryan Bo, et al.
Published: (2024)
by: Cao, Bryan Bo, et al.
Published: (2024)
H-Model: Dynamic Neural Architectures for Adaptive Processing
by: Hospodarchuk, Dmytro
Published: (2025)
by: Hospodarchuk, Dmytro
Published: (2025)
Revisiting GAN with Bayes-Optimal Discrimination
by: Naeini, Mohammadreza Tavasoli, et al.
Published: (2025)
by: Naeini, Mohammadreza Tavasoli, et al.
Published: (2025)
Revisiting Non-separable Binary Classification and its Applications in Anomaly Detection
by: Lau, Matthew, et al.
Published: (2023)
by: Lau, Matthew, et al.
Published: (2023)
Optimizing Inference in Transformer-Based Models: A Multi-Method Benchmark
by: Ho, Siu Hang, et al.
Published: (2025)
by: Ho, Siu Hang, et al.
Published: (2025)
PH-VAE: A Polynomial Hierarchical Variational Autoencoder Towards Disentangled Representation Learning
by: Chen, Xi, et al.
Published: (2025)
by: Chen, Xi, et al.
Published: (2025)
Ice Cream Doesn't Cause Drowning: Benchmarking LLMs Against Statistical Pitfalls in Causal Inference
by: Du, Jin, et al.
Published: (2025)
by: Du, Jin, et al.
Published: (2025)
SigGate-GT: Taming Over-Smoothing in Graph Transformers via Sigmoid-Gated Attention
by: Guo, Dongxin, et al.
Published: (2026)
by: Guo, Dongxin, et al.
Published: (2026)
Semantic Retention and Extreme Compression in LLMs: Can We Have Both?
by: Laborde, Stanislas, et al.
Published: (2025)
by: Laborde, Stanislas, et al.
Published: (2025)
Rethinking Visual Intelligence: Insights from Video Pretraining
by: Acuaviva, Pablo, et al.
Published: (2025)
by: Acuaviva, Pablo, et al.
Published: (2025)
Pulse-Driven Neural Architecture: Learnable Oscillatory Dynamics for Robust Continuous-Time Sequence Processing
by: Sharma, Paras
Published: (2026)
by: Sharma, Paras
Published: (2026)
GraphNNK -- Graph Classification and Interpretability
by: Bolevic, Zeljko, et al.
Published: (2026)
by: Bolevic, Zeljko, et al.
Published: (2026)
Similar Items
-
Deceptive Diffusion: Generating Synthetic Adversarial Examples
by: Beerens, Lucas, et al.
Published: (2024) -
The First MPDD Challenge: Multimodal Personality-aware Depression Detection
by: Fu, Changzeng, et al.
Published: (2025) -
DecompKAN: Decomposed Patch-KAN for Long-Term Time Series Forecasting
by: Mysore, Naveen
Published: (2026) -
Closing the Theory-Practice Gap in Spiking Transformers via Effective Dimension
by: Guo, Dongxin, et al.
Published: (2026) -
JacNet: Learning Functions with Structured Jacobians
by: Lorraine, Jonathan, et al.
Published: (2024)