:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Su, Haoran, You, Chenyu
Format:	Preprint
Published:	2026
Subjects:	Machine Learning Artificial Intelligence
Online Access:	https://arxiv.org/abs/2601.01014
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Emergency Preemption Without Online Exploration: A Decision Transformer Approach
by: Su, Haoran, et al.
Published: (2026)

Scaling Attention via Feature Sparsity
by: Xie, Yan, et al.
Published: (2026)

ShiftAddViT: Mixture of Multiplication Primitives Towards Efficient Vision Transformer
by: You, Haoran, et al.
Published: (2023)

Spatiotemporal Decision Transformer for Traffic Coordination
by: Su, Haoran, et al.
Published: (2026)

ManifoldFormer: Geometric Deep Learning for Neural Dynamics on Riemannian Manifolds
by: Fu, Yihang, et al.
Published: (2025)

LASS-ODE: Scaling ODE Computations to Connect Foundation Models with Dynamical Physical Systems
by: Li, Haoran, et al.
Published: (2026)

Geometric Scaling of Bayesian Inference in LLMs
by: Agarwal, Naman, et al.
Published: (2025)

Mathematical Foundations of Geometric Deep Learning
by: Borde, Haitz Sáez de Ocáriz, et al.
Published: (2025)

Calibration and Transformation-Free Weight-Only LLMs Quantization via Dynamic Grouping
by: Zheng, Xinzhe, et al.
Published: (2025)

Max-Affine Spline Insights Into Deep Network Pruning
by: You, Haoran, et al.
Published: (2021)

Scale-Consistent State-Space Dynamics via Fractal of Stationary Transformations
by: Yu, Geunhyeok, et al.
Published: (2026)

Spectral Convolution on Orbifolds for Geometric Deep Learning
by: Mangliers, Tim, et al.
Published: (2026)

On the Completeness of Invariant Geometric Deep Learning Models
by: Li, Zian, et al.
Published: (2024)

Is Distance Matrix Enough for Geometric Deep Learning?
by: Li, Zian, et al.
Published: (2023)

Scaling Diffusion Transformers Efficiently via $μ$P
by: Zheng, Chenyu, et al.
Published: (2025)

PROMISE-AD: Progression-aware Multi-horizon Survival Estimation for Alzheimer's Disease Progression and Dynamic Tracking
by: Lyu, Qing, et al.
Published: (2026)

Unifying Learning Dynamics and Generalization in Transformers Scaling Law
by: Yang, Chiwun
Published: (2025)

DyCE: Dynamically Configurable Exiting for Deep Learning Compression and Real-time Scaling
by: Wang, Qingyuan, et al.
Published: (2024)

Flag Varieties: A Geometric Framework for Deep Network Alignment
by: Xiao, Jingchuan, et al.
Published: (2026)

PHGNN: A Novel Prompted Hypergraph Neural Network to Diagnose Alzheimer's Disease
by: Liu, Chenyu, et al.
Published: (2025)

The Geometric Anatomy of Capability Acquisition in Transformers
by: Billa, Jayadev
Published: (2026)

Geometric Attention: A Regime-Explicit Operator Semantics for Transformer Attention
by: Freytes, Luis Rosario
Published: (2026)

Euclidean, Projective, Conformal: Choosing a Geometric Algebra for Equivariant Transformers
by: de Haan, Pim, et al.
Published: (2023)

Soft Geometric Inductive Bias for Object Centric Dynamics
by: Linander, Hampus, et al.
Published: (2025)

Geometric Dynamics of Agentic Loops in Large Language Models
by: Tacheny, Nicolas
Published: (2025)

ShiftAddNAS: Hardware-Inspired Search for More Accurate and Efficient Neural Networks
by: You, Haoran, et al.
Published: (2022)

A Geometrically-Grounded Drive for MDL-Based Optimization in Deep Learning
by: Lei, Ming, et al.
Published: (2026)

Latent Mixture of Symmetries for Sample-Efficient Dynamic Learning
by: Li, Haoran, et al.
Published: (2025)

Scaling Sequential Recommendation Models with Transformers
by: Zivic, Pablo, et al.
Published: (2024)

Large Language Models for Constructing and Optimizing Machine Learning Workflows: A Survey
by: Gu, Yang, et al.
Published: (2024)

Deep Minds and Shallow Probes
by: Lee, Su Hyeong, et al.
Published: (2026)

EnTransformer: A Deep Generative Transformer for Multivariate Probabilistic Forecasting
by: Pathak, Rajdeep, et al.
Published: (2026)

Making Batch Normalization Great in Federated Deep Learning
by: Zhong, Jike, et al.
Published: (2023)

Emergent Causal-Geometric Dynamics Across Depth in Large Language Models
by: Haim, Shahar, et al.
Published: (2026)

Geometric Neural Operators via Lie Group-Constrained Latent Dynamics
by: Zhang, Jiaquan, et al.
Published: (2026)

Predicting Human Brain States with Transformer
by: Sun, Yifei, et al.
Published: (2024)

Generalised Linear Models in Deep Bayesian RL with Learnable Basis Functions
by: You, Jingyang, et al.
Published: (2025)

No More K-means: Single-Stage Sparse Coding for Efficient Multi-Vector Retrieval
by: Guo, Lixuan, et al.
Published: (2026)

Dynamically Scaled Activation Steering
by: Ferrando, Alex, et al.
Published: (2025)

GeSS: Benchmarking Geometric Deep Learning under Scientific Applications with Distribution Shifts
by: Zou, Deyu, et al.
Published: (2023)