Saved in:
| Main Authors: | Su, Haoran, You, Chenyu |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2601.01014 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Emergency Preemption Without Online Exploration: A Decision Transformer Approach
by: Su, Haoran, et al.
Published: (2026)
by: Su, Haoran, et al.
Published: (2026)
Scaling Attention via Feature Sparsity
by: Xie, Yan, et al.
Published: (2026)
by: Xie, Yan, et al.
Published: (2026)
ShiftAddViT: Mixture of Multiplication Primitives Towards Efficient Vision Transformer
by: You, Haoran, et al.
Published: (2023)
by: You, Haoran, et al.
Published: (2023)
Spatiotemporal Decision Transformer for Traffic Coordination
by: Su, Haoran, et al.
Published: (2026)
by: Su, Haoran, et al.
Published: (2026)
ManifoldFormer: Geometric Deep Learning for Neural Dynamics on Riemannian Manifolds
by: Fu, Yihang, et al.
Published: (2025)
by: Fu, Yihang, et al.
Published: (2025)
LASS-ODE: Scaling ODE Computations to Connect Foundation Models with Dynamical Physical Systems
by: Li, Haoran, et al.
Published: (2026)
by: Li, Haoran, et al.
Published: (2026)
Geometric Scaling of Bayesian Inference in LLMs
by: Agarwal, Naman, et al.
Published: (2025)
by: Agarwal, Naman, et al.
Published: (2025)
Mathematical Foundations of Geometric Deep Learning
by: Borde, Haitz Sáez de Ocáriz, et al.
Published: (2025)
by: Borde, Haitz Sáez de Ocáriz, et al.
Published: (2025)
Calibration and Transformation-Free Weight-Only LLMs Quantization via Dynamic Grouping
by: Zheng, Xinzhe, et al.
Published: (2025)
by: Zheng, Xinzhe, et al.
Published: (2025)
Max-Affine Spline Insights Into Deep Network Pruning
by: You, Haoran, et al.
Published: (2021)
by: You, Haoran, et al.
Published: (2021)
Scale-Consistent State-Space Dynamics via Fractal of Stationary Transformations
by: Yu, Geunhyeok, et al.
Published: (2026)
by: Yu, Geunhyeok, et al.
Published: (2026)
Spectral Convolution on Orbifolds for Geometric Deep Learning
by: Mangliers, Tim, et al.
Published: (2026)
by: Mangliers, Tim, et al.
Published: (2026)
On the Completeness of Invariant Geometric Deep Learning Models
by: Li, Zian, et al.
Published: (2024)
by: Li, Zian, et al.
Published: (2024)
Is Distance Matrix Enough for Geometric Deep Learning?
by: Li, Zian, et al.
Published: (2023)
by: Li, Zian, et al.
Published: (2023)
Scaling Diffusion Transformers Efficiently via $μ$P
by: Zheng, Chenyu, et al.
Published: (2025)
by: Zheng, Chenyu, et al.
Published: (2025)
PROMISE-AD: Progression-aware Multi-horizon Survival Estimation for Alzheimer's Disease Progression and Dynamic Tracking
by: Lyu, Qing, et al.
Published: (2026)
by: Lyu, Qing, et al.
Published: (2026)
Unifying Learning Dynamics and Generalization in Transformers Scaling Law
by: Yang, Chiwun
Published: (2025)
by: Yang, Chiwun
Published: (2025)
DyCE: Dynamically Configurable Exiting for Deep Learning Compression and Real-time Scaling
by: Wang, Qingyuan, et al.
Published: (2024)
by: Wang, Qingyuan, et al.
Published: (2024)
Flag Varieties: A Geometric Framework for Deep Network Alignment
by: Xiao, Jingchuan, et al.
Published: (2026)
by: Xiao, Jingchuan, et al.
Published: (2026)
PHGNN: A Novel Prompted Hypergraph Neural Network to Diagnose Alzheimer's Disease
by: Liu, Chenyu, et al.
Published: (2025)
by: Liu, Chenyu, et al.
Published: (2025)
The Geometric Anatomy of Capability Acquisition in Transformers
by: Billa, Jayadev
Published: (2026)
by: Billa, Jayadev
Published: (2026)
Geometric Attention: A Regime-Explicit Operator Semantics for Transformer Attention
by: Freytes, Luis Rosario
Published: (2026)
by: Freytes, Luis Rosario
Published: (2026)
Euclidean, Projective, Conformal: Choosing a Geometric Algebra for Equivariant Transformers
by: de Haan, Pim, et al.
Published: (2023)
by: de Haan, Pim, et al.
Published: (2023)
Soft Geometric Inductive Bias for Object Centric Dynamics
by: Linander, Hampus, et al.
Published: (2025)
by: Linander, Hampus, et al.
Published: (2025)
Geometric Dynamics of Agentic Loops in Large Language Models
by: Tacheny, Nicolas
Published: (2025)
by: Tacheny, Nicolas
Published: (2025)
ShiftAddNAS: Hardware-Inspired Search for More Accurate and Efficient Neural Networks
by: You, Haoran, et al.
Published: (2022)
by: You, Haoran, et al.
Published: (2022)
A Geometrically-Grounded Drive for MDL-Based Optimization in Deep Learning
by: Lei, Ming, et al.
Published: (2026)
by: Lei, Ming, et al.
Published: (2026)
Latent Mixture of Symmetries for Sample-Efficient Dynamic Learning
by: Li, Haoran, et al.
Published: (2025)
by: Li, Haoran, et al.
Published: (2025)
Scaling Sequential Recommendation Models with Transformers
by: Zivic, Pablo, et al.
Published: (2024)
by: Zivic, Pablo, et al.
Published: (2024)
Large Language Models for Constructing and Optimizing Machine Learning Workflows: A Survey
by: Gu, Yang, et al.
Published: (2024)
by: Gu, Yang, et al.
Published: (2024)
Deep Minds and Shallow Probes
by: Lee, Su Hyeong, et al.
Published: (2026)
by: Lee, Su Hyeong, et al.
Published: (2026)
EnTransformer: A Deep Generative Transformer for Multivariate Probabilistic Forecasting
by: Pathak, Rajdeep, et al.
Published: (2026)
by: Pathak, Rajdeep, et al.
Published: (2026)
Making Batch Normalization Great in Federated Deep Learning
by: Zhong, Jike, et al.
Published: (2023)
by: Zhong, Jike, et al.
Published: (2023)
Emergent Causal-Geometric Dynamics Across Depth in Large Language Models
by: Haim, Shahar, et al.
Published: (2026)
by: Haim, Shahar, et al.
Published: (2026)
Geometric Neural Operators via Lie Group-Constrained Latent Dynamics
by: Zhang, Jiaquan, et al.
Published: (2026)
by: Zhang, Jiaquan, et al.
Published: (2026)
Predicting Human Brain States with Transformer
by: Sun, Yifei, et al.
Published: (2024)
by: Sun, Yifei, et al.
Published: (2024)
Generalised Linear Models in Deep Bayesian RL with Learnable Basis Functions
by: You, Jingyang, et al.
Published: (2025)
by: You, Jingyang, et al.
Published: (2025)
No More K-means: Single-Stage Sparse Coding for Efficient Multi-Vector Retrieval
by: Guo, Lixuan, et al.
Published: (2026)
by: Guo, Lixuan, et al.
Published: (2026)
Dynamically Scaled Activation Steering
by: Ferrando, Alex, et al.
Published: (2025)
by: Ferrando, Alex, et al.
Published: (2025)
GeSS: Benchmarking Geometric Deep Learning under Scientific Applications with Distribution Shifts
by: Zou, Deyu, et al.
Published: (2023)
by: Zou, Deyu, et al.
Published: (2023)
Similar Items
-
Emergency Preemption Without Online Exploration: A Decision Transformer Approach
by: Su, Haoran, et al.
Published: (2026) -
Scaling Attention via Feature Sparsity
by: Xie, Yan, et al.
Published: (2026) -
ShiftAddViT: Mixture of Multiplication Primitives Towards Efficient Vision Transformer
by: You, Haoran, et al.
Published: (2023) -
Spatiotemporal Decision Transformer for Traffic Coordination
by: Su, Haoran, et al.
Published: (2026) -
ManifoldFormer: Geometric Deep Learning for Neural Dynamics on Riemannian Manifolds
by: Fu, Yihang, et al.
Published: (2025)