:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	He, Bobby, Hofmann, Thomas
Format:	Preprint
Published:	2023
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2311.01906
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Recurrent Distance Filtering for Graph Representation Learning
by: Ding, Yuhui, et al.
Published: (2023)

Understanding and Minimising Outlier Features in Neural Network Training
by: He, Bobby, et al.
Published: (2024)

Hallmarks of Optimization Trajectories in Neural Networks: Directional Exploration and Redundancy
by: Singh, Sidak Pal, et al.
Published: (2024)

Revisiting Knowledge Distillation: The Hidden Role of Dataset Size
by: Lanzillotta, Giulia, et al.
Published: (2025)

Simplified PCNet with Robustness
by: Li, Bingheng, et al.
Published: (2024)

SGFormer: Simplifying and Empowering Transformers for Large-Graph Representations
by: Wu, Qitian, et al.
Published: (2023)

Simplifying the Theory on Over-Smoothing
by: Roth, Andreas
Published: (2024)

Simplifying Graph Kernels for Efficient
by: Wang, Lin, et al.
Published: (2025)

Block-Recurrent Dynamics in Vision Transformers
by: Jacobs, Mozes, et al.
Published: (2025)

CVTGAD: Simplified Transformer with Cross-View Attention for Unsupervised Graph-level Anomaly Detection
by: Li, Jindong, et al.
Published: (2024)

CODA: Rewriting Transformer Blocks as GEMM-Epilogue Programs
by: Guo, Han, et al.
Published: (2026)

Equivalence of Context and Parameter Updates in Modern Transformer Blocks
by: Goldwaser, Adrian, et al.
Published: (2025)

Simplifying Deep Temporal Difference Learning
by: Gallici, Matteo, et al.
Published: (2024)

Adam Simplified: Bias Correction Debunked
by: Laing, Sam, et al.
Published: (2025)

RT-Transformer: The Transformer Block as a Spherical State Estimator
by: Racioppo, Peter
Published: (2026)

Subclass Classification of Gliomas Using MRI Fusion Technique
by: Janardhan, Kiranmayee, et al.
Published: (2025)

Functional Groups are All you Need for Chemically Interpretable Molecular Property Prediction
by: Balaji, Roshan, et al.
Published: (2025)

Model Sparsity Can Simplify Machine Unlearning
by: Jia, Jinghan, et al.
Published: (2023)

Simplifying Random Forests' Probabilistic Forecasts
by: Koster, Nils, et al.
Published: (2024)

Simplified and Generalized Masked Diffusion for Discrete Data
by: Shi, Jiaxin, et al.
Published: (2024)

Simplifying Adversarially Robust PAC Learning with Tolerance
by: Ashtiani, Hassan, et al.
Published: (2025)

RESCHED: Rethinking Flexible Job Shop Scheduling from a Transformer-based Architecture with Simplified States
by: Xiao, Xiangjie, et al.
Published: (2026)

Blocked Gibbs meets Diffusion Transformers: Unsupervised Learning for Constraint Optimization
by: Xu, Yudong W., et al.
Published: (2026)

Interpretability Illusions in the Generalization of Simplified Models
by: Friedman, Dan, et al.
Published: (2023)

Simplifying Optimal Transport through Schatten-$p$ Regularization
by: Maunu, Tyler
Published: (2025)

Simplifying Graph Convolutional Networks with Redundancy-Free Neighbors
by: Lu, Jielong, et al.
Published: (2025)

Simplifying, Stabilizing and Scaling Continuous-Time Consistency Models
by: Lu, Cheng, et al.
Published: (2024)

Normalizing Flow Regression for Bayesian Inference with Offline Likelihood Evaluations
by: Li, Chengkun, et al.
Published: (2025)

Simplified Diffusion Schrödinger Bridge
by: Tang, Zhicong, et al.
Published: (2024)

SparseSwin: Swin Transformer with Sparse Transformer Block
by: Pinasthika, Krisna, et al.
Published: (2023)

Adaptive Block Sparse Regularization under Arbitrary Linear Transform
by: Furuhashi, Takanobu, et al.
Published: (2024)

Diffusion Models under Alternative Noise: Simplified Analysis and Sensitivity
by: Choi, Juhyeok, et al.
Published: (2025)

Reservoir Computing for Fast, Simplified Reinforcement Learning on Memory Tasks
by: McKee, Kevin
Published: (2024)

Simplifying Bayesian Optimization Via In-Context Direct Optimum Sampling
by: de Carvalho, Gustavo Sutter Pessurno, et al.
Published: (2025)

SStaGCN: Simplified stacking based graph convolutional networks
by: Cai, Jia, et al.
Published: (2021)

Simplifying Latent Dynamics with Softly State-Invariant World Models
by: Saanum, Tankred, et al.
Published: (2024)

Putting It All into Context: Simplifying Agents with LCLMs
by: Jiang, Mingjian, et al.
Published: (2025)

MABViT -- Modified Attention Block Enhances Vision Transformers
by: Ramesh, Mahesh, et al.
Published: (2023)

Block Selective Reprogramming for On-device Training of Vision Transformers
by: Sarkar, Sreetama, et al.
Published: (2024)

BlockCert: Certified Blockwise Extraction of Transformer Mechanisms
by: Andric, Sandro
Published: (2025)