Saved in:
| Main Authors: | Carvalho, Breno W., Garcez, Artur S. d'Avila, Lamb, Luís C., Brazil, Emílio Vital |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2502.01774 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
FocusLearn: Fully-Interpretable, High-Performance Modular Neural Networks for Time Series
by: Su, Qiqi, et al.
Published: (2023)
by: Su, Qiqi, et al.
Published: (2023)
A Semantic Framework for Neuro-Symbolic Computing
by: Odense, Simon, et al.
Published: (2022)
by: Odense, Simon, et al.
Published: (2022)
Towards a Foundation Model for Partial Differential Equations Across Physics Domains
by: Soares, Eduardo, et al.
Published: (2025)
by: Soares, Eduardo, et al.
Published: (2025)
Neurosymbolic Deep Learning Semantics
by: Garcez, Artur d'Avila, et al.
Published: (2025)
by: Garcez, Artur d'Avila, et al.
Published: (2025)
Reasoning in Neurosymbolic AI
by: Tran, Son, et al.
Published: (2025)
by: Tran, Son, et al.
Published: (2025)
Explorations of the Softmax Space: Knowing When the Neural Network Doesn't Know
by: Sikar, Daniel, et al.
Published: (2025)
by: Sikar, Daniel, et al.
Published: (2025)
A Large Encoder-Decoder Family of Foundation Models For Chemical Language
by: Soares, Eduardo, et al.
Published: (2024)
by: Soares, Eduardo, et al.
Published: (2024)
Contrastive vision-language learning with paraphrasing and negation
by: Ngan, Kwun Ho, et al.
Published: (2025)
by: Ngan, Kwun Ho, et al.
Published: (2025)
From Neural Networks to Logical Theories: The Correspondence between Fibring Modal Logics and Fibring Neural Networks
by: Harzli, Ouns El, et al.
Published: (2025)
by: Harzli, Ouns El, et al.
Published: (2025)
Understanding Structural Representation in Foundation Models for Polymers
by: Park, Nathaniel H., et al.
Published: (2025)
by: Park, Nathaniel H., et al.
Published: (2025)
Large Language Models as Oracles for Ontology Alignment
by: Lushnei, Sviatoslav, et al.
Published: (2025)
by: Lushnei, Sviatoslav, et al.
Published: (2025)
Topological Signatures of Grokking
by: Tang, Yifan, et al.
Published: (2026)
by: Tang, Yifan, et al.
Published: (2026)
Understanding Grokking Through A Robustness Viewpoint
by: Tan, Zhiquan, et al.
Published: (2023)
by: Tan, Zhiquan, et al.
Published: (2023)
Grokking in Linear Models for Logistic Regression
by: Das, Nataraj, et al.
Published: (2026)
by: Das, Nataraj, et al.
Published: (2026)
Controlling Grokking with Nonlinearity and Data Symmetry
by: Salah, Ahmed, et al.
Published: (2024)
by: Salah, Ahmed, et al.
Published: (2024)
Grokfast: Accelerated Grokking by Amplifying Slow Gradients
by: Lee, Jaerin, et al.
Published: (2024)
by: Lee, Jaerin, et al.
Published: (2024)
Progress Measures for Grokking on Real-world Tasks
by: Golechha, Satvik
Published: (2024)
by: Golechha, Satvik
Published: (2024)
Muon Optimizer Accelerates Grokking
by: Tveit, Amund, et al.
Published: (2025)
by: Tveit, Amund, et al.
Published: (2025)
Grokking Finite-Dimensional Algebra
by: Notsawo, Pascal Jr Tikeng, et al.
Published: (2026)
by: Notsawo, Pascal Jr Tikeng, et al.
Published: (2026)
Grokking Group Multiplication with Cosets
by: Stander, Dashiell, et al.
Published: (2023)
by: Stander, Dashiell, et al.
Published: (2023)
Tracing the Path to Grokking: Embeddings, Dropout, and Network Activation
by: Salah, Ahmed, et al.
Published: (2025)
by: Salah, Ahmed, et al.
Published: (2025)
The Geometry of Grokking: Norm Minimization on the Zero-Loss Manifold
by: Musat, Tiberiu
Published: (2025)
by: Musat, Tiberiu
Published: (2025)
NeuralGrok: Accelerate Grokking by Neural Gradient Transformation
by: Zhou, Xinyu, et al.
Published: (2025)
by: Zhou, Xinyu, et al.
Published: (2025)
Low-Dimensional and Transversely Curved Optimization Dynamics in Grokking
by: Xu, Yongzhong
Published: (2026)
by: Xu, Yongzhong
Published: (2026)
Grokking and Generalization Collapse: Insights from \texttt{HTSR} theory
by: Prakash, Hari K., et al.
Published: (2025)
by: Prakash, Hari K., et al.
Published: (2025)
Early-Warning Signals of Grokking via Loss-Landscape Geometry
by: Xu, Yongzhong
Published: (2026)
by: Xu, Yongzhong
Published: (2026)
Grokking as Structural Inference: Transformers Need Bayesian Lottery Tickets
by: Hidajat, Kai, et al.
Published: (2026)
by: Hidajat, Kai, et al.
Published: (2026)
Two Speeds of Learning: A Representation-Readout Decomposition of Grokking and Double Descent
by: Chou, Chi-Ning, et al.
Published: (2026)
by: Chou, Chi-Ning, et al.
Published: (2026)
Grokking as a Falsifiable Finite-Size Transition
by: Bi, Yuda, et al.
Published: (2026)
by: Bi, Yuda, et al.
Published: (2026)
Provable Scaling Laws of Feature Emergence from Learning Dynamics of Grokking
by: Tian, Yuandong
Published: (2025)
by: Tian, Yuandong
Published: (2025)
Acceleration of Grokking in Learning Arithmetic Operations via Kolmogorov-Arnold Representation
by: Park, Yeachan, et al.
Published: (2024)
by: Park, Yeachan, et al.
Published: (2024)
Grokking at the Edge of Numerical Stability
by: Prieto, Lucas, et al.
Published: (2025)
by: Prieto, Lucas, et al.
Published: (2025)
The Norm-Separation Delay Law of Grokking: A First-Principles Theory of Delayed Generalization
by: Khanh, Truong Xuan, et al.
Published: (2026)
by: Khanh, Truong Xuan, et al.
Published: (2026)
Dichotomy of Early and Late Phase Implicit Biases Can Provably Induce Grokking
by: Lyu, Kaifeng, et al.
Published: (2023)
by: Lyu, Kaifeng, et al.
Published: (2023)
Towards Empirical Interpretation of Internal Circuits and Properties in Grokked Transformers on Modular Polynomials
by: Furuta, Hiroki, et al.
Published: (2024)
by: Furuta, Hiroki, et al.
Published: (2024)
Feature Repulsion and Spectral Lock-in: An Empirical Study of Two-Layer Network Grokking
by: Xu, Yongzhong
Published: (2026)
by: Xu, Yongzhong
Published: (2026)
Grokking Beyond the Euclidean Norm of Model Parameters
by: Notsawo, Pascal Jr Tikeng, et al.
Published: (2025)
by: Notsawo, Pascal Jr Tikeng, et al.
Published: (2025)
The Geometry of Multi-Task Grokking: Transverse Instability, Superposition, and Weight Decay Phase Structure
by: Xu, Yongzhong
Published: (2026)
by: Xu, Yongzhong
Published: (2026)
Grokking as a Variance-Limited Phase Transition: Spectral Gating and the Epsilon-Stability Threshold
by: Acharya, Pratyush, et al.
Published: (2026)
by: Acharya, Pratyush, et al.
Published: (2026)
Explaining Datasets in Words: Statistical Models with Natural Language Parameters
by: Zhong, Ruiqi, et al.
Published: (2024)
by: Zhong, Ruiqi, et al.
Published: (2024)
Similar Items
-
FocusLearn: Fully-Interpretable, High-Performance Modular Neural Networks for Time Series
by: Su, Qiqi, et al.
Published: (2023) -
A Semantic Framework for Neuro-Symbolic Computing
by: Odense, Simon, et al.
Published: (2022) -
Towards a Foundation Model for Partial Differential Equations Across Physics Domains
by: Soares, Eduardo, et al.
Published: (2025) -
Neurosymbolic Deep Learning Semantics
by: Garcez, Artur d'Avila, et al.
Published: (2025) -
Reasoning in Neurosymbolic AI
by: Tran, Son, et al.
Published: (2025)