| Main Authors: | Zhu, Meng, Xiao, Quan, Min, Weidong |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2511.13465 |
Similar Items
FlowAdam: Implicit Regularization via Geometry-Aware Soft Momentum Injection
by: Singh, Devender, et al.
Published: (2026)
Kourkoutas-Beta: A Sunspike-Driven Adam Optimizer with Desert Flair
by: Kassinos, Stavros C.
Published: (2025)
Hidden Failure Modes of Gradient Modification under Adam in Continual Learning, and Adaptive Decoupled Moment Routing as a Repair
by: Hu, Yuelin, et al.
Published: (2026)
Fusing Rewards and Preferences in Reinforcement Learning
by: Khorasani, Sadegh, et al.
Published: (2025)
ZetA: A Riemann Zeta-Scaled Extension of Adam for Deep Learning
by: BC, Samiksha
Published: (2025)
Adam Improves Muon: Adaptive Moment Estimation with Orthogonalized Momentum
by: Zhang, Minxin, et al.
Published: (2026)
Predicting and improving test-time scaling laws via reward tail-guided search
by: Li, Muheng, et al.
Published: (2026)
TIFeD: a Tiny Integer-based Federated learning algorithm with Direct feedback alignment
by: Colombo, Luca, et al.
Published: (2024)
NoRIN: Backbone-Adaptive Reversible Normalization for Time-Series Forecasting
by: Zhang, Shun, et al.
Published: (2026)
Deep Memory Search: A Metaheuristic Approach for Optimizing Heuristic Search
by: Hedar, Abdel-Rahman, et al.
Published: (2024)
Adaptive Epsilon Adversarial Training for Robust Gravitational Wave Parameter Estimation Using Normalizing Flows
by: Yang, Yiqian, et al.
Published: (2024)
Versatile Ordering Network: An Attention-based Neural Network for Ordering Across Scales and Quality Metrics
by: Yu, Zehua, et al.
Published: (2024)
Securing Reliability: A Brief Overview on Enhancing In-Context Learning for Foundation Models
by: Huang, Yunpeng, et al.
Published: (2024)
rmlnomogram: An R package to construct an explainable nomogram for any machine learning algorithms
by: Sufriyana, Herdiantri, et al.
Published: (2025)
FluidWorld: Reaction-Diffusion Dynamics as a Predictive Substrate for World Models
by: Polly, Fabien
Published: (2026)
I-GLIDE: Input Groups for Latent Health Indicators in Degradation Estimation
by: Thil, Lucas, et al.
Published: (2025)
AI and Machine Learning Approaches for Predicting Nanoparticles Toxicity The Critical Role of Physiochemical Properties
by: Yousaf, Iqra
Published: (2024)
Mamba base PKD for efficient knowledge compression
by: Medina, José, et al.
Published: (2025)
Data structure > labels? Unsupervised heuristics for SVM hyperparameter estimation
by: Cholewa, Michał, et al.
Published: (2021)
Minimum Description Length based Granular-Ball Tree Regularization for Spectral Clustering
by: Xian, Zeqiang, et al.
Published: (2026)
KerZOO: Kernel Function Informed Zeroth-Order Optimization for Accurate and Accelerated LLM Fine-Tuning
by: Mi, Zhendong, et al.
Published: (2025)
SATORIS-N: Spectral Analysis based Traffic Observation Recovery via Informed Subspaces and Nuclear-norm minimization
by: Mohanty, Sampad, et al.
Published: (2026)
On the logical skills of large language models: evaluations using arbitrarily complex first-order logic problems
by: Ibragimov, Shokhrukh, et al.
Published: (2025)
2Mamba2Furious: Linear in Complexity, Competitive in Accuracy
by: Mongaras, Gabriel, et al.
Published: (2026)
Tactile-Proprioceptive Sensor Fusion for Contact Wrench Estimation in Whole-Body Physical Human-Robot Interaction
by: Min, Junha, et al.
Published: (2026)
Towards geological inference with process-based and deep generative modeling, part 1: training on fluvial deposits
by: Rongier, Guillaume, et al.
Published: (2025)
Pre-trained Models Perform the Best When Token Distributions Follow Zipf's Law
by: He, Yanjin, et al.
Published: (2025)
FSC-Net: Fast-Slow Consolidation Networks for Continual Learning
by: Gorrim, Mohamed El
Published: (2025)
Action-Dependent Optimality-Preserving Reward Shaping
by: Forbes, Grant C., et al.
Published: (2025)
TFMAdapter: Lightweight Instance-Level Adaptation of Foundation Models for Forecasting with Covariates
by: Dange, Afrin, et al.
Published: (2025)
Downsized and Compromised?: Assessing the Faithfulness of Model Compression
by: Kamal, Moumita, et al.
Published: (2025)
How Many Ratings per Item are Necessary for Reliable Significance Testing?
by: Homan, Christopher, et al.
Published: (2024)
Learning Stochastic Nonlinear Dynamics with Embedded Latent Transfer Operators
by: Ke, Naichang, et al.
Published: (2025)
Deep Variational Inference Symbolic Regression
by: Butterworth, James, et al.
Published: (2026)
Potential-Based Reward Shaping For Intrinsic Motivation
by: Forbes, Grant C., et al.
Published: (2024)
Intervening to Learn and Compose Causally Disentangled Representations
by: Markham, Alex, et al.
Published: (2025)
How to Boost Any Loss Function
by: Nock, Richard, et al.
Published: (2024)
Learning Transferable Predictability Representations
by: Goswami, Diyali, et al.
Published: (2026)
A Survey of Reinforcement Learning from Human Feedback
by: Kaufmann, Timo, et al.
Published: (2023)
Discrete Latent Structure in Neural Networks
by: Niculae, Vlad, et al.
Published: (2023)