:: Library Catalog

Bibliographic Details
Main Authors:	Cooper, Patrick; Velasquez, Alvaro
Format:	Preprint
Published:	2026
Subjects:	Machine Learning; Artificial Intelligence; 68T05; I.2.6; G.3; I.2.8
Online Access:	https://arxiv.org/abs/2602.02451

Similar Items

Modeling and Controlling Deployment Reliability under Temporal Distribution Shift
by: Rahman, Naimur, et al.
Published: (2026)

An Improved Adaptive PID Optimizer with Enhanced Convergence and Stability for Deep Learning
by: Saini, Saurabh, et al.
Published: (2026)

On Divergence Measures for Training GFlowNets
by: da Silva, Tiago, et al.
Published: (2024)

Statistical Guarantees for Lifelong Reinforcement Learning using PAC-Bayes Theory
by: Zhang, Zhi, et al.
Published: (2024)

XAutoLM: Efficient Fine-Tuning of Language Models via Meta-Learning and AutoML
by: Estevanell-Valladares, Ernesto L., et al.
Published: (2025)

Connectivity-Aware Representations for Constrained Motion Planning via Multi-Scale Contrastive Learning
by: Jeon, Suhyun, et al.
Published: (2026)

Evolving machine learning workflows through interactive AutoML
by: Barbudo, Rafael, et al.
Published: (2024)

Prediction-Based Markov Violation Scores for Detecting Non-Markovian Observations in Reinforcement Learning
by: Mysore, Naveen
Published: (2026)

Normalisation and Initialisation Strategies for Graph Neural Networks in Blockchain Anomaly Detection
by: Duy, Dang Sy, et al.
Published: (2026)

AI Agents: Evolution, Architecture, and Real-World Applications
by: Krishnan, Naveen
Published: (2025)

Adaptive Minds: Empowering Agents with LoRA-as-Tools
by: Shekar, Pavan C, et al.
Published: (2025)

Proving Olympiad Algebraic Inequalities without Human Demonstrations
by: Wei, Chenrui, et al.
Published: (2024)

COBRA-PPM: A Causal Bayesian Reasoning Architecture Using Probabilistic Programming for Robot Manipulation Under Uncertainty
by: Cannizzaro, Ricardo, et al.
Published: (2024)

Reinforcement Learning for Portfolio Optimization with a Financial Goal and Defined Time Horizons
by: Leukam, Fermat, et al.
Published: (2025)

Risk-Sensitive Option Market Making with Arbitrage-Free eSSVI Surfaces: A Constrained RL and Stochastic Control Bridge
by: Zhang, Jian'an
Published: (2025)

SafeRL-Lite: A Lightweight, Explainable, and Constrained Reinforcement Learning Library
by: Mishra, Satyam, et al.
Published: (2025)

The two clocks and the innovation window: When and how generative models learn rules
by: Wang, Binxu, et al.
Published: (2026)

Dual-Channel Feature Fusion for Joint Prediction in Dynamic Signed Weighted Networks
by: Zhang, Gaoxin, et al.
Published: (2026)

Emotion-Inspired Learning Signals (EILS): A Homeostatic Framework for Adaptive Autonomous Agents
by: Tiwari, Dhruv
Published: (2025)

Task Memory Engine (TME): Enhancing State Awareness for Multi-Step LLM Agent Tasks
by: Ye, Ye
Published: (2025)

A Comparison Between Decision Transformers and Traditional Offline Reinforcement Learning Algorithms
by: Caunhye, Ali Murtaza, et al.
Published: (2025)

Vis-CoT: A Human-in-the-Loop Framework for Interactive Visualization and Intervention in LLM Chain-of-Thought Reasoning
by: Pather, Kaviraj, et al.
Published: (2025)

Improving Hyperparameter Optimization with Checkpointed Model Weights
by: Mehta, Nikhil, et al.
Published: (2024)

Foundation Models as World Models: A Foundational Study in Text-Based GridWorlds
by: Sasso, Remo, et al.
Published: (2025)

Exploration with Foundation Models: Capabilities, Limitations, and Hybrid Approaches
by: Sasso, Remo, et al.
Published: (2025)

GoldenStart: Q-Guided Priors and Entropy Control for Distilling Flow Policies
by: Zhang, He, et al.
Published: (2026)

AI and Machine Learning Approaches for Predicting Nanoparticles Toxicity: The Critical Role of Physiochemical Properties
by: Yousaf, Iqra
Published: (2024)

Tail-Safe Hedging: Explainable Risk-Sensitive Reinforcement Learning with a White-Box CBF-QP Safety Layer in Arbitrage-Free Markets
by: Zhang, Jian'an
Published: (2025)

Scalable Nested Optimization for Deep Learning
by: Lorraine, Jonathan
Published: (2024)

Law-Strength Frontiers and a No-Free-Lunch Result for Law-Seeking Reinforcement Learning on Volatility Law Manifolds
by: Zhang, Jian'an
Published: (2025)

Actor-Critic Model Predictive Control: Differentiable Optimization meets Reinforcement Learning for Agile Flight
by: Romero, Angel, et al.
Published: (2023)

RoboMoRe: LLM-based Robot Co-design via Joint Optimization of Morphology and Reward
by: Fang, Jiawei, et al.
Published: (2025)

Mitigating Catastrophic Forgetting in Streaming Generative and Predictive Learning via Stateful Replay
by: Du, Wenzhang
Published: (2025)

From Theory to Practice with RAVEN-UCB: Addressing Non-Stationarity in Multi-Armed Bandits through Variance Adaptation
by: Fang, Junyi, et al.
Published: (2025)

Learning from Preferences and Mixed Demonstrations in General Settings
by: Brown, Jason R, et al.
Published: (2025)

ASkDAgger: Active Skill-level Data Aggregation for Interactive Imitation Learning
by: Luijkx, Jelle, et al.
Published: (2025)

STRIDE: A Self-Reflective Agent Framework for Reliable Automatic Equation Discovery
by: Su, Jiarui, et al.
Published: (2026)

Streaming Continual Learning for Unified Adaptive Intelligence in Dynamic Environments
by: Giannini, Federico, et al.
Published: (2026)

Multiple data-driven missing imputation
by: Kavun, Sergii
Published: (2025)

Maximum Entropy Relaxation of Multi-Way Cardinality Constraints for Synthetic Population Generation
by: Pachet, François, et al.
Published: (2026)