Saved in:
| Main Authors: | Cooper, Patrick, Velasquez, Alvaro |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.02451 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Modeling and Controlling Deployment Reliability under Temporal Distribution Shift
by: Rahman, Naimur, et al.
Published: (2026)
by: Rahman, Naimur, et al.
Published: (2026)
An Improved Adaptive PID Optimizer with Enhanced Convergence and Stability for Deep Learning
by: Saini, Saurabh, et al.
Published: (2026)
by: Saini, Saurabh, et al.
Published: (2026)
On Divergence Measures for Training GFlowNets
by: da Silva, Tiago, et al.
Published: (2024)
by: da Silva, Tiago, et al.
Published: (2024)
Statistical Guarantees for Lifelong Reinforcement Learning using PAC-Bayes Theory
by: Zhang, Zhi, et al.
Published: (2024)
by: Zhang, Zhi, et al.
Published: (2024)
XAutoLM: Efficient Fine-Tuning of Language Models via Meta-Learning and AutoML
by: Estevanell-Valladares, Ernesto L., et al.
Published: (2025)
by: Estevanell-Valladares, Ernesto L., et al.
Published: (2025)
Connectivity-Aware Representations for Constrained Motion Planning via Multi-Scale Contrastive Learning
by: Jeon, Suhyun, et al.
Published: (2026)
by: Jeon, Suhyun, et al.
Published: (2026)
Evolving machine learning workflows through interactive AutoML
by: Barbudo, Rafael, et al.
Published: (2024)
by: Barbudo, Rafael, et al.
Published: (2024)
Prediction-Based Markov Violation Scores for Detecting Non-Markovian Observations in Reinforcement Learning
by: Mysore, Naveen
Published: (2026)
by: Mysore, Naveen
Published: (2026)
Normalisation and Initialisation Strategies for Graph Neural Networks in Blockchain Anomaly Detection
by: Duy, Dang Sy, et al.
Published: (2026)
by: Duy, Dang Sy, et al.
Published: (2026)
AI Agents: Evolution, Architecture, and Real-World Applications
by: Krishnan, Naveen
Published: (2025)
by: Krishnan, Naveen
Published: (2025)
Adaptive Minds: Empowering Agents with LoRA-as-Tools
by: Shekar, Pavan C, et al.
Published: (2025)
by: Shekar, Pavan C, et al.
Published: (2025)
Proving Olympiad Algebraic Inequalities without Human Demonstrations
by: Wei, Chenrui, et al.
Published: (2024)
by: Wei, Chenrui, et al.
Published: (2024)
COBRA-PPM: A Causal Bayesian Reasoning Architecture Using Probabilistic Programming for Robot Manipulation Under Uncertainty
by: Cannizzaro, Ricardo, et al.
Published: (2024)
by: Cannizzaro, Ricardo, et al.
Published: (2024)
Reinforcement Learning for Portfolio Optimization with a Financial Goal and Defined Time Horizons
by: Leukam, Fermat, et al.
Published: (2025)
by: Leukam, Fermat, et al.
Published: (2025)
Risk-Sensitive Option Market Making with Arbitrage-Free eSSVI Surfaces: A Constrained RL and Stochastic Control Bridge
by: Zhang, Jian'an
Published: (2025)
by: Zhang, Jian'an
Published: (2025)
SafeRL-Lite: A Lightweight, Explainable, and Constrained Reinforcement Learning Library
by: Mishra, Satyam, et al.
Published: (2025)
by: Mishra, Satyam, et al.
Published: (2025)
The two clocks and the innovation window: When and how generative models learn rules
by: Wang, Binxu, et al.
Published: (2026)
by: Wang, Binxu, et al.
Published: (2026)
Dual-Channel Feature Fusion for Joint Prediction in Dynamic Signed Weighted Networks
by: Zhang, Gaoxin, et al.
Published: (2026)
by: Zhang, Gaoxin, et al.
Published: (2026)
Emotion-Inspired Learning Signals (EILS): A Homeostatic Framework for Adaptive Autonomous Agents
by: Tiwari, Dhruv
Published: (2025)
by: Tiwari, Dhruv
Published: (2025)
Task Memory Engine (TME): Enhancing State Awareness for Multi-Step LLM Agent Tasks
by: Ye, Ye
Published: (2025)
by: Ye, Ye
Published: (2025)
A Comparison Between Decision Transformers and Traditional Offline Reinforcement Learning Algorithms
by: Caunhye, Ali Murtaza, et al.
Published: (2025)
by: Caunhye, Ali Murtaza, et al.
Published: (2025)
Vis-CoT: A Human-in-the-Loop Framework for Interactive Visualization and Intervention in LLM Chain-of-Thought Reasoning
by: Pather, Kaviraj, et al.
Published: (2025)
by: Pather, Kaviraj, et al.
Published: (2025)
Improving Hyperparameter Optimization with Checkpointed Model Weights
by: Mehta, Nikhil, et al.
Published: (2024)
by: Mehta, Nikhil, et al.
Published: (2024)
Foundation Models as World Models: A Foundational Study in Text-Based GridWorlds
by: Sasso, Remo, et al.
Published: (2025)
by: Sasso, Remo, et al.
Published: (2025)
Exploration with Foundation Models: Capabilities, Limitations, and Hybrid Approaches
by: Sasso, Remo, et al.
Published: (2025)
by: Sasso, Remo, et al.
Published: (2025)
GoldenStart: Q-Guided Priors and Entropy Control for Distilling Flow Policies
by: Zhang, He, et al.
Published: (2026)
by: Zhang, He, et al.
Published: (2026)
AI and Machine Learning Approaches for Predicting Nanoparticles Toxicity The Critical Role of Physiochemical Properties
by: Yousaf, Iqra
Published: (2024)
by: Yousaf, Iqra
Published: (2024)
Tail-Safe Hedging: Explainable Risk-Sensitive Reinforcement Learning with a White-Box CBF--QP Safety Layer in Arbitrage-Free Markets
by: Zhang, Jian'an
Published: (2025)
by: Zhang, Jian'an
Published: (2025)
Scalable Nested Optimization for Deep Learning
by: Lorraine, Jonathan
Published: (2024)
by: Lorraine, Jonathan
Published: (2024)
Law-Strength Frontiers and a No-Free-Lunch Result for Law-Seeking Reinforcement Learning on Volatility Law Manifolds
by: Zhang, Jian'an
Published: (2025)
by: Zhang, Jian'an
Published: (2025)
Actor-Critic Model Predictive Control: Differentiable Optimization meets Reinforcement Learning for Agile Flight
by: Romero, Angel, et al.
Published: (2023)
by: Romero, Angel, et al.
Published: (2023)
RoboMoRe: LLM-based Robot Co-design via Joint Optimization of Morphology and Reward
by: Fang, Jiawei, et al.
Published: (2025)
by: Fang, Jiawei, et al.
Published: (2025)
Mitigating Catastrophic Forgetting in Streaming Generative and Predictive Learning via Stateful Replay
by: Du, Wenzhang
Published: (2025)
by: Du, Wenzhang
Published: (2025)
From Theory to Practice with RAVEN-UCB: Addressing Non-Stationarity in Multi-Armed Bandits through Variance Adaptation
by: Fang, Junyi, et al.
Published: (2025)
by: Fang, Junyi, et al.
Published: (2025)
Learning from Preferences and Mixed Demonstrations in General Settings
by: Brown, Jason R, et al.
Published: (2025)
by: Brown, Jason R, et al.
Published: (2025)
ASkDAgger: Active Skill-level Data Aggregation for Interactive Imitation Learning
by: Luijkx, Jelle, et al.
Published: (2025)
by: Luijkx, Jelle, et al.
Published: (2025)
STRIDE: A Self-Reflective Agent Framework for Reliable Automatic Equation Discovery
by: Su, Jiarui, et al.
Published: (2026)
by: Su, Jiarui, et al.
Published: (2026)
Streaming Continual Learning for Unified Adaptive Intelligence in Dynamic Environments
by: Giannini, Federico, et al.
Published: (2026)
by: Giannini, Federico, et al.
Published: (2026)
Multiple data-driven missing imputation
by: Kavun, Sergii
Published: (2025)
by: Kavun, Sergii
Published: (2025)
Maximum Entropy Relaxation of Multi-Way Cardinality Constraints for Synthetic Population Generation
by: Pachet, François, et al.
Published: (2026)
by: Pachet, François, et al.
Published: (2026)
Similar Items
-
Modeling and Controlling Deployment Reliability under Temporal Distribution Shift
by: Rahman, Naimur, et al.
Published: (2026) -
An Improved Adaptive PID Optimizer with Enhanced Convergence and Stability for Deep Learning
by: Saini, Saurabh, et al.
Published: (2026) -
On Divergence Measures for Training GFlowNets
by: da Silva, Tiago, et al.
Published: (2024) -
Statistical Guarantees for Lifelong Reinforcement Learning using PAC-Bayes Theory
by: Zhang, Zhi, et al.
Published: (2024) -
XAutoLM: Efficient Fine-Tuning of Language Models via Meta-Learning and AutoML
by: Estevanell-Valladares, Ernesto L., et al.
Published: (2025)