:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Ma, Zhenyao, Liang, Yue, Li, Dongxu
Format:	Preprint
Published:	2026
Subjects:	Machine Learning Artificial Intelligence I.2.6
Online Access:	https://arxiv.org/abs/2602.20152
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Deep Memory Search: A Metaheuristic Approach for Optimizing Heuristic Search
by: Hedar, Abdel-Rahman, et al.
Published: (2024)

Structured Prompt Optimization Meets Reinforcement Learning for Global and Local Interpretability over Complex Text
by: Zhou, Tianyang, et al.
Published: (2026)

DataRater: Meta-Learned Dataset Curation
by: Calian, Dan A., et al.
Published: (2025)

DELTA: Variational Disentangled Learning for Privacy-Preserving Data Reprogramming
by: Malarkkan, Arun Vignesh, et al.
Published: (2025)

Working Paper: Active Causal Structure Learning with Latent Variables: Towards Learning to Detour in Autonomous Robots
by: Riscos, Pablo de los, et al.
Published: (2024)

MACS: Multi-Agent Reinforcement Learning for Optimization of Crystal Structures
by: Zamaraeva, Elena, et al.
Published: (2025)

TS-ACL: Closed-Form Solution for Time Series-oriented Continual Learning
by: Li, Jiaxu, et al.
Published: (2024)

Hierarchical Universal Value Function Approximators
by: Arora, Rushiv
Published: (2024)

Machine Learning vs Deep Learning: The Generalization Problem
by: Bay, Yong Yi, et al.
Published: (2024)

Expressive Value Learning for Scalable Offline Reinforcement Learning
by: Espinosa-Dice, Nicolas, et al.
Published: (2025)

FastGRPO: Accelerating Policy Optimization via Concurrency-aware Speculative Decoding and Online Draft Learning
by: Zhang, Yizhou, et al.
Published: (2025)

Safe Reinforcement Learning with Preference-based Constraint Inference
by: Li, Chenglin, et al.
Published: (2026)

Bounded Ratio Reinforcement Learning
by: Ao, Yunke, et al.
Published: (2026)

Why Online Reinforcement Learning is Causal
by: Schulte, Oliver, et al.
Published: (2024)

Understanding Goal Generalisation in Sequential Reinforcement Learning
by: Brown, Jason Ross, et al.
Published: (2026)

What Do World Models Learn in RL? Probing Latent Representations in Learned Environment Simulators
by: Zhang, Xinyu
Published: (2026)

Evaluating SAP RPT-1 for Enterprise Business Process Prediction: In-Context Learning vs. Traditional Machine Learning on Structured SAP Data
by: Lal, Amit
Published: (2026)

FastForward Pruning: Efficient LLM Pruning via Single-Step Reinforcement Learning
by: Yuan, Xin, et al.
Published: (2025)

Path-Coupled Bellman Flows for Distributional Reinforcement Learning
by: Xu, Boyang, et al.
Published: (2026)

Extrinsicaly Rewarded Soft Q Imitation Learning with Discriminator
by: Furuyama, Ryoma, et al.
Published: (2024)

Evidential Deep Active Learning for Semi-Supervised Classification
by: Zhao, Shenkai, et al.
Published: (2025)

A Simple Generalisation of the Implicit Dynamics of In-Context Learning
by: Innocenti, Francesco, et al.
Published: (2025)

Low-Dimensional Execution Manifolds in Transformer Learning Dynamics: Evidence from Modular Arithmetic Tasks
by: Xu, Yongzhong
Published: (2026)

An Idiosyncrasy of Time-discretization in Reinforcement Learning
by: De Asis, Kris, et al.
Published: (2024)

Generative and Contrastive Graph Representation Learning
by: Chen, Jiali, et al.
Published: (2025)

Integrating Causality with Neurochaos Learning: Proposed Approach and Research Agenda
by: Narendra, Nanjangud C., et al.
Published: (2025)

TACO: Tackling Over-correction in Federated Learning with Tailored Adaptive Correction
by: Liu, Weijie, et al.
Published: (2025)

Multi-Task Reinforcement Learning with Language-Encoded Gated Policy Networks
by: Arora, Rushiv
Published: (2025)

Load and Renewable Energy Forecasting Using Deep Learning for Grid Stability
by: Sarkar, Kamal
Published: (2025)

Learning Agents With Prioritization and Parameter Noise in Continuous State and Action Space
by: Mangannavar, Rajesh, et al.
Published: (2024)

Deep Reinforcement Learning for Adverse Garage Scenario Generation
by: Li, Kai
Published: (2024)

What changes after deployment? A survey on On-device Learning in TinyML
by: Pavan, Massimo, et al.
Published: (2026)

Shattered Compositionality: Counterintuitive Learning Dynamics of Transformers for Arithmetic
by: Zhao, Xingyu, et al.
Published: (2026)

Adaptable Hindsight Experience Replay for Search-Based Learning
by: Vazaios, Alexandros, et al.
Published: (2025)

FreRA: A Frequency-Refined Augmentation for Contrastive Learning on Time Series Classification
by: Tian, Tian, et al.
Published: (2025)

Simulation-Driven Railway Delay Prediction: An Imitation Learning Approach
by: Elliker, Clément, et al.
Published: (2025)

CORE: Towards Scalable and Efficient Causal Discovery with Reinforcement Learning
by: Sauter, Andreas W. M., et al.
Published: (2024)

A Practical Approach to using Supervised Machine Learning Models to Classify Aviation Safety Occurrences
by: Siow, Bryan Y.
Published: (2025)

DYNAMITE: Dynamic Interplay of Mini-Batch Size and Aggregation Frequency for Federated Learning with Static and Streaming Dataset
by: Liu, Weijie, et al.
Published: (2023)

Dynamics Reveals Structure: Challenging the Linear Propagation Assumption
by: Chang, Hoyeon, et al.
Published: (2026)