:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Schesch, Benedikt, Caserta, Marco
Format:	Preprint
Published:	2024
Subjects:	Machine Learning Artificial Intelligence
Online Access:	https://arxiv.org/abs/2402.10177
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Merge-Bench: Resolve Merge Conflicts with Large Language Models
by: Schesch, Benedikt, et al.
Published: (2026)

Continual Learning as Computationally Constrained Reinforcement Learning
by: Kumar, Saurabh, et al.
Published: (2023)

State-Constrained Offline Reinforcement Learning
by: Hepburn, Charles A., et al.
Published: (2024)

Locally Constrained Representations in Reinforcement Learning
by: Nath, Somjit, et al.
Published: (2022)

RLAX: Large-Scale, Distributed Reinforcement Learning for Large Language Models on TPUs
by: Zhou, Runlong, et al.
Published: (2025)

Imitating Cost-Constrained Behaviors in Reinforcement Learning
by: Shao, Qian, et al.
Published: (2024)

Memory Allocation in Resource-Constrained Reinforcement Learning
by: Tamborski, Massimiliano, et al.
Published: (2025)

Handling Long and Richly Constrained Tasks through Constrained Hierarchical Reinforcement Learning
by: Lu, Yuxiao, et al.
Published: (2023)

Adaptive Neighborhood-Constrained Q Learning for Offline Reinforcement Learning
by: Mao, Yixiu, et al.
Published: (2025)

PROPEL: Supervised and Reinforcement Learning for Large-Scale Supply Chain Planning
by: Akhlaghi, Vahid Eghbal, et al.
Published: (2025)

Provably Efficient Exploration in Inverse Constrained Reinforcement Learning
by: Yue, Bo, et al.
Published: (2024)

Sample-Efficient Constrained Reinforcement Learning with General Parameterization
by: Mondal, Washim Uddin, et al.
Published: (2024)

Sample Complexity Analysis for Constrained Bilevel Reinforcement Learning
by: Saxena, Naman, et al.
Published: (2026)

Optimistic Exploration for Risk-Averse Constrained Reinforcement Learning
by: McCarthy, James, et al.
Published: (2025)

DisCO: Reinforcing Large Reasoning Models with Discriminative Constrained Optimization
by: Li, Gang, et al.
Published: (2025)

Offline Actor-Critic Reinforcement Learning Scales to Large Models
by: Springenberg, Jost Tobias, et al.
Published: (2024)

The Limits of Inference Scaling Through Resampling
by: Stroebl, Benedikt, et al.
Published: (2024)

Towards Large-Scale In-Context Reinforcement Learning by Meta-Training in Randomized Worlds
by: Wang, Fan, et al.
Published: (2025)

HALO: Hierarchical Reinforcement Learning for Large-Scale Adaptive Traffic Signal Control
by: Zhu, Yaqiao, et al.
Published: (2025)

Understanding Behavioral Metric Learning: A Large-Scale Study on Distracting Reinforcement Learning Environments
by: Luo, Ziyan, et al.
Published: (2025)

An Offline Adaptation Framework for Constrained Multi-Objective Reinforcement Learning
by: Lin, Qian, et al.
Published: (2024)

Towards Interpretable Reinforcement Learning with Constrained Normalizing Flow Policies
by: Rietz, Finn, et al.
Published: (2024)

Tilted Quantile Gradient Updates for Quantile-Constrained Reinforcement Learning
by: Li, Chenglin, et al.
Published: (2024)

Sparsity-based Safety Conservatism for Constrained Offline Reinforcement Learning
by: Cho, Minjae, et al.
Published: (2024)

Leveraging Constraint Violation Signals For Action-Constrained Reinforcement Learning
by: Brahmanage, Janaka Chathuranga, et al.
Published: (2025)

Time-Constrained Recommendations: Reinforcement Learning Strategies for E-Commerce
by: Chakrabarty, Sayak, et al.
Published: (2025)

Multi-Agent Deep Reinforcement Learning Under Constrained Communications
by: Shaik, Shahil, et al.
Published: (2026)

Incentivizing Safer Actions in Policy Optimization for Constrained Reinforcement Learning
by: Hazra, Somnath, et al.
Published: (2025)

Anytime-Constrained Reinforcement Learning
by: McMahan, Jeremy, et al.
Published: (2023)

Policy-Based Trajectory Clustering in Offline Reinforcement Learning
by: Hu, Hao, et al.
Published: (2025)

AReaL: A Large-Scale Asynchronous Reinforcement Learning System for Language Reasoning
by: Fu, Wei, et al.
Published: (2025)

Scale-free Adversarial Reinforcement Learning
by: Chen, Mingyu, et al.
Published: (2024)

Robust off-policy Reinforcement Learning via Soft Constrained Adversary
by: Nakanishi, Kosuke, et al.
Published: (2024)

Safe and Balanced: A Framework for Constrained Multi-Objective Reinforcement Learning
by: Gu, Shangding, et al.
Published: (2024)

Constrained Latent Action Policies for Model-Based Offline Reinforcement Learning
by: Alles, Marvin, et al.
Published: (2024)

Towards Safe Reinforcement Learning via Constraining Conditional Value-at-Risk
by: Ying, Chengyang, et al.
Published: (2022)

Two-Step Offline Preference-Based Reinforcement Learning with Constrained Actions
by: Xu, Yinglun, et al.
Published: (2023)

Manifold-Constrained Energy-Based Transition Models for Offline Reinforcement Learning
by: Fang, Zeyu, et al.
Published: (2026)

XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning
by: Nikulin, Alexander, et al.
Published: (2024)

Rethinking Policy Diversity in Ensemble Policy Gradient in Large-Scale Reinforcement Learning
by: Shitanda, Naoki, et al.
Published: (2026)