Library Catalog

Bibliographic Details
Main Authors:	Varys, Kryspin, Cerutti, Federico, Sobey, Adam, Norman, Timothy J.
Format:	Preprint
Published:	2025
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2505.15011

Similar Items

Speaking Your Language: Spatial Relationships in Interpretable Emergent Communication
by: Lipinski, Olaf, et al.
Published: (2024)

It's About Time: Temporal References in Emergent Communication
by: Lipinski, Olaf, et al.
Published: (2023)

CHIRPs: Change-Induced Regret Proxy metrics for Lifelong Reinforcement Learning
by: Birkbeck, John, et al.
Published: (2024)

Methodological Insights into Structural Causal Modelling and Uncertainty-Aware Forecasting for Economic Indicators
by: Cerutti, Federico
Published: (2025)

It's Not You, It's Clipping: A Soft Trust-Region via Probability Smoothing for LLM RL
by: Dwyer, Madeleine, et al.
Published: (2025)

Intrinsic Memory Agents: Heterogeneous Multi-Agent LLM Systems through Structured Contextual Memory
by: Yuen, Sizhe, et al.
Published: (2025)

Learning Robust Reward Machines from Noisy Labels
by: Parac, Roko, et al.
Published: (2024)

Explaining an Agent's Future Beliefs through Temporally Decomposing Future Reward Estimators
by: Towers, Mark, et al.
Published: (2024)

Automatic Dataset Generation for Knowledge Intensive Question Answering Tasks
by: Yuen, Sizhe, et al.
Published: (2025)

Similarity as Reward Alignment: Robust and Versatile Preference-based Reinforcement Learning
by: Rajaram, Sara, et al.
Published: (2025)

Expected Value Alignment for Generative Reward Modeling in Formal Mathematics Verification
by: Ji, Shihao, et al.
Published: (2026)

Democratizing Reward Design for Personal and Representative Value-Alignment
by: Blair, Carter, et al.
Published: (2024)

AI Alignment through Reinforcement Learning from Human Feedback? Contradictions and Limitations
by: Lindström, Adam Dahlgren, et al.
Published: (2024)

Inverse Reinforcement Learning with Dynamic Reward Scaling for LLM Alignment
by: Cheng, Ruoxi, et al.
Published: (2025)

OGER: A Robust Offline-Guided Exploration Reward for Hybrid Reinforcement Learning
by: Ma, Xinyu, et al.
Published: (2026)

Don't Forget Your Reward Values: Language Model Alignment via Value-based Calibration
by: Mao, Xin, et al.
Published: (2024)

The Reward Model Selection Crisis in Personalized Alignment
by: Rezk, Fady, et al.
Published: (2025)

Explaining Reinforcement Learning: A Counterfactual Shapley Values Approach
by: Shi, Yiwei, et al.
Published: (2024)

Hybrid Approaches for Moral Value Alignment in AI Agents: a Manifesto
by: Tennant, Elizaveta, et al.
Published: (2023)

Shaping Sparse Rewards in Reinforcement Learning: A Semi-supervised Approach
by: Li, Wenyun, et al.
Published: (2025)

Hybrid Reward-Driven Reinforcement Learning for Efficient Quantum Circuit Synthesis
by: Giordano, Sara, et al.
Published: (2025)

Aligning Medical Conversational AI through Online Reinforcement Learning with Information-Theoretic Rewards
by: Verma, Tanvi, et al.
Published: (2026)

Process Reinforcement through Implicit Rewards
by: Cui, Ganqu, et al.
Published: (2025)

Reward Model Routing in Alignment
by: Wu, Xinle, et al.
Published: (2025)

Safety Modulation: Enhancing Safety in Reinforcement Learning through Cost-Modulated Rewards
by: Zhang, Hanping, et al.
Published: (2025)

Enhancing Inverse Reinforcement Learning through Encoding Dynamic Information in Reward Shaping
by: Zhan, Simon Sinong, et al.
Published: (2024)

Reward Hacking in Rubric-Based Reinforcement Learning
by: Mahmoud, Anas, et al.
Published: (2026)

Trust, But Verify: A Self-Verification Approach to Reinforcement Learning with Verifiable Rewards
by: Liu, Xiaoyuan, et al.
Published: (2025)

Reward Training Wheels: Adaptive Auxiliary Rewards for Robotics Reinforcement Learning
by: Wang, Linji, et al.
Published: (2025)

Beyond Monolithic Rewards: A Hybrid and Multi-Aspect Reward Optimization for MLLM Alignment
by: Gulhane, Radha, et al.
Published: (2025)

Reinforcement Learning with Stochastic Reward Machines
by: Corazza, Jan, et al.
Published: (2025)

Reinforcement Learning with Exogenous States and Rewards
by: Trimponias, George, et al.
Published: (2023)

Reinforcement Learning with Symbolic Reward Machines
by: Krug, Thomas, et al.
Published: (2026)

Offline Reinforcement Learning with Imputed Rewards
by: Romeo, Carlo, et al.
Published: (2024)

Multi-Robot Collaboration through Reinforcement Learning and Abstract Simulation
by: Labiosa, Adam, et al.
Published: (2025)

Is there Value in Reinforcement Learning?
by: Fox, Lior, et al.
Published: (2025)

Trust Your Memory: Verifiable Control of Smart Homes through Reinforcement Learning with Multi-dimensional Rewards
by: Guo, Kai-Yuan, et al.
Published: (2026)

Constraints as Rewards: Reinforcement Learning for Robots without Reward Functions
by: Ishihara, Yu, et al.
Published: (2025)

In-Context Reinforcement Learning through Bayesian Fusion of Context and Value Prior
by: Berkes, Anaïs, et al.
Published: (2026)

Inverse Reinforcement Learning without an Optimal Demonstrator: A Feasible Reward Set Approach
by: Kim, Kihyun, et al.
Published: (2026)