| Main Authors: | Varys, Kryspin, Cerutti, Federico, Sobey, Adam, Norman, Timothy J. |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2505.15011 |
Similar Items
Speaking Your Language: Spatial Relationships in Interpretable Emergent Communication
by: Lipinski, Olaf, et al.
Published: (2024)
It's About Time: Temporal References in Emergent Communication
by: Lipinski, Olaf, et al.
Published: (2023)
CHIRPs: Change-Induced Regret Proxy metrics for Lifelong Reinforcement Learning
by: Birkbeck, John, et al.
Published: (2024)
Methodological Insights into Structural Causal Modelling and Uncertainty-Aware Forecasting for Economic Indicators
by: Cerutti, Federico
Published: (2025)
It's Not You, It's Clipping: A Soft Trust-Region via Probability Smoothing for LLM RL
by: Dwyer, Madeleine, et al.
Published: (2025)
Intrinsic Memory Agents: Heterogeneous Multi-Agent LLM Systems through Structured Contextual Memory
by: Yuen, Sizhe, et al.
Published: (2025)
Learning Robust Reward Machines from Noisy Labels
by: Parac, Roko, et al.
Published: (2024)
Explaining an Agent's Future Beliefs through Temporally Decomposing Future Reward Estimators
by: Towers, Mark, et al.
Published: (2024)
Automatic Dataset Generation for Knowledge Intensive Question Answering Tasks
by: Yuen, Sizhe, et al.
Published: (2025)
Similarity as Reward Alignment: Robust and Versatile Preference-based Reinforcement Learning
by: Rajaram, Sara, et al.
Published: (2025)
Expected Value Alignment for Generative Reward Modeling in Formal Mathematics Verification
by: Ji, Shihao, et al.
Published: (2026)
Democratizing Reward Design for Personal and Representative Value-Alignment
by: Blair, Carter, et al.
Published: (2024)
AI Alignment through Reinforcement Learning from Human Feedback? Contradictions and Limitations
by: Lindström, Adam Dahlgren, et al.
Published: (2024)
Inverse Reinforcement Learning with Dynamic Reward Scaling for LLM Alignment
by: Cheng, Ruoxi, et al.
Published: (2025)
OGER: A Robust Offline-Guided Exploration Reward for Hybrid Reinforcement Learning
by: Ma, Xinyu, et al.
Published: (2026)
Don't Forget Your Reward Values: Language Model Alignment via Value-based Calibration
by: Mao, Xin, et al.
Published: (2024)
The Reward Model Selection Crisis in Personalized Alignment
by: Rezk, Fady, et al.
Published: (2025)
Explaining Reinforcement Learning: A Counterfactual Shapley Values Approach
by: Shi, Yiwei, et al.
Published: (2024)
Hybrid Approaches for Moral Value Alignment in AI Agents: a Manifesto
by: Tennant, Elizaveta, et al.
Published: (2023)
Shaping Sparse Rewards in Reinforcement Learning: A Semi-supervised Approach
by: Li, Wenyun, et al.
Published: (2025)
Hybrid Reward-Driven Reinforcement Learning for Efficient Quantum Circuit Synthesis
by: Giordano, Sara, et al.
Published: (2025)
Aligning Medical Conversational AI through Online Reinforcement Learning with Information-Theoretic Rewards
by: Verma, Tanvi, et al.
Published: (2026)
Process Reinforcement through Implicit Rewards
by: Cui, Ganqu, et al.
Published: (2025)
Reward Model Routing in Alignment
by: Wu, Xinle, et al.
Published: (2025)
Safety Modulation: Enhancing Safety in Reinforcement Learning through Cost-Modulated Rewards
by: Zhang, Hanping, et al.
Published: (2025)
Enhancing Inverse Reinforcement Learning through Encoding Dynamic Information in Reward Shaping
by: Zhan, Simon Sinong, et al.
Published: (2024)
Reward Hacking in Rubric-Based Reinforcement Learning
by: Mahmoud, Anas, et al.
Published: (2026)
Trust, But Verify: A Self-Verification Approach to Reinforcement Learning with Verifiable Rewards
by: Liu, Xiaoyuan, et al.
Published: (2025)
Reward Training Wheels: Adaptive Auxiliary Rewards for Robotics Reinforcement Learning
by: Wang, Linji, et al.
Published: (2025)
Beyond Monolithic Rewards: A Hybrid and Multi-Aspect Reward Optimization for MLLM Alignment
by: Gulhane, Radha, et al.
Published: (2025)
Reinforcement Learning with Stochastic Reward Machines
by: Corazza, Jan, et al.
Published: (2025)
Reinforcement Learning with Exogenous States and Rewards
by: Trimponias, George, et al.
Published: (2023)
Reinforcement Learning with Symbolic Reward Machines
by: Krug, Thomas, et al.
Published: (2026)
Offline Reinforcement Learning with Imputed Rewards
by: Romeo, Carlo, et al.
Published: (2024)
Multi-Robot Collaboration through Reinforcement Learning and Abstract Simulation
by: Labiosa, Adam, et al.
Published: (2025)
Is there Value in Reinforcement Learning?
by: Fox, Lior, et al.
Published: (2025)
Trust Your Memory: Verifiable Control of Smart Homes through Reinforcement Learning with Multi-dimensional Rewards
by: Guo, Kai-Yuan, et al.
Published: (2026)
Constraints as Rewards: Reinforcement Learning for Robots without Reward Functions
by: Ishihara, Yu, et al.
Published: (2025)
In-Context Reinforcement Learning through Bayesian Fusion of Context and Value Prior
by: Berkes, Anaïs, et al.
Published: (2026)
Inverse Reinforcement Learning without an Optimal Demonstrator: A Feasible Reward Set Approach
by: Kim, Kihyun, et al.
Published: (2026)