Saved in:
| Main Authors: | Puebla, Guillermo, Doumas, Leonidas A. A. |
|---|---|
| Format: | Preprint |
| Published: |
2022
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2203.13599 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Human-like generalization in a machine through predicate learning
by: Doumas, Leonidas A. A., et al.
Published: (2018)
by: Doumas, Leonidas A. A., et al.
Published: (2018)
A Theory of Relation Learning and Cross-domain Generalization
by: Doumas, Leonidas A. A., et al.
Published: (2019)
by: Doumas, Leonidas A. A., et al.
Published: (2019)
Behavioural vs. Representational Systematicity in End-to-End Models: An Opinionated Survey
by: Vegner, Ivan, et al.
Published: (2025)
by: Vegner, Ivan, et al.
Published: (2025)
Rule Based Rewards for Language Model Safety
by: Mu, Tong, et al.
Published: (2024)
by: Mu, Tong, et al.
Published: (2024)
AutoRule: Reasoning Chain-of-thought Extracted Rule-based Rewards Improve Preference Learning
by: Wang, Tevin, et al.
Published: (2025)
by: Wang, Tevin, et al.
Published: (2025)
The Impact of Machine Learning Uncertainty on the Robustness of Counterfactual Explanations
by: Christodoulou, Leonidas, et al.
Published: (2026)
by: Christodoulou, Leonidas, et al.
Published: (2026)
Using psychological theory to ground guidelines for the annotation of misogynistic language
by: Deligianni, Artemis, et al.
Published: (2026)
by: Deligianni, Artemis, et al.
Published: (2026)
Hierarchical Average-Reward Linearly-solvable Markov Decision Processes
by: Infante, Guillermo, et al.
Published: (2024)
by: Infante, Guillermo, et al.
Published: (2024)
A Novel Framework for Uncertainty-Driven Adaptive Exploration
by: Bakopoulos, Leonidas, et al.
Published: (2025)
by: Bakopoulos, Leonidas, et al.
Published: (2025)
WildReward: Learning Reward Models from In-the-Wild Human Interactions
by: Peng, Hao, et al.
Published: (2026)
by: Peng, Hao, et al.
Published: (2026)
FormalRewardBench: A Benchmark for Formal Theorem Proving Reward Models
by: Uluşan, Zeynel A., et al.
Published: (2026)
by: Uluşan, Zeynel A., et al.
Published: (2026)
Reward Training Wheels: Adaptive Auxiliary Rewards for Robotics Reinforcement Learning
by: Wang, Linji, et al.
Published: (2025)
by: Wang, Linji, et al.
Published: (2025)
Learning Reasoning Rewards from Expert Demonstrations with Inverse Reinforcement Learning
by: Fanconi, Claudio, et al.
Published: (2025)
by: Fanconi, Claudio, et al.
Published: (2025)
Rule by Rule: Learning with Confidence through Vocabulary Expansion
by: Nössig, Albert, et al.
Published: (2024)
by: Nössig, Albert, et al.
Published: (2024)
Pushdown Reward Machines for Reinforcement Learning
by: Varricchione, Giovanni, et al.
Published: (2025)
by: Varricchione, Giovanni, et al.
Published: (2025)
Tiered Reward: Designing Rewards for Specification and Fast Learning of Desired Behavior
by: Zhou, Zhiyuan, et al.
Published: (2022)
by: Zhou, Zhiyuan, et al.
Published: (2022)
Reward Hacking in Rubric-Based Reinforcement Learning
by: Mahmoud, Anas, et al.
Published: (2026)
by: Mahmoud, Anas, et al.
Published: (2026)
SemiReward: A General Reward Model for Semi-supervised Learning
by: Li, Siyuan, et al.
Published: (2023)
by: Li, Siyuan, et al.
Published: (2023)
Learning Logical Rules using Minimum Message Length
by: Sharma, Ruben, et al.
Published: (2025)
by: Sharma, Ruben, et al.
Published: (2025)
Learned-Rule-Augmented Large Language Model Evaluators
by: Meng, Jie, et al.
Published: (2025)
by: Meng, Jie, et al.
Published: (2025)
Reward Learning from Multiple Feedback Types
by: Metz, Yannick, et al.
Published: (2025)
by: Metz, Yannick, et al.
Published: (2025)
RLSR: Reinforcement Learning from Self Reward
by: Simonds, Toby, et al.
Published: (2025)
by: Simonds, Toby, et al.
Published: (2025)
Comparing Reinforcement Learning and Human Learning using the Game of Hidden Rules
by: Pulick, Eric, et al.
Published: (2023)
by: Pulick, Eric, et al.
Published: (2023)
Constraints as Rewards: Reinforcement Learning for Robots without Reward Functions
by: Ishihara, Yu, et al.
Published: (2025)
by: Ishihara, Yu, et al.
Published: (2025)
Understanding Expressivity of GNN in Rule Learning
by: Qiu, Haiquan, et al.
Published: (2023)
by: Qiu, Haiquan, et al.
Published: (2023)
A Harmonic Mean Formulation of Average Reward Reinforcement Learning in SMDPs
by: Shtossel, Erel, et al.
Published: (2026)
by: Shtossel, Erel, et al.
Published: (2026)
Graph Neural Networks, Deep Reinforcement Learning and Probabilistic Topic Modeling for Strategic Multiagent Settings
by: Chalkiadakis, Georgios, et al.
Published: (2025)
by: Chalkiadakis, Georgios, et al.
Published: (2025)
Notes on the Reward Representation of Posterior Updates
by: Ortega, Pedro A.
Published: (2026)
by: Ortega, Pedro A.
Published: (2026)
What Fundamental Structure in Reward Functions Enables Efficient Sparse-Reward Learning?
by: Shihab, Ibne Farabi, et al.
Published: (2025)
by: Shihab, Ibne Farabi, et al.
Published: (2025)
Which Rewards Matter? Reward Selection for Reinforcement Learning under Limited Feedback
by: Chaudhari, Shreyas, et al.
Published: (2025)
by: Chaudhari, Shreyas, et al.
Published: (2025)
ARMS: Automatic Reward Shaping for Sparse-Reward Multi-Agent Reinforcement Learning
by: Abboud, Elie, et al.
Published: (2026)
by: Abboud, Elie, et al.
Published: (2026)
RLNVR: Reinforcement Learning from Non-Verified Real-World Rewards
by: Krishnan, Rohit, et al.
Published: (2025)
by: Krishnan, Rohit, et al.
Published: (2025)
Beyond Perfect Scores: Proof-by-Contradiction for Trustworthy Machine Learning
by: Wadduwage, Dushan N., et al.
Published: (2026)
by: Wadduwage, Dushan N., et al.
Published: (2026)
A Unified Linear Programming Framework for Offline Reward Learning from Human Demonstrations and Feedback
by: Kim, Kihyun, et al.
Published: (2024)
by: Kim, Kihyun, et al.
Published: (2024)
Semantic Association Rule Learning from Time Series Data and Knowledge Graphs
by: Karabulut, Erkan, et al.
Published: (2023)
by: Karabulut, Erkan, et al.
Published: (2023)
VIRAL: Vision-grounded Integration for Reward design And Learning
by: Cuzin-Rambaud, Valentin, et al.
Published: (2025)
by: Cuzin-Rambaud, Valentin, et al.
Published: (2025)
Discriminative Rule Learning for Outcome-Guided Process Model Discovery
by: Norouzifar, Ali, et al.
Published: (2025)
by: Norouzifar, Ali, et al.
Published: (2025)
SuPLE: Robot Learning with Lyapunov Rewards
by: Nguyen, Phu, et al.
Published: (2024)
by: Nguyen, Phu, et al.
Published: (2024)
Learning Robust Reward Machines from Noisy Labels
by: Parac, Roko, et al.
Published: (2024)
by: Parac, Roko, et al.
Published: (2024)
Learning Safety Constraints from Demonstrations with Unknown Rewards
by: Lindner, David, et al.
Published: (2023)
by: Lindner, David, et al.
Published: (2023)
Similar Items
-
Human-like generalization in a machine through predicate learning
by: Doumas, Leonidas A. A., et al.
Published: (2018) -
A Theory of Relation Learning and Cross-domain Generalization
by: Doumas, Leonidas A. A., et al.
Published: (2019) -
Behavioural vs. Representational Systematicity in End-to-End Models: An Opinionated Survey
by: Vegner, Ivan, et al.
Published: (2025) -
Rule Based Rewards for Language Model Safety
by: Mu, Tong, et al.
Published: (2024) -
AutoRule: Reasoning Chain-of-thought Extracted Rule-based Rewards Improve Preference Learning
by: Wang, Tevin, et al.
Published: (2025)