Saved in:
| Main Authors: | Umili, Elena, Argenziano, Francesco, Capobianco, Roberto |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2408.08677 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
DeepDFA: Injecting Temporal Logic in Deep Learning for Sequential Subsymbolic Applications
by: Umili, Elena, et al.
Published: (2026)
by: Umili, Elena, et al.
Published: (2026)
DeepDFA: Automata Learning through Neural Probabilistic Relaxations
by: Umili, Elena, et al.
Published: (2024)
by: Umili, Elena, et al.
Published: (2024)
Fully Learnable Neural Reward Machines
by: Dewidar, Hazem, et al.
Published: (2025)
by: Dewidar, Hazem, et al.
Published: (2025)
Grounding LTL Tasks in Sub-Symbolic RL Environments for Zero-Shot Generalization
by: Pannacci, Matteo, et al.
Published: (2026)
by: Pannacci, Matteo, et al.
Published: (2026)
Symbol Grounding in Neuro-Symbolic AI: A Gentle Introduction to Reasoning Shortcuts
by: Marconato, Emanuele, et al.
Published: (2025)
by: Marconato, Emanuele, et al.
Published: (2025)
Numeric Reward Machines
by: Levina, Kristina, et al.
Published: (2024)
by: Levina, Kristina, et al.
Published: (2024)
Don't Push the Button! Exploring Data Leakage Risks in Machine Learning and Transfer Learning
by: Apicella, Andrea, et al.
Published: (2024)
by: Apicella, Andrea, et al.
Published: (2024)
Reinforcement Learning with Symbolic Reward Machines
by: Krug, Thomas, et al.
Published: (2026)
by: Krug, Thomas, et al.
Published: (2026)
Reinforcement Learning with Stochastic Reward Machines
by: Corazza, Jan, et al.
Published: (2025)
by: Corazza, Jan, et al.
Published: (2025)
Efficient Reinforcement Learning in Probabilistic Reward Machines
by: Lin, Xiaofeng, et al.
Published: (2024)
by: Lin, Xiaofeng, et al.
Published: (2024)
Learning Robust Reward Machines from Noisy Labels
by: Parac, Roko, et al.
Published: (2024)
by: Parac, Roko, et al.
Published: (2024)
Provably Efficient Exploration in Reward Machines with Low Regret
by: Bourel, Hippolyte, et al.
Published: (2024)
by: Bourel, Hippolyte, et al.
Published: (2024)
Maximally Permissive Reward Machines
by: Varricchione, Giovanni, et al.
Published: (2024)
by: Varricchione, Giovanni, et al.
Published: (2024)
Reinforcement Learning with Reward Machines for Sleep Control in Mobile Networks
by: Levina, Kristina, et al.
Published: (2026)
by: Levina, Kristina, et al.
Published: (2026)
Pushdown Reward Machines for Reinforcement Learning
by: Varricchione, Giovanni, et al.
Published: (2025)
by: Varricchione, Giovanni, et al.
Published: (2025)
Mechanistic Neural Networks for Scientific Machine Learning
by: Pervez, Adeel, et al.
Published: (2024)
by: Pervez, Adeel, et al.
Published: (2024)
Expressive Temporal Specifications for Reward Monitoring
by: Adalat, Omar, et al.
Published: (2025)
by: Adalat, Omar, et al.
Published: (2025)
Contextual Pre-planning on Reward Machine Abstractions for Enhanced Transfer in Deep Reinforcement Learning
by: Azran, Guy, et al.
Published: (2023)
by: Azran, Guy, et al.
Published: (2023)
ARM-FM: Automated Reward Machines via Foundation Models for Compositional Reinforcement Learning
by: Castanyer, Roger Creus, et al.
Published: (2025)
by: Castanyer, Roger Creus, et al.
Published: (2025)
Few-shot Steerable Alignment: Adapting Rewards and LLM Policies with Neural Processes
by: Kobalczyk, Katarzyna, et al.
Published: (2024)
by: Kobalczyk, Katarzyna, et al.
Published: (2024)
Defining and Monitoring Complex Robot Activities via LLMs and Symbolic Reasoning
by: Argenziano, Francesco, et al.
Published: (2025)
by: Argenziano, Francesco, et al.
Published: (2025)
Utility-inspired Reward Transformations Improve Reinforcement Learning Training of Language Models
by: Maura-Rivero, Roberto-Rafael, et al.
Published: (2025)
by: Maura-Rivero, Roberto-Rafael, et al.
Published: (2025)
Inferring Reward Machines and Transition Machines from Partially Observable Markov Decision Processes
by: Wu, Yuly, et al.
Published: (2025)
by: Wu, Yuly, et al.
Published: (2025)
Attention-Based Reward Shaping for Sparse and Delayed Rewards
by: Holmes, Ian, et al.
Published: (2025)
by: Holmes, Ian, et al.
Published: (2025)
Reward Hacking Mitigation using Verifiable Composite Rewards
by: Tarek, Mirza Farhan Bin, et al.
Published: (2025)
by: Tarek, Mirza Farhan Bin, et al.
Published: (2025)
Repairing Reward Functions with Feedback to Mitigate Reward Hacking
by: Hatgis-Kessell, Stephane, et al.
Published: (2025)
by: Hatgis-Kessell, Stephane, et al.
Published: (2025)
Intrinsic Reward Policy Optimization for Sparse-Reward Environments
by: Cho, Minjae, et al.
Published: (2026)
by: Cho, Minjae, et al.
Published: (2026)
Multi-Agent Reinforcement Learning with a Hierarchy of Reward Machines
by: Zheng, Xuejing, et al.
Published: (2024)
by: Zheng, Xuejing, et al.
Published: (2024)
Neuro-Symbolic Predictive Process Monitoring
by: Mezini, Axel, et al.
Published: (2025)
by: Mezini, Axel, et al.
Published: (2025)
Reward Centering
by: Naik, Abhishek, et al.
Published: (2024)
by: Naik, Abhishek, et al.
Published: (2024)
Recursive Inference Machines for Neural Reasoning
by: Komisarczyk, Mieszko, et al.
Published: (2026)
by: Komisarczyk, Mieszko, et al.
Published: (2026)
Adversarial Reward Auditing for Active Detection and Mitigation of Reward Hacking
by: Beigi, Mohammad, et al.
Published: (2026)
by: Beigi, Mohammad, et al.
Published: (2026)
SemiReward: A General Reward Model for Semi-supervised Learning
by: Li, Siyuan, et al.
Published: (2023)
by: Li, Siyuan, et al.
Published: (2023)
Trust Region Reward Optimization and Proximal Inverse Reward Optimization Algorithm
by: Chen, Yang, et al.
Published: (2025)
by: Chen, Yang, et al.
Published: (2025)
Beyond Reward Hacking: Causal Rewards for Large Language Model Alignment
by: Wang, Chaoqi, et al.
Published: (2025)
by: Wang, Chaoqi, et al.
Published: (2025)
Tiered Reward: Designing Rewards for Specification and Fast Learning of Desired Behavior
by: Zhou, Zhiyuan, et al.
Published: (2022)
by: Zhou, Zhiyuan, et al.
Published: (2022)
SSR-Zero: Simple Self-Rewarding Reinforcement Learning for Machine Translation
by: Yang, Wenjie, et al.
Published: (2025)
by: Yang, Wenjie, et al.
Published: (2025)
Rethinking Rubric Generation for Improving LLM Judge and Reward Modeling for Open-ended Tasks
by: Shen, William F., et al.
Published: (2026)
by: Shen, William F., et al.
Published: (2026)
Neural Network Conversion of Machine Learning Pipelines
by: Sung, Man-Ling, et al.
Published: (2026)
by: Sung, Man-Ling, et al.
Published: (2026)
Bootstrapped Reward Shaping
by: Adamczyk, Jacob, et al.
Published: (2025)
by: Adamczyk, Jacob, et al.
Published: (2025)
Similar Items
-
DeepDFA: Injecting Temporal Logic in Deep Learning for Sequential Subsymbolic Applications
by: Umili, Elena, et al.
Published: (2026) -
DeepDFA: Automata Learning through Neural Probabilistic Relaxations
by: Umili, Elena, et al.
Published: (2024) -
Fully Learnable Neural Reward Machines
by: Dewidar, Hazem, et al.
Published: (2025) -
Grounding LTL Tasks in Sub-Symbolic RL Environments for Zero-Shot Generalization
by: Pannacci, Matteo, et al.
Published: (2026) -
Symbol Grounding in Neuro-Symbolic AI: A Gentle Introduction to Reasoning Shortcuts
by: Marconato, Emanuele, et al.
Published: (2025)