| Main Authors: | Carey, Ryan; Langlois, Eric; van Merwijk, Chris; Legg, Shane; Everitt, Tom |
|---|---|
| Format: | Preprint |
| Published: | 2020 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2001.07118 |
Similar Items
Constrained Auto-Bidding via Generative Response Modeling
by: Yang, Eunseok, et al.
Published: (2026)
A Parallel Hybrid Action Space Reinforcement Learning Model for Real-world Adaptive Traffic Signal Control
by: Wang, Yuxuan, et al.
Published: (2025)
AI and Machine Learning Approaches for Predicting Nanoparticles Toxicity The Critical Role of Physiochemical Properties
by: Yousaf, Iqra
Published: (2024)
Sketch Decompositions for Classical Planning via Deep Reinforcement Learning
by: Aichmüller, Michael, et al.
Published: (2024)
LeanProgress: Guiding Search for Neural Theorem Proving via Proof Progress Prediction
by: George, Robert Joseph, et al.
Published: (2025)
NeSIG: A Neuro-Symbolic Method for Learning to Generate Planning Problems
by: Núñez-Molina, Carlos, et al.
Published: (2023)
Learning to Select Goals in Automated Planning with Deep-Q Learning
by: Núñez-Molina, Carlos, et al.
Published: (2024)
GIRL: Generative Imagination Reinforcement Learning via Information-Theoretic Hallucination Control
by: Hiremath, Prakul Sunil
Published: (2026)
From Next Token Prediction to (STRIPS) World Models
by: Núñez-Molina, Carlos, et al.
Published: (2025)
PIRS: Physics-Informed Reward Shaping for SAC-Based Building Energy Management
by: Zaregarizi, Shadmehr, et al.
Published: (2026)
Improving Industrial Injection Molding Processes with Explainable AI for Quality Classification
by: Rottenwalter, Georg, et al.
Published: (2025)
Novel Approaches to Artificial Intelligence Development Based on the Nearest Neighbor Method
by: Priezzhev, I. I., et al.
Published: (2025)
Advancements in synthetic data extraction for industrial injection molding
by: Rottenwalter, Georg, et al.
Published: (2025)
Predicting Future Actions of Reinforcement Learning Agents
by: Chung, Stephen, et al.
Published: (2024)
Foundational Requirements for Artificial General Intelligence: A Falsifiable Framework Based on Signal Prediction
by: Šprogar, Matej
Published: (2025)
SMOSE: Sparse Mixture of Shallow Experts for Interpretable Reinforcement Learning in Continuous Control Tasks
by: Vincze, Mátyás, et al.
Published: (2024)
Survey Transfer Learning: Recycling Data with Silicon Responses
by: Amini, Ali
Published: (2025)
Differentiable Symbolic Planning: A Neural Architecture for Constraint Reasoning with Learned Feasibility
by: Oruganti, Venkatakrishna Reddy
Published: (2026)
Embedded Safety-Aligned Intelligence via Differentiable Internal Alignment Embeddings
by: Rathva, Harsh, et al.
Published: (2025)
Safe Reinforcement Learning with Preference-based Constraint Inference
by: Li, Chenglin, et al.
Published: (2026)
Working Paper: Active Causal Structure Learning with Latent Variables: Towards Learning to Detour in Autonomous Robots
by: Riscos, Pablo de los, et al.
Published: (2024)
Fast and Precise: Adjusting Planning Horizon with Adaptive Subgoal Search
by: Zawalski, Michał, et al.
Published: (2022)
Umbrella Reinforcement Learning -- computationally efficient tool for hard non-linear problems
by: Nuzhin, Egor E., et al.
Published: (2024)
Inverting Cryptographic Hash Functions via Cube-and-Conquer
by: Zaikin, Oleg
Published: (2022)
Bridging the Reasoning Gap: Small LLMs Can Plan with Generalised Strategies
by: Borro, Andrey, et al.
Published: (2025)
Score-informed Neural Operator for Enhancing Ordering-based Causal Discovery
by: Kang, Jiyeon, et al.
Published: (2025)
Adaptable Hindsight Experience Replay for Search-Based Learning
by: Vazaios, Alexandros, et al.
Published: (2025)
Regret-Aware Policy Optimization: Environment-Level Memory for Replay Suppression under Delayed Harm
by: Hiremath, Prakul Sunil
Published: (2026)
On the Generalization Gap in LLM Planning: Tests and Verifier-Reward RL
by: Belcamino, Valerio, et al.
Published: (2026)
CORE: Towards Scalable and Efficient Causal Discovery with Reinforcement Learning
by: Sauter, Andreas W. M., et al.
Published: (2024)
Not All Transitions Matter: Evidence from PPO
by: Basnet, Ajhesh
Published: (2026)
AGWM: Affordance-Grounded World Models for Environments with Compositional Prerequisites
by: Zhang, Qinshi, et al.
Published: (2026)
What Do World Models Learn in RL? Probing Latent Representations in Learned Environment Simulators
by: Zhang, Xinyu
Published: (2026)
Sim-to-reality adaptation for Deep Reinforcement Learning applied to an underwater docking application
by: Chaarani, Alaaeddine, et al.
Published: (2026)
N-Agent Ad Hoc Teamwork
by: Wang, Caroline, et al.
Published: (2024)
A Review of Symbolic, Subsymbolic and Hybrid Methods for Sequential Decision Making
by: Núñez-Molina, Carlos, et al.
Published: (2023)
ConfProBench: A Confidence Evaluation Benchmark for MLLM-Based Process Judges
by: Zhou, Yue, et al.
Published: (2025)
The Shortcomings of Force-from-Motion in Robot Learning
by: Aljalbout, Elie, et al.
Published: (2024)
Procedural Game Level Design with Deep Reinforcement Learning
by: Özkan, Miraç Buğra
Published: (2025)
PillagerBench: Benchmarking LLM-Based Agents in Competitive Minecraft Team Environments
by: Schipper, Olivier, et al.
Published: (2025)