Saved in:
| Main Author: | Nightingale, Peter |
|---|---|
| Format: | Preprint |
| Published: |
2021
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2201.03472 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Algorithm Selection for Optimal Multi-Agent Path Finding via Graph Embedding
by: Shabalin, Carmel, et al.
Published: (2024)
by: Shabalin, Carmel, et al.
Published: (2024)
Unveiling Interesting Insights: Monte Carlo Tree Search for Knowledge Discovery
by: Totis, Pietro, et al.
Published: (2025)
by: Totis, Pietro, et al.
Published: (2025)
AI Agents: Evolution, Architecture, and Real-World Applications
by: Krishnan, Naveen
Published: (2025)
by: Krishnan, Naveen
Published: (2025)
Cooperative Patrol Routing: Optimizing Urban Crime Surveillance through Multi-Agent Reinforcement Learning
by: Palma-Borda, Juan, et al.
Published: (2025)
by: Palma-Borda, Juan, et al.
Published: (2025)
Efficient Contextual Preferential Bayesian Optimization with Historical Examples
by: Khan, Farha A., et al.
Published: (2022)
by: Khan, Farha A., et al.
Published: (2022)
Scalable Heterogeneous Graph Foundation Models for Data-Driven Optimal Power Flow in Smart Grids
by: Pasini, Massimiliano Lupo, et al.
Published: (2026)
by: Pasini, Massimiliano Lupo, et al.
Published: (2026)
Two-phase Optimization of Binary Sequences with Low Peak Sidelobe Level Value
by: Bošković, Borko, et al.
Published: (2021)
by: Bošković, Borko, et al.
Published: (2021)
Evolving A* to Efficiently Solve the k Shortest-Path Problem (Extended Version)
by: López, Carlos Linares, et al.
Published: (2024)
by: López, Carlos Linares, et al.
Published: (2024)
The Traveling Thief Problem with Time Windows: Benchmarks and Heuristics
by: Angmalisang, Helen Yuliana, et al.
Published: (2026)
by: Angmalisang, Helen Yuliana, et al.
Published: (2026)
STRIDE: A Self-Reflective Agent Framework for Reliable Automatic Equation Discovery
by: Su, Jiarui, et al.
Published: (2026)
by: Su, Jiarui, et al.
Published: (2026)
Proving Olympiad Algebraic Inequalities without Human Demonstrations
by: Wei, Chenrui, et al.
Published: (2024)
by: Wei, Chenrui, et al.
Published: (2024)
From Imitation to Interaction: Mastering Game of Schnapsen with Shallow Reinforcement Learning
by: Klačan, Ján, et al.
Published: (2026)
by: Klačan, Ján, et al.
Published: (2026)
InterEvo-TR: Interactive Evolutionary Test Generation With Readability Assessment
by: Delgado-Pérez, Pedro, et al.
Published: (2024)
by: Delgado-Pérez, Pedro, et al.
Published: (2024)
Diffusion-MPC in Discrete Domains: Feasibility Constraints, Horizon Effects, and Critic Alignment: Case study with Tetris
by: Wang, Haochuan Kevin
Published: (2026)
by: Wang, Haochuan Kevin
Published: (2026)
Statistical Guarantees for Lifelong Reinforcement Learning using PAC-Bayes Theory
by: Zhang, Zhi, et al.
Published: (2024)
by: Zhang, Zhi, et al.
Published: (2024)
JCLEC-MO: a Java suite for solving many-objective optimization engineering problems
by: Ramírez, Aurora, et al.
Published: (2024)
by: Ramírez, Aurora, et al.
Published: (2024)
Agentic, Context-Aware Risk Intelligence in the Internet of Value
by: Magableh, Basel, et al.
Published: (2026)
by: Magableh, Basel, et al.
Published: (2026)
Is there a half-life for the success rates of AI agents?
by: Ord, Toby
Published: (2025)
by: Ord, Toby
Published: (2025)
PLUGH: A Benchmark for Spatial Understanding and Reasoning in Large Language Models
by: Tikhonov, Alexey
Published: (2024)
by: Tikhonov, Alexey
Published: (2024)
Murphys Laws of AI Alignment: Why the Gap Always Wins
by: Gaikwad, Madhava
Published: (2025)
by: Gaikwad, Madhava
Published: (2025)
STACHE: Local Black-Box Explanations for Reinforcement Learning Policies
by: Elashkin, Andrew, et al.
Published: (2025)
by: Elashkin, Andrew, et al.
Published: (2025)
Space Adaptive Search for Nonholonomic Mobile Robots Path Planning
by: Wang, Qi
Published: (2024)
by: Wang, Qi
Published: (2024)
CAPE: Corrective Actions from Precondition Errors using Large Language Models
by: Raman, Shreyas Sundara, et al.
Published: (2022)
by: Raman, Shreyas Sundara, et al.
Published: (2022)
Maximizing Rollout Informativeness under a Fixed Budget: A Submodular View of Tree Search for Tool-Use Agentic Reinforcement Learning
by: Hu, Yuelin, et al.
Published: (2026)
by: Hu, Yuelin, et al.
Published: (2026)
A Balanced Approach of Rapid Genetic Exploration and Surrogate Exploitation for Hyperparameter Optimization
by: Kim, Chul, et al.
Published: (2025)
by: Kim, Chul, et al.
Published: (2025)
EduQate: Generating Adaptive Curricula through RMABs in Education Settings
by: Tio, Sidney, et al.
Published: (2024)
by: Tio, Sidney, et al.
Published: (2024)
Adaptive Minds: Empowering Agents with LoRA-as-Tools
by: Shekar, Pavan C, et al.
Published: (2025)
by: Shekar, Pavan C, et al.
Published: (2025)
Improving LLM Agent Planning with In-Context Learning via Atomic Fact Augmentation and Lookahead Search
by: Holt, Samuel, et al.
Published: (2025)
by: Holt, Samuel, et al.
Published: (2025)
Intersymbolic AI: Interlinking Symbolic AI and Subsymbolic AI
by: Platzer, André
Published: (2024)
by: Platzer, André
Published: (2024)
Necessary and Sufficient Conditions for Optimal Decision Trees using Dynamic Programming
by: van der Linden, Jacobus G. M., et al.
Published: (2023)
by: van der Linden, Jacobus G. M., et al.
Published: (2023)
Resource-constrained Amazons chess decision framework integrating large language models and graph attention
by: Qian, Tianhao, et al.
Published: (2026)
by: Qian, Tianhao, et al.
Published: (2026)
SMOSE: Sparse Mixture of Shallow Experts for Interpretable Reinforcement Learning in Continuous Control Tasks
by: Vincze, Mátyás, et al.
Published: (2024)
by: Vincze, Mátyás, et al.
Published: (2024)
Learning Affordances at Inference-Time for Vision-Language-Action Models
by: Shah, Ameesh, et al.
Published: (2025)
by: Shah, Ameesh, et al.
Published: (2025)
Overcoming Over-Fitting in Constraint Acquisition via Query-Driven Interactive Refinement
by: Balafas, Vasileios, et al.
Published: (2025)
by: Balafas, Vasileios, et al.
Published: (2025)
Task and Motion Planning in Hierarchical 3D Scene Graphs
by: Ray, Aaron, et al.
Published: (2024)
by: Ray, Aaron, et al.
Published: (2024)
A Hormetic Approach to the Value-Loading Problem: Preventing the Paperclip Apocalypse?
by: Henry, Nathan I. N., et al.
Published: (2024)
by: Henry, Nathan I. N., et al.
Published: (2024)
Learning Abstract Visual Reasoning via Task Decomposition: A Case Study in Raven Progressive Matrices
by: Kwiatkowski, Jakub, et al.
Published: (2023)
by: Kwiatkowski, Jakub, et al.
Published: (2023)
CogniLoad: A Synthetic Natural Language Reasoning Benchmark With Tunable Length, Intrinsic Difficulty, and Distractor Density
by: Kaiser, Daniel, et al.
Published: (2025)
by: Kaiser, Daniel, et al.
Published: (2025)
Conditional Temporal Neural Processes with Covariance Loss
by: Yoo, Boseon, et al.
Published: (2025)
by: Yoo, Boseon, et al.
Published: (2025)
KGMark: A Diffusion Watermark for Knowledge Graphs
by: Peng, Hongrui, et al.
Published: (2025)
by: Peng, Hongrui, et al.
Published: (2025)
Similar Items
-
Algorithm Selection for Optimal Multi-Agent Path Finding via Graph Embedding
by: Shabalin, Carmel, et al.
Published: (2024) -
Unveiling Interesting Insights: Monte Carlo Tree Search for Knowledge Discovery
by: Totis, Pietro, et al.
Published: (2025) -
AI Agents: Evolution, Architecture, and Real-World Applications
by: Krishnan, Naveen
Published: (2025) -
Cooperative Patrol Routing: Optimizing Urban Crime Surveillance through Multi-Agent Reinforcement Learning
by: Palma-Borda, Juan, et al.
Published: (2025) -
Efficient Contextual Preferential Bayesian Optimization with Historical Examples
by: Khan, Farha A., et al.
Published: (2022)