Saved in:
| Main Authors: | Arjonilla, Jérôme, Saffidine, Abdallah, Cazenave, Tristan |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2408.10113 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Perfect Information Monte Carlo with Postponing Reasoning
by: Arjonilla, Jérôme, et al.
Published: (2024)
by: Arjonilla, Jérôme, et al.
Published: (2024)
Mixture of Public and Private Distributions in Imperfect Information Games
by: Arjonilla, Jérôme, et al.
Published: (2024)
by: Arjonilla, Jérôme, et al.
Published: (2024)
Deep Reinforcement Learning for 5*5 Multiplayer Go
by: Driss, Brahim, et al.
Published: (2024)
by: Driss, Brahim, et al.
Published: (2024)
Learning a Prior for Monte Carlo Search by Replaying Solutions to Combinatorial Problems
by: Cazenave, Tristan
Published: (2024)
by: Cazenave, Tristan
Published: (2024)
Monte Carlo Search Algorithms Discovering Monte Carlo Tree Search Exploration Terms
by: Cazenave, Tristan
Published: (2024)
by: Cazenave, Tristan
Published: (2024)
Monte Carlo Permutation Search
by: Cazenave, Tristan
Published: (2025)
by: Cazenave, Tristan
Published: (2025)
Generalized Nested Rollout Policy Adaptation with Limited Repetitions
by: Cazenave, Tristan
Published: (2024)
by: Cazenave, Tristan
Published: (2024)
Eterna is Solved
by: Cazenave, Tristan
Published: (2025)
by: Cazenave, Tristan
Published: (2025)
Pareto-NRPA: A Novel Monte-Carlo Search Algorithm for Multi-Objective Optimization
by: Lallouet, Noé, et al.
Published: (2025)
by: Lallouet, Noé, et al.
Published: (2025)
Minibal: Balanced Game-Playing Without Opponent Modeling
by: Cohen-Solal, Quentin, et al.
Published: (2026)
by: Cohen-Solal, Quentin, et al.
Published: (2026)
Minimax Strikes Back
by: Cohen-Solal, Quentin, et al.
Published: (2020)
by: Cohen-Solal, Quentin, et al.
Published: (2020)
On some improvements to Unbounded Minimax
by: Cohen-Solal, Quentin, et al.
Published: (2025)
by: Cohen-Solal, Quentin, et al.
Published: (2025)
Fair Railway Network Design
by: He, Zixu, et al.
Published: (2024)
by: He, Zixu, et al.
Published: (2024)
SpinGPT: A Large-Language-Model Approach to Playing Poker Correctly
by: Maugin, Narada, et al.
Published: (2025)
by: Maugin, Narada, et al.
Published: (2025)
LLMs can Schedule
by: Abgaryan, Henrik, et al.
Published: (2024)
by: Abgaryan, Henrik, et al.
Published: (2024)
Monte Carlo Graph Coloring
by: Cazenave, Tristan, et al.
Published: (2025)
by: Cazenave, Tristan, et al.
Published: (2025)
Generalized Rapid Action Value Estimation in Memory-Constrained Environments
by: Rautureau, Aloïs, et al.
Published: (2026)
by: Rautureau, Aloïs, et al.
Published: (2026)
BeeRNA: tertiary structure-based RNA inverse folding using Artificial Bee Colony
by: Mlaweh, Mehyar, et al.
Published: (2025)
by: Mlaweh, Mehyar, et al.
Published: (2025)
Refutation of Spectral Graph Theory Conjectures with Search Algorithms)
by: Roucairol, Milo, et al.
Published: (2024)
by: Roucairol, Milo, et al.
Published: (2024)
Starjob: Dataset for LLM-Driven Job Shop Scheduling
by: Abgaryan, Henrik, et al.
Published: (2025)
by: Abgaryan, Henrik, et al.
Published: (2025)
ACCORD: Autoregressive Constraint-satisfying Generation for COmbinatorial Optimization with Routing and Dynamic attention
by: Abgaryan, Henrik, et al.
Published: (2025)
by: Abgaryan, Henrik, et al.
Published: (2025)
Adaptive Bias Generalized Rollout Policy Adaptation on the Flexible Job-Shop Scheduling Problem
by: Kobrosly, Lotfi, et al.
Published: (2025)
by: Kobrosly, Lotfi, et al.
Published: (2025)
Limits of PRM-Guided Tree Search for Mathematical Reasoning with LLMs
by: Cinquin, Tristan, et al.
Published: (2025)
by: Cinquin, Tristan, et al.
Published: (2025)
HalluSearch at SemEval-2025 Task 3: A Search-Enhanced RAG Pipeline for Hallucination Detection
by: Abdallah, Mohamed A., et al.
Published: (2025)
by: Abdallah, Mohamed A., et al.
Published: (2025)
Guiding Exploration in Reinforcement Learning Through LLM-Augmented Observations
by: Jain, Vaibhav, et al.
Published: (2025)
by: Jain, Vaibhav, et al.
Published: (2025)
Counting Reward Automata: Sample Efficient Reinforcement Learning Through the Exploitation of Reward Function Structure
by: Bester, Tristan, et al.
Published: (2023)
by: Bester, Tristan, et al.
Published: (2023)
Enhancing Reinforcement Learning for the Floorplanning of Analog ICs with Beam Search
by: Della Rovere, Sandro Junior, et al.
Published: (2025)
by: Della Rovere, Sandro Junior, et al.
Published: (2025)
Learning to Better Search with Language Models via Guided Reinforced Self-Training
by: Moon, Seungyong, et al.
Published: (2024)
by: Moon, Seungyong, et al.
Published: (2024)
Refuting the Direct Sum Conjecture for Total Functions in Deterministic Communication Complexity
by: Mackenzie, Simon, et al.
Published: (2024)
by: Mackenzie, Simon, et al.
Published: (2024)
Polynomial Prenexing of QBFs with Non-Monotone Boolean Operators
by: Saffidine, Abdallah, et al.
Published: (2025)
by: Saffidine, Abdallah, et al.
Published: (2025)
XAI-based Feature Ensemble for Enhanced Anomaly Detection in Autonomous Driving Systems
by: Nazat, Sazid, et al.
Published: (2024)
by: Nazat, Sazid, et al.
Published: (2024)
Synthesizing Programmatic Reinforcement Learning Policies with Large Language Model Guided Search
by: Liu, Max, et al.
Published: (2024)
by: Liu, Max, et al.
Published: (2024)
ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning
by: Chen, Mingyang, et al.
Published: (2025)
by: Chen, Mingyang, et al.
Published: (2025)
Policy Guided Tree Search for Enhanced LLM Reasoning
by: Li, Yang
Published: (2025)
by: Li, Yang
Published: (2025)
Opinion-Guided Reinforcement Learning
by: Dagenais, Kyanna, et al.
Published: (2024)
by: Dagenais, Kyanna, et al.
Published: (2024)
Subgoal-Guided Policy Heuristic Search with Learned Subgoals
by: Tuero, Jake, et al.
Published: (2025)
by: Tuero, Jake, et al.
Published: (2025)
Robust Reinforcement Learning Objectives for Sequential Recommender Systems
by: Mozifian, Melissa, et al.
Published: (2023)
by: Mozifian, Melissa, et al.
Published: (2023)
Hybrid Reinforcement Learning and Search for Flight Trajectory Planning
by: Luise, Alberto, et al.
Published: (2025)
by: Luise, Alberto, et al.
Published: (2025)
Certificate-Guided Evaluation of Reinforcement Learning Generalization
by: Subramanian, Vignesh, et al.
Published: (2026)
by: Subramanian, Vignesh, et al.
Published: (2026)
AutoSearch: Adaptive Search Depth for Efficient Agentic RAG via Reinforcement Learning
by: Sun, Jingbo, et al.
Published: (2026)
by: Sun, Jingbo, et al.
Published: (2026)
Similar Items
-
Perfect Information Monte Carlo with Postponing Reasoning
by: Arjonilla, Jérôme, et al.
Published: (2024) -
Mixture of Public and Private Distributions in Imperfect Information Games
by: Arjonilla, Jérôme, et al.
Published: (2024) -
Deep Reinforcement Learning for 5*5 Multiplayer Go
by: Driss, Brahim, et al.
Published: (2024) -
Learning a Prior for Monte Carlo Search by Replaying Solutions to Combinatorial Problems
by: Cazenave, Tristan
Published: (2024) -
Monte Carlo Search Algorithms Discovering Monte Carlo Tree Search Exploration Terms
by: Cazenave, Tristan
Published: (2024)