| Main Authors: | Martin, Carlos, Sandholm, Tuomas |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2406.08687 |
Similar Items
ApproxED: Approximate exploitability descent via learned best responses
by: Martin, Carlos, et al.
Published: (2023)
Domain-Independent Game Abstraction using Word Embedding Techniques
by: Kim, Juho, et al.
Published: (2026)
Heuristic Pathologies and Further Variance Reduction via Uncertainty Propagation in the AIVAT Family of Techniques
by: Kim, Juho, et al.
Published: (2026)
Parallelizing Counterfactual Regret Minimization
by: Kim, Juho, et al.
Published: (2026)
General search techniques without common knowledge for imperfect-information games, and application to superhuman Fog of War chess
by: Zhang, Brian Hu, et al.
Published: (2025)
Watermarking Game-Playing Agents in Perfect-Information Extensive-Form Games
by: Kim, Juho, et al.
Published: (2026)
Optimal Correlated Equilibria in General-Sum Extensive-Form Games: Fixed-Parameter Algorithms, Hardness, and Two-Sided Column-Generation
by: Zhang, Brian, et al.
Published: (2022)
Learning Potentials for Dynamic Matching and Application to Heart Transplantation
by: Zilberstein, Itai, et al.
Published: (2026)
Near-Optimal Dynamic Matching via Coarsening with Application to Heart Transplantation
by: Zilberstein, Itai, et al.
Published: (2026)
AlphaZero-Edu: Democratizing Access to AlphaZero
by: Li, Ruitong, et al.
Published: (2025)
Faster Optimal Coalition Structure Generation via Offline Coalition Selection and Graph-Based Search
by: Taguelmimt, Redha, et al.
Published: (2024)
Joint-perturbation simultaneous pseudo-gradient
by: Martin, Carlos, et al.
Published: (2024)
Simultaneous incremental support adjustment and metagame solving: An equilibrium-finding framework for continuous-action games
by: Martin, Carlos, et al.
Published: (2024)
Solving Infinite-Player Games with Player-to-Strategy Networks
by: Martin, Carlos, et al.
Published: (2025)
Game-Theoretic Multiagent Reinforcement Learning
by: Yang, Yaodong, et al.
Published: (2020)
A Multiagent Path Search Algorithm for Large-Scale Coalition Structure Generation
by: Taguelmimt, Redha, et al.
Published: (2025)
Game-Theoretic Robust Reinforcement Learning Handles Temporally-Coupled Perturbations
by: Liang, Yongyuan, et al.
Published: (2023)
Alpha Zero for Physics: Application of Symbolic Regression with Alpha Zero to find the analytical methods in physics
by: Michishita, Yoshihiro
Published: (2023)
A multi-scale loss formulation for learning a probabilistic model with proper score optimisation
by: Lang, Simon, et al.
Published: (2025)
MiniZero: Comparative Analysis of AlphaZero and MuZero on Go, Othello, and Atari Games
by: Wu, Ti-Rong, et al.
Published: (2023)
Diversifying AI: Towards Creative Chess with AlphaZero
by: Zahavy, Tom, et al.
Published: (2023)
Convergence of $\log(1/\epsilon)$ for Gradient-Based Algorithms in Zero-Sum Games without the Condition Number: A Smoothed Analysis
by: Anagnostides, Ioannis, et al.
Published: (2024)
AlphaMath Almost Zero: Process Supervision without Process
by: Chen, Guoxin, et al.
Published: (2024)
Regret-Guided Search Control for Efficient Learning in AlphaZero
by: Tsai, Yun-Jui, et al.
Published: (2026)
Exponential Lower Bounds on the Double Oracle Algorithm in Zero-Sum Games
by: Zhang, Brian Hu, et al.
Published: (2024)
Zero loss guarantees and explicit minimizers for generic overparametrized Deep Learning networks
by: Chen, Thomas, et al.
Published: (2025)
Improving Robustness of AlphaZero Algorithms to Test-Time Environment Changes
by: Tamassia, Isidoro, et al.
Published: (2025)
Scalable Mechanism Design for Multi-Agent Path Finding
by: Friedrich, Paul, et al.
Published: (2024)
Multi-agent AI systems outperform human teams in creativity
by: Hu, Tiancheng, et al.
Published: (2026)
Representation Matters for Mastering Chess: Improved Feature Representation in AlphaZero Outperforms Switching to Transformers
by: Czech, Johannes, et al.
Published: (2023)
Finding Increasingly Large Extremal Graphs with AlphaZero and Tabu Search
by: Mehrabian, Abbas, et al.
Published: (2023)
MAPLE: Multi-State Aggregated Policy Evaluation for AlphaZero in Imperfect-Information Games
by: Li, Qian-Rong, et al.
Published: (2026)
AlphaDPO: Adaptive Reward Margin for Direct Preference Optimization
by: Wu, Junkang, et al.
Published: (2024)
Code-enabled language models can outperform reasoning models on diverse tasks
by: Zhang, Cedegao E., et al.
Published: (2025)
Pretrained deep models outperform GBDTs in Learning-To-Rank under label scarcity
by: Hou, Charlie, et al.
Published: (2023)
Randomness Is All You Need: Semantic Traversal of Problem-Solution Spaces with Large Language Models
by: Sandholm, Thomas, et al.
Published: (2024)
Towards Faster Matrix Diagonalization with Graph Isomorphism Networks and the AlphaZero Framework
by: Zollicoffer, Geigh, et al.
Published: (2024)
Steering LLMs? Actually, Sparse Autoencoders can outperform simple baselines
by: Jørgensen, Mikkel Godsk, et al.
Published: (2026)
Can OpenAI o1 outperform humans in higher-order cognitive thinking?
by: Latif, Ehsan, et al.
Published: (2024)
Dual-attention ResNet outperforms transformers in HER2 prediction on DCE-MRI
by: Fridman, Naomi, et al.
Published: (2025)