:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Martin, Carlos, Sandholm, Tuomas
Format:	Preprint
Published:	2024
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2406.08687
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

ApproxED: Approximate exploitability descent via learned best responses
by: Martin, Carlos, et al.
Published: (2023)

Domain-Independent Game Abstraction using Word Embedding Techniques
by: Kim, Juho, et al.
Published: (2026)

Heuristic Pathologies and Further Variance Reduction via Uncertainty Propagation in the AIVAT Family of Techniques
by: Kim, Juho, et al.
Published: (2026)

Parallelizing Counterfactual Regret Minimization
by: Kim, Juho, et al.
Published: (2026)

General search techniques without common knowledge for imperfect-information games, and application to superhuman Fog of War chess
by: Zhang, Brian Hu, et al.
Published: (2025)

Watermarking Game-Playing Agents in Perfect-Information Extensive-Form Games
by: Kim, Juho, et al.
Published: (2026)

Optimal Correlated Equilibria in General-Sum Extensive-Form Games: Fixed-Parameter Algorithms, Hardness, and Two-Sided Column-Generation
by: Zhang, Brian, et al.
Published: (2022)

Learning Potentials for Dynamic Matching and Application to Heart Transplantation
by: Zilberstein, Itai, et al.
Published: (2026)

Near-Optimal Dynamic Matching via Coarsening with Application to Heart Transplantation
by: Zilberstein, Itai, et al.
Published: (2026)

AlphaZero-Edu: Democratizing Access to AlphaZero
by: Li, Ruitong, et al.
Published: (2025)

Faster Optimal Coalition Structure Generation via Offline Coalition Selection and Graph-Based Search
by: Taguelmimt, Redha, et al.
Published: (2024)

Joint-perturbation simultaneous pseudo-gradient
by: Martin, Carlos, et al.
Published: (2024)

Simultaneous incremental support adjustment and metagame solving: An equilibrium-finding framework for continuous-action games
by: Martin, Carlos, et al.
Published: (2024)

Solving Infinite-Player Games with Player-to-Strategy Networks
by: Martin, Carlos, et al.
Published: (2025)

Game-Theoretic Multiagent Reinforcement Learning
by: Yang, Yaodong, et al.
Published: (2020)

A Multiagent Path Search Algorithm for Large-Scale Coalition Structure Generation
by: Taguelmimt, Redha, et al.
Published: (2025)

Game-Theoretic Robust Reinforcement Learning Handles Temporally-Coupled Perturbations
by: Liang, Yongyuan, et al.
Published: (2023)

Alpha Zero for Physics: Application of Symbolic Regression with Alpha Zero to find the analytical methods in physics
by: Michishita, Yoshihiro
Published: (2023)

A multi-scale loss formulation for learning a probabilistic model with proper score optimisation
by: Lang, Simon, et al.
Published: (2025)

MiniZero: Comparative Analysis of AlphaZero and MuZero on Go, Othello, and Atari Games
by: Wu, Ti-Rong, et al.
Published: (2023)

Diversifying AI: Towards Creative Chess with AlphaZero
by: Zahavy, Tom, et al.
Published: (2023)

Convergence of $\text{log}(1/ε)$ for Gradient-Based Algorithms in Zero-Sum Games without the Condition Number: A Smoothed Analysis
by: Anagnostides, Ioannis, et al.
Published: (2024)

AlphaMath Almost Zero: Process Supervision without Process
by: Chen, Guoxin, et al.
Published: (2024)

Regret-Guided Search Control for Efficient Learning in AlphaZero
by: Tsai, Yun-Jui, et al.
Published: (2026)

Exponential Lower Bounds on the Double Oracle Algorithm in Zero-Sum Games
by: Zhang, Brian Hu, et al.
Published: (2024)

Zero loss guarantees and explicit minimizers for generic overparametrized Deep Learning networks
by: Chen, Thomas, et al.
Published: (2025)

Improving Robustness of AlphaZero Algorithms to Test-Time Environment Changes
by: Tamassia, Isidoro, et al.
Published: (2025)

Scalable Mechanism Design for Multi-Agent Path Finding
by: Friedrich, Paul, et al.
Published: (2024)

Multi-agent AI systems outperform human teams in creativity
by: Hu, Tiancheng, et al.
Published: (2026)

Representation Matters for Mastering Chess: Improved Feature Representation in AlphaZero Outperforms Switching to Transformers
by: Czech, Johannes, et al.
Published: (2023)

Finding Increasingly Large Extremal Graphs with AlphaZero and Tabu Search
by: Mehrabian, Abbas, et al.
Published: (2023)

MAPLE: Multi-State Aggregated Policy Evaluation for AlphaZero in Imperfect-Information Games
by: Li, Qian-Rong, et al.
Published: (2026)

AlphaDPO: Adaptive Reward Margin for Direct Preference Optimization
by: Wu, Junkang, et al.
Published: (2024)

Code-enabled language models can outperform reasoning models on diverse tasks
by: Zhang, Cedegao E., et al.
Published: (2025)

Pretrained deep models outperform GBDTs in Learning-To-Rank under label scarcity
by: Hou, Charlie, et al.
Published: (2023)

Randomness Is All You Need: Semantic Traversal of Problem-Solution Spaces with Large Language Models
by: Sandholm, Thomas, et al.
Published: (2024)

Towards Faster Matrix Diagonalization with Graph Isomorphism Networks and the AlphaZero Framework
by: Zollicoffer, Geigh, et al.
Published: (2024)

Steering LLMs? Actually, Sparse Autoencoders can outperform simple baselines
by: Jørgensen, Mikkel Godsk, et al.
Published: (2026)

Can OpenAI o1 outperform humans in higher-order cognitive thinking?
by: Latif, Ehsan, et al.
Published: (2024)

Dual-attention ResNet outperforms transformers in HER2 prediction on DCE-MRI
by: Fridman, Naomi, et al.
Published: (2025)