Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Hamara, Andrew, Hamerly, Greg, Rivas, Pablo, Freeman, Andrew C.
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2506.04892
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866913877841674240
author	Hamara, Andrew Hamerly, Greg Rivas, Pablo Freeman, Andrew C.
author_facet	Hamara, Andrew Hamerly, Greg Rivas, Pablo Freeman, Andrew C.
contents	Modern chess engines achieve superhuman performance through deep tree search and regressive evaluation, while human players rely on intuition to select candidate moves followed by a shallow search to validate them. To model this intuition-driven planning process, we train a transformer encoder using supervised contrastive learning to embed board states into a latent space structured by positional evaluation. In this space, distance reflects evaluative similarity, and visualized trajectories display interpretable transitions between game states. We demonstrate that move selection can occur entirely within this embedding space by advancing toward favorable regions, without relying on deep search. Despite using only a 6-ply beam search, our model achieves an estimated Elo rating of 2593. Performance improves with both model size and embedding dimensionality, suggesting that latent planning may offer a viable alternative to traditional search. Although we focus on chess, the proposed embedding-based planning method can be generalized to other perfect-information games where state evaluations are learnable. All source code is available at https://github.com/andrewhamara/SOLIS.
format	Preprint
id	arxiv_https___arxiv_org_abs_2506_04892
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Learning to Plan via Supervised Contrastive Learning and Strategic Interpolation: A Chess Case Study Hamara, Andrew Hamerly, Greg Rivas, Pablo Freeman, Andrew C. Computer Vision and Pattern Recognition Modern chess engines achieve superhuman performance through deep tree search and regressive evaluation, while human players rely on intuition to select candidate moves followed by a shallow search to validate them. To model this intuition-driven planning process, we train a transformer encoder using supervised contrastive learning to embed board states into a latent space structured by positional evaluation. In this space, distance reflects evaluative similarity, and visualized trajectories display interpretable transitions between game states. We demonstrate that move selection can occur entirely within this embedding space by advancing toward favorable regions, without relying on deep search. Despite using only a 6-ply beam search, our model achieves an estimated Elo rating of 2593. Performance improves with both model size and embedding dimensionality, suggesting that latent planning may offer a viable alternative to traditional search. Although we focus on chess, the proposed embedding-based planning method can be generalized to other perfect-information games where state evaluations are learnable. All source code is available at https://github.com/andrewhamara/SOLIS.
title	Learning to Plan via Supervised Contrastive Learning and Strategic Interpolation: A Chess Case Study
topic	Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2506.04892

Similar Items