SVM and Pattern-Enriched Common Fate Graphs for the Game of Go
Abstract
We propose a pattern-based approach combined with the concept of Enriched Common Fate Graph for the problem of classifying Go positions. A kernel function for weighted graphs is proposed to compute the similarity between two board positions, and is used to train a support vector machine that addresses the problem of position evaluation. Numerical simulations carried out on a set of human-played games show the relevance of our approach.
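Only the abstract is reproduced here, so the following Python sketch illustrates the general pipeline it describes rather than the paper's actual construction: a board is collapsed into a common-fate-graph-like representation (orthogonally connected stones of one colour become a single node), a trivial node-label-histogram kernel stands in for the proposed weighted-graph kernel, and the resulting Gram matrix is passed to an SVM with a precomputed kernel. All names and the choice of node labels are illustrative assumptions.

```python
# Rough sketch only: a toy "common fate graph" + graph-kernel SVM pipeline.
# The paper proposes a dedicated kernel for weighted, pattern-enriched graphs;
# here a simple node-label histogram kernel is used as a stand-in.
import numpy as np
from sklearn.svm import SVC

EMPTY, BLACK, WHITE = 0, 1, 2

def common_fate_graph(board):
    """Merge orthogonally connected stones of one colour into single nodes.

    Returns a list of node labels (colour, clipped group size); empty points
    stay as individual nodes, as in the common fate graph construction.
    """
    n = board.shape[0]
    seen = np.zeros_like(board, dtype=bool)
    labels = []
    for i in range(n):
        for j in range(n):
            if seen[i, j]:
                continue
            colour = board[i, j]
            stack, group = [(i, j)], []   # flood fill this point's group
            seen[i, j] = True
            while stack:
                x, y = stack.pop()
                group.append((x, y))
                if colour == EMPTY:
                    continue              # empty points remain single nodes
                for dx, dy in ((1, 0), (-1, 0), (0, 1), (0, -1)):
                    u, v = x + dx, y + dy
                    if 0 <= u < n and 0 <= v < n and not seen[u, v] and board[u, v] == colour:
                        seen[u, v] = True
                        stack.append((u, v))
            labels.append((int(colour), min(len(group), 6)))  # clip size to keep labels coarse
    return labels

def label_histogram_kernel(g1, g2):
    """Trivial graph kernel: dot product of node-label histograms."""
    keys = set(g1) | set(g2)
    h1 = np.array([g1.count(k) for k in keys], dtype=float)
    h2 = np.array([g2.count(k) for k in keys], dtype=float)
    return float(h1 @ h2)

def train_position_classifier(boards, outcomes):
    """Fit an SVM on a precomputed Gram matrix of pairwise graph similarities."""
    graphs = [common_fate_graph(b) for b in boards]
    gram = np.array([[label_histogram_kernel(a, b) for b in graphs] for a in graphs])
    return SVC(kernel="precomputed").fit(gram, outcomes), graphs
```

Swapping `label_histogram_kernel` for a richer weighted-graph kernel, and enriching node labels with local patterns, is where the paper's actual contribution would plug in.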
Related papers
Go is an ancient board game that poses unique opportunities and challenges for AI and machine learning. Here we develop a machine learning approach to Go, and related board games, focusing primarily on the problem of learning a good evaluation function in a scalable way. Scalability is essential at multiple levels, from the library of local tactical patterns, to the integration of patterns across the board, to the size of the board itself. The system we propose is capable of automatically learning the propensity of local patterns from a library of games. Propensity and other local tactical information are fed into a recursive neural network, derived from a Bayesian network architecture. The network integrates local information across the board and produces local outputs that represent local territory ownership probabilities. The aggregation of these probabilities provides an effective strategic evaluation function that is an estimate of the expected area at the end (or at other stages) of the game. Local area targets for training can be derived from datasets of human games. A system trained using only 9 × 9 amateur game data performs surprisingly well on a test set derived from 19 × 19 professional game data. Possible directions for further improvements are briefly discussed.
Go is an ancient board game that poses unique opportunities and challenges for artificial intelligence. Currently, there are no computer Go programs that can play at the level of a good human player. However, the emergence of large repositories of games is opening the door for new machine learning approaches to address this challenge. Here we develop a machine learning approach to Go, and related board games, focusing primarily on the problem of learning a good evaluation function in a scalable way. Scalability is essential at multiple levels, from the library of local tactical patterns, to the integration of patterns across the board, to the size of the board itself. The system we propose is capable of automatically learning the propensity of local patterns from a library of games. Propensity and other local tactical information are fed into recursive neural networks, derived from a probabilistic Bayesian network architecture. The recursive neural networks in turn integrate local information across the board in all four cardinal directions and produce local outputs that represent local territory ownership probabilities. The aggregation of these probabilities provides an effective strategic evaluation function that is an estimate of the expected area at the end, or at various other stages, of the game. Local area targets for training can be derived from datasets of games played by human players. In this approach, while requiring a learning time proportional to N⁴, skills learned on a board of size N² can easily be transferred to boards of other sizes. A system trained using only 9 × 9 amateur game data performs surprisingly well on a test set derived from 19 × 19 professional game data. Possible directions for further improvements are briefly discussed.
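As a rough illustration of the architecture sketched in the abstract above, the toy NumPy code below propagates local features across the board in the four cardinal directions, squashes the combined context into per-intersection ownership probabilities, and sums them into an expected-area evaluation. Weights are random and untrained, dimensions are invented, and this is not the paper's recursive/Bayesian-derived architecture.

```python
# Toy illustration of directional propagation + territory aggregation.
# Untrained random weights and invented feature sizes; not the paper's model.
import numpy as np

rng = np.random.default_rng(0)
N, F, H = 9, 3, 8                       # board size, input features, hidden units
W_in = rng.normal(size=(F, H)) * 0.1    # shared input -> hidden weights
W_rec = rng.normal(size=(H, H)) * 0.1   # shared hidden -> hidden (carried context) weights
w_out = rng.normal(size=(4 * H,)) * 0.1

def directional_scan(feats, axis, reverse):
    """Propagate a hidden state along one cardinal direction of the grid."""
    x = np.moveaxis(feats, axis, 0)
    if reverse:
        x = x[::-1]
    h = np.zeros((x.shape[1], H))
    out = np.zeros((x.shape[0], x.shape[1], H))
    for t in range(x.shape[0]):
        h = np.tanh(x[t] @ W_in + h @ W_rec)  # local input combined with carried context
        out[t] = h
    if reverse:
        out = out[::-1]
    return np.moveaxis(out, 0, axis)

def territory_evaluation(feats):
    """Per-point Black-ownership probabilities and their sum (expected Black area)."""
    scans = [directional_scan(feats, axis, rev) for axis in (0, 1) for rev in (False, True)]
    hidden = np.concatenate(scans, axis=-1)           # N x N x 4H
    probs = 1.0 / (1.0 + np.exp(-(hidden @ w_out)))   # N x N ownership probabilities
    return probs, probs.sum()

# Example: encode a board as one-hot planes (empty, black, white) and evaluate it.
board = np.zeros((N, N), dtype=int)
feats = np.stack([(board == v).astype(float) for v in (0, 1, 2)], axis=-1)
probs, expected_black_area = territory_evaluation(feats)
```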
2015 IEEE Conference on Computational Intelligence and Games (CIG), 2015
We propose a way of extracting and aggregating per-move evaluations from sets of Go game records. The evaluations capture different aspects of the games, such as the patterns played or statistics of sente/gote sequences. Using machine learning algorithms, the evaluations can be utilized to predict different relevant target variables. We apply this methodology to predict the strength and playing style of the player (e.g. territoriality or aggressivity) with good accuracy. We propose a number of possible applications, including aiding in Go study, seeding real-world ranks of internet players, or tuning of Go-playing programs.
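The abstract above does not list the concrete per-move evaluations, so the sketch below only illustrates the overall recipe with two made-up statistics (distance of each move from the previous move, and distance from the board edge), aggregated into a fixed-length vector per game and fed to an off-the-shelf regressor that predicts player strength.

```python
# Illustrative recipe only: made-up per-move statistics, aggregated per game,
# then a standard regressor predicting the player's strength (rank).
import numpy as np
from sklearn.ensemble import RandomForestRegressor

def per_move_evaluations(moves, board_size=19):
    """moves: list of (x, y) coordinates played by one player in one game."""
    feats, prev = [], None
    for x, y in moves:
        dist_prev = 0.0 if prev is None else float(abs(x - prev[0]) + abs(y - prev[1]))
        dist_edge = float(min(x, y, board_size - 1 - x, board_size - 1 - y))
        feats.append((dist_prev, dist_edge))
        prev = (x, y)
    return np.array(feats)

def aggregate(per_move):
    """Collapse variable-length per-move evaluations into one fixed-length vector."""
    return np.concatenate([per_move.mean(axis=0), per_move.std(axis=0)])

def fit_strength_model(games, ranks):
    """games: list of move lists; ranks: numeric strength labels (e.g. kyu/dan scale)."""
    X = np.array([aggregate(per_move_evaluations(g)) for g in games])
    return RandomForestRegressor(n_estimators=200, random_state=0).fit(X, ranks)
```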
… of the IJCAI Workshop on General …, 2011
General Game Playing (GGP) is concerned with the development of programs that are able to play previously unknown games well. The main problem such a player faces is to come up with a good heuristic evaluation function automatically. Part of these heuristics are distance measures used to estimate, e.g., the distance of a pawn towards the promotion rank. However, current distance heuristics in GGP are based on overly specific detection patterns as well as expensive internal simulations; they are limited to the scope of totally ordered domains and/or they apply a uniform Manhattan distance heuristic regardless of the move pattern of the object involved. In this paper we describe a method to automatically construct distance measures by analyzing the game rules. The presented method is an improvement over all previously presented distance estimation methods, because it is not limited to specific structures such as game boards. Furthermore, the constructed distance measures are admissible. We demonstrate how to use the distance measures in an evaluation function of a general game player and show the effectiveness of our approach by comparing with a state-of-the-art player.
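As a generic illustration of why such constructed distances can be admissible (not the paper's rule-analysis procedure, which derives the move relation automatically from the game description), the sketch below computes a distance by breadth-first search over a piece's unrestricted move graph; ignoring blockers and opponent play means the result can only underestimate the true number of moves needed.

```python
# Generic illustration of an admissible distance: BFS over a piece's
# unrestricted move graph lower-bounds the true number of moves required.
# The move relation is supplied by hand here instead of derived from the rules.
from collections import deque

def admissible_distance(start, targets, successors):
    """Fewest single moves from `start` to any cell in `targets`.

    successors(cell) -> cells reachable in one move, ignoring other pieces,
    so the result is a lower bound on (i.e. admissible for) the real distance.
    """
    frontier, seen = deque([(start, 0)]), {start}
    while frontier:
        cell, d = frontier.popleft()
        if cell in targets:
            return d
        for nxt in successors(cell):
            if nxt not in seen:
                seen.add(nxt)
                frontier.append((nxt, d + 1))
    return float("inf")  # target unreachable under this move pattern

# Example: a white pawn's pushes towards the promotion rank on an 8x8 board.
def pawn_pushes(cell):
    col, row = cell
    return [(col, row + 1)] if row + 1 <= 7 else []

dist = admissible_distance((4, 1), {(c, 7) for c in range(8)}, pawn_pushes)  # -> 6
```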
Proceedings of the 2nd Annual European …, 2001
In this paper, a three-component architecture of a learning environment for Go is sketched, which can be applied to any two-player, deterministic, full information, partizan, combinatorial game. The architecture, called HUGO, has natural and human-like reasoning components. Its most abstract component deals with the selection of subgames of Go. The second component is concerned with initiative. The notion of gote no sente (a move that loses initiative but creates new lines of play that will hold initiative) is formalized. In the third component, game values are computed with a new kind of α-β algorithm based on a fuzzy, partial ordering. Our approach leaves some valuable control parameters and offers ways to apply further machine learning techniques.
International Conference on Computational Intelligence for Modelling, Control and Automation and International Conference on Intelligent Agents, Web Technologies and Internet Commerce (CIMCA-IAWTIC'06)
We present an approach for improving computer Go programs called CSDTA (Canonical Sequence Directed Tactics Analyzer), which analyzes the local tactics on a quadrant of the standard Go board based on a collection of canonical sequences (Joseki). We collect 1278 canonical sequences and their deviations in our system. Instead of trivially matching the current game against the collected canonical sequences, we define a notion of similar sequences with respect to the current game. This paper also explains how to extract the most suitable move from the candidate sequences for the next move. The simplicity of our method and its positive outcome make our approach suitable to be integrated into a complete computer Go program for foreseeable improvement.
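The abstract above does not define "similar sequences" precisely, so the sketch below uses a deliberately simplified stand-in: corner sequences are normalised under the corner's diagonal symmetry, the library sequence sharing the longest common prefix with the current local sequence is selected, and its continuation supplies the candidate next move.

```python
# Simplified stand-in for matching the current local sequence against a library
# of canonical sequences (joseki); the paper's similarity criteria are richer.
def normalise(seq):
    """Canonical form of a corner sequence under the swap-coordinates symmetry."""
    mirrored = [(y, x) for x, y in seq]
    return min(list(seq), mirrored)

def common_prefix_len(a, b):
    n = 0
    for p, q in zip(a, b):
        if p != q:
            break
        n += 1
    return n

def suggest_next_move(current, library):
    """library: list of canonical move sequences; current: moves seen in this corner."""
    cur = normalise(current)
    best, best_len = None, -1
    for joseki in library:
        jos = normalise(joseki)
        k = common_prefix_len(cur, jos)
        # candidate only if the canonical sequence continues past what has been played
        if k == len(cur) and len(jos) > len(cur) and k > best_len:
            best, best_len = jos[len(cur)], k
    return best  # None if no canonical sequence matches

library = [[(3, 4), (5, 3), (2, 2)], [(3, 4), (3, 2), (5, 3)]]
print(suggest_next_move([(3, 4), (5, 3)], library))  # -> (2, 2)
```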
1994
A first-order system, PAL, that can learn Chess patterns in the form of Horn clauses from simple example descriptions and general purpose knowledge about Chess is described. This is the first time that Chess patterns which can be used for over-the-board play have been learned. To test if the patterns learned by PAL can be used to play, a simple playing strategy for the King and Rook against King (KRK) endgame was constructed with patterns learned by PAL. Limitations of PAL in particular, and first-order systems in general, are exposed in domains like Chess where a large number of background definitions may be required for induction. Conclusions and future research directions are given.
In ranking problems, the goal is to learn a ranking function r(x) ∈ ℝ from labeled pairs x, x′ of input points. In this paper, we consider the related comparison problem, where the label y ∈ {−1, 0, 1} indicates which element of the pair is better (y = −1 or 1), or if there is no significant difference (y = 0). We cast the learning problem as a margin maximization, and show that it can be solved by converting it to a standard SVM. We use simulated nonlinear patterns and a real learning-to-rank sushi data set to show that our proposed SVMcompare algorithm outperforms SVMrank when there are equality y = 0 pairs. In addition, we show that SVMcompare outperforms the ELO rating system when predicting the outcome of chess matches.
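Reading the abstract above, one natural way to write the comparison problem as a margin maximization is the following; the exact thresholds and slack handling used by SVMcompare may differ, so treat this as an assumption rather than the paper's formulation. Inequality pairs must be separated by at least a unit ranking margin, while "no significant difference" pairs must stay inside it.

```latex
% One plausible formalisation of the comparison margin, for a linear-in-features
% scorer r(x) = w^T phi(x); thresholds are an assumption based on the abstract.
\min_{w,\; \xi \ge 0} \ \tfrac{1}{2}\lVert w \rVert^{2} + C \sum_{i} \xi_{i}
\qquad \text{s.t.} \qquad
\begin{cases}
  y_{i}\bigl(r(x_{i}') - r(x_{i})\bigr) \ge 1 - \xi_{i}, & y_{i} \in \{-1, +1\},\\[4pt]
  \bigl|\, r(x_{i}') - r(x_{i}) \,\bigr| \le 1 + \xi_{i}, & y_{i} = 0 .
\end{cases}
```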
Eighth International Conference on Hybrid Intelligent …, 2008
2022
In many board games and other abstract games, patterns have been used as features that can guide automated game-playing agents. Such patterns or features often represent particular configurations of pieces, empty positions, etc., which may be relevant for a game's strategies. Their use has been particularly prevalent in the game of Go, but also in many other games used as benchmarks for AI research. Simple, linear policies over such features are unlikely to produce state-of-the-art playing strength like the deep neural networks that have been more commonly used in recent years. However, they typically require significantly fewer resources to train, which is paramount for large-scale studies of hundreds to thousands of distinct games. In this paper, we formulate a design and efficient implementation of spatial state-action features for general games. These are patterns that can be trained to incentivise or disincentivise actions based on whether or not they match variables of the s...
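As a minimal sketch of the idea described above (not the paper's representation or training procedure), the code below encodes each spatial state-action feature as a small set of required-content tests at offsets relative to a candidate move, and a linear policy scores legal moves by summing the weights of the features they match.

```python
# Minimal sketch of spatial state-action features driving a linear policy.
# Feature encoding, offsets and weights are illustrative, not the paper's.
import numpy as np

EMPTY, FRIEND, ENEMY, OFF_BOARD = 0, 1, 2, 3

# A feature is a list of (dx, dy, required_content) tests relative to the move,
# plus a weight that would normally be trained rather than hand-set.
FEATURES = [
    ([(0, 1, ENEMY), (0, -1, ENEMY)], 1.5),   # e.g. wedge between two enemy stones
    ([(1, 0, FRIEND)], 0.4),                  # e.g. extend from a friendly stone
    ([(0, 1, OFF_BOARD)], -0.6),              # e.g. discourage first-line moves
]

def content(board, x, y, player):
    """What a feature test sees at (x, y) from `player`'s point of view."""
    n = board.shape[0]
    if not (0 <= x < n and 0 <= y < n):
        return OFF_BOARD
    if board[x, y] == 0:
        return EMPTY
    return FRIEND if board[x, y] == player else ENEMY

def feature_matches(board, move, player, tests):
    x, y = move
    return all(content(board, x + dx, y + dy, player) == req for dx, dy, req in tests)

def score_move(board, move, player):
    """Linear policy: sum the weights of all spatial features matching this move."""
    return sum(w for tests, w in FEATURES if feature_matches(board, move, player, tests))

def best_move(board, legal_moves, player):
    return max(legal_moves, key=lambda m: score_move(board, m, player))
```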
