Psychometric modeling of decision making via game play
2013, 2013 IEEE Conference on Computational Inteligence in Games (CIG)
https://doi.org/10.1109/CIG.2013.6633653Abstract
We build a model for the kind of decision making involved in games of strategy such as chess, making it abstract enough to remove essentially all game-specific contingency, and compare it to known psychometric models of test taking, item response, and performance assessment. Decisions are modeled in terms of fallible agents Z faced with possible actions ai whose utilities ui = u(ai) are not fully apparent. The three main goals of the model are prediction, meaning to infer probabilities pi for Z to choose ai; intrinsic rating, meaning to assess the skill of a person's actual choices ai t over various test items t; and simulation of the distribution of choices by an agent with a specified skill set. We describe and train the model on large data from chess tournament games of different ranks of players, and exemplify its accuracy by applying it to give intrinsic ratings for world championship matches.
References (34)
- K. Regan and G. Haworth, "Intrinsic chess ratings," in Proceedings of AAAI 2011, San Francisco, August 2011.
- K. Regan, B. Macieja, and G. Haworth, "Understanding distributions of chess performances," in Proceedings of the 13th ICGA Conference on Advances in Computer Games, Tilburg, Netherlands, November 2011 2011.
- F. B. Baker, The Basics of Item Response Theory. ERIC Clearinghouse on Assessment and Evaluation, 2001.
- G. L. Thorpe and A. Favia, "Data analysis using item response theory methodology: An introduction to selected programs and applications," Psychology Faculty Scholarship, p. 20, 2012.
- G. A. Morris, L. Branum-Martin, N. Harshman, S. D. Baker, E. Mazur, S. Dutta, T. Mzoughi, , and V. McCauley, "Testing the test: Item response curves and test quality," American Journal of Physics, vol. 81, no. 144, 2013.
- G. Rasch, Probabilistic models for for some intelligence and attainment tests. Copenhagen: Danish Institute for Educational Research, 1960.
- --, "On general laws and the meaning of measurement in psychol- ogy," in Proceedings, Fourth Berkeley Symposium on Mathematical Statistics and Probability. University of California Press, 1961, pp. 321-334.
- E. Andersen, "Conditional inference for multiple-choice question- naires," Brit. J. Math. Stat. Psych., vol. 26, pp. 31-44, 1973.
- D. Andrich, Rasch Models for Measurement. Beverly Hills, California: Sage Publications, 1988.
- --, "A rating scale formulation for ordered response categories," Psychometrika, vol. 43, pp. 561-573, 1978.
- G. Masters, "A Rasch model for partial credit scoring," Psychometrika, vol. 47, pp. 149-174, 1982.
- J. M. Linacre, "Rasch analysis of rank-ordered data," Journal of Applied Measurement, vol. 7, no. 1, 2006.
- R. Ostini and M. Nering, Polytomous Item Response Theory Models. Thousand Oaks, California: Sage Publications, 2006.
- F. Wichmann and N. J. Hill, "The psychometric function: I. Fitting, sampling, and goodness of fit," Perception and Psychophysics, vol. 63, pp. 1293-1313, 2001.
- H. L. J. V. D. Maas and E.-J. Wagenmakers, "A psychometric analysis of chess expertise," American Journal of Psychology, vol. 118, pp. 29- 60, 2005.
- A. Elo, The Rating of Chessplayers, Past and Present. New York: Arco Pub., 1978.
- M. E. Glickman, "Parameter estimation in large dynamic paired com- parison experiments," Applied Statistics, vol. 48, pp. 377-394, 1999.
- P. Dangauthier, R. Herbrich, T. Minka, and T. Graepel, "TrueSkill through time: Revisiting the history of chess," Microsoft Report 74417, research.microsoft.com/pubs/74417/NIPS2007 0931.pdf, 2007, poster, 2007 Neural Information Processing (NIPS) workshop.
- T. I. Fenner, M. Levene, and G. Loizou, "A discrete evolutionary model for chess players' ratings," IEEE Trans. Comput. Intellig. and AI in Games, vol. 4, no. 2, pp. 84-93, 2012.
- A. Reibman and B. Ballard, "Non-minimax strategies for use against fallible opponents," in Proceedings, Third National Conference on Artificial Intelligence (AAAI-83), 1983.
- R. Korf, "Real-time single-agent search: first results," in Proceedings, 6th International Joint Conf. on Artificial Intelligence, 1987.
- --, "Real-time single-agent search: new results," in Proceedings, 7th International Joint Conf. on Artificial Intelligence, 1988.
- --, "Generalized game-trees," in Proceedings, 8th International Joint Conf. on Artificial Intelligence, 1989.
- P. Jansen, "KQKR: Awareness of a fallible opponent," ICCA Journal, vol. 15, pp. 111-131, 1992.
- G. Haworth, "Reference fallible endgame play," ICGA Journal, vol. 26, pp. 81-91, 2003.
- --, "Gentlemen, Stop Your Engines!" ICGA Journal, vol. 30, pp. 150-156, 2007.
- G. DiFatta, G. Haworth, and K. Regan, "Skill rating by Bayesian inference," in Proceedings, 2009 IEEE Symposium on Computational Intelligence and Data Mining (CIDM'09), Nashville, TN, March 30- April 2 2009, pp. 89-94.
- G. Haworth, K. Regan, and G. DiFatta, "Performance and prediction: Bayesian modelling of fallible choice in chess," in Proceedings, 12th ICGA Conference on Advances in Computer Games, Pamplona, Spain, May 11-13, 2009, ser. Lecture Notes in Computer Science, vol. 6048. Springer-Verlag, 2010, pp. 99-110.
- V. Rajlich and L. Kaufman, "Rybka 3 chess engine," 2008, http://www.rybkachess.com.
- M. Guid and I. Bratko, "Computer analysis of world chess champions," ICGA Journal, vol. 29, no. 2, pp. 65-73, 2006.
- --, "Using heuristic-search based engines for estimating human skill at chess," ICGA Journal, vol. 34, no. 2, pp. 71-81, 2011.
- M. Guid, A. Pérez, and I. Bratko, "How trustworthy is Crafty's analysis of world chess champions?" ICGA Journal, vol. 31, no. 3, pp. 131-144, 2008.
- J. H. Moxley, K. A. Ericsson, N. Charness, and R. T. Krampe, "The role of intuition and deliberative thinking in experts' superior tactical decision-making," Cognition, vol. 124, no. 1, pp. 72 -78, 2012. [Online]. Available: http://www.sciencedirect.com/science/article/pii/S0010027712000558
- C. Chabris and E. Hearst, "Visualization, pattern recognition, and forward search: Effects of playing speed and sight of the position on grandmaster chess errors," Cognitive Science, vol. 27, pp. 637-648, 2003.