People

Ing. Michal Šustr, Ph.D.

All publications

Learning not to regret

  • Authors: Sychrovský, D., Ing. Michal Šustr, Ph.D., Davoodi, E., Bowling, M., Lanctot, M., Schmid, M.
  • Publication: Proceedings of the 38th AAAI Conference on Artificial Intelligence. Menlo Park: AAAI Press, 2024. p. 15202-15210. AAAI Conference on Artificial Intelligence. vol. 38. ISSN 2374-3468. ISBN 978-1-57735-887-9.
  • Year: 2024
  • DOI: 10.1609/aaai.v38i14.29443
  • Link: https://doi.org/10.1609/aaai.v38i14.29443
  • Department: Centrum umělé inteligence
  • Abstract:
    The literature on game-theoretic equilibrium finding predominantly focuses on single games or their repeated play. Nevertheless, numerous real-world scenarios feature playing a game sampled from a distribution of similar, but not identical games, such as playing poker with different public cards or trading correlated assets on the stock market. As these similar games feature similar equilibria, we investigate a way to accelerate equilibrium finding on such a distribution. We present a novel "learning not to regret" framework, enabling us to meta-learn a regret minimizer tailored to a specific distribution. Our key contribution, Neural Predictive Regret Matching, is uniquely meta-learned to converge rapidly for the chosen distribution of games, while having regret minimization guarantees on any game. We validated our algorithms' faster convergence on a distribution of river poker games. Our experiments show that the meta-learned algorithms outpace their non-meta-learned counterparts, achieving more than tenfold improvements.
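For intuition, the plain regret-matching rule that the paper's meta-learned minimizer builds on can be sketched in a few lines. This is an illustrative, self-contained example on rock-paper-scissors, not the paper's code; the game, function names, and iteration count are assumptions made for the demo.

```python
# Full-width regret-matching self-play on rock-paper-scissors (illustrative
# sketch only; the paper meta-learns a neural predictive variant of this rule).

PAYOFF = [[0, -1, 1],   # antisymmetric payoff matrix:
          [1, 0, -1],   # PAYOFF[my_action][opp_action] is my utility
          [-1, 1, 0]]   # (rock=0, paper=1, scissors=2)

def regret_matching(cum_regret):
    """Play each action in proportion to its positive cumulative regret;
    fall back to uniform when no action has positive regret."""
    pos = [max(r, 0.0) for r in cum_regret]
    total = sum(pos)
    n = len(cum_regret)
    return [p / total for p in pos] if total > 0 else [1.0 / n] * n

def expected_utils(opp):
    """Expected utility of each action against a mixed opponent strategy."""
    return [sum(PAYOFF[a][b] * opp[b] for b in range(3)) for a in range(3)]

def self_play(iterations=10_000):
    # Asymmetric initial regrets so the dynamics are non-trivial.
    regrets = [[1.0, 0.0, 0.0], [0.0, 0.0, 0.0]]
    avg = [[0.0] * 3, [0.0] * 3]
    for _ in range(iterations):
        strats = [regret_matching(r) for r in regrets]
        for p in (0, 1):
            utils = expected_utils(strats[1 - p])
            ev = sum(u * s for u, s in zip(utils, strats[p]))
            for a in range(3):
                regrets[p][a] += utils[a] - ev  # instantaneous regret
                avg[p][a] += strats[p][a]
    return [[x / iterations for x in row] for row in avg]
```

The average strategies converge toward the uniform Nash equilibrium at the standard O(T^(-1/2)) no-regret rate; the paper's contribution is meta-learning a predictive variant of this update that converges faster on a chosen distribution of games while keeping such worst-case guarantees.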

Sound Algorithms in Imperfect Information Games

  • Authors: Ing. Michal Šustr, Ph.D., Schmid, M., Moravčík, M., Burch, N., Lanctot, M., Bowling, M.
  • Publication: AAMAS '21: 20th International Conference on Autonomous Agents and Multiagent Systems, Virtual Event, United Kingdom, May 3-7, 2021. New York: ACM, 2021. p. 1662-1664. ISSN 1548-8403. ISBN 978-1-7138-3262-1.
  • Year: 2021
  • Department: Katedra počítačů, Centrum umělé inteligence
  • Abstract:
    Search has played a fundamental role in computer game research since the very beginning. While online search has been commonly used in perfect information games such as Chess and Go, online search methods for imperfect information games have been introduced only relatively recently. This paper addresses the question of what constitutes a sound online algorithm in the imperfect information setting of two-player zero-sum games. We argue that the fixed-strategy definitions of exploitability and epsilon-Nash equilibria are ill-suited to measure the worst-case performance of an online algorithm. We thus formalize epsilon-soundness, a concept that connects the worst-case performance of an online algorithm to the performance of an epsilon-Nash equilibrium. Our definition of soundness and the consistency hierarchy finally provide appropriate tools to analyze online algorithms in repeated imperfect information games. We thus inspect some of the previous online algorithms in a new light, bringing new insights into their worst-case performance guarantees.
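The fixed-strategy notion of exploitability discussed in the abstract can be computed directly for a matrix game. The sketch below is illustrative only (the game, function names, and the assumption of a known game value are not from the paper):

```python
# Exploitability of a fixed row strategy in a zero-sum matrix game:
# how far the strategy's guaranteed payoff falls short of the game value
# when the opponent best-responds (illustrative sketch, not the paper's code).

def best_response_value(row_strategy, payoff):
    """Row player's expected payoff when the column player best-responds,
    i.e. picks the column minimizing the row player's expected payoff."""
    n_cols = len(payoff[0])
    col_values = [sum(x * payoff[i][j] for i, x in enumerate(row_strategy))
                  for j in range(n_cols)]
    return min(col_values)

def exploitability(row_strategy, payoff, game_value=0.0):
    """Gap between the game value and what the fixed strategy guarantees."""
    return game_value - best_response_value(row_strategy, payoff)

# Rock-paper-scissors (game value 0): the uniform strategy is unexploitable,
# while pure rock loses 1 per game to a paper-playing best responder.
RPS = [[0, -1, 1], [1, 0, -1], [-1, 1, 0]]
```

An online algorithm, by contrast, need not commit to a single fixed strategy across a repeated match, which is why measuring its worst case with this fixed-strategy quantity is what the paper argues against.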

Monte Carlo Continual Resolving for Online Strategy Computation in Imperfect Information Games

  • Department: Katedra počítačů, Centrum umělé inteligence
  • Abstract:
    Online game playing algorithms produce high-quality strategies with a fraction of the memory and computation required by their offline alternatives. Continual Resolving (CR) is a recent theoretically sound approach to online game playing that has been used to outperform human professionals in poker. However, parts of the algorithm were specific to poker, which enjoys many properties not shared by other imperfect information games. We present a domain-independent formulation of CR applicable to any two-player zero-sum extensive-form game (EFG). It works with an abstract resolving algorithm, which can be instantiated by various EFG solvers. We further describe and implement its Monte Carlo variant (MCCR), which uses Monte Carlo Counterfactual Regret Minimization (MCCFR) as a resolver. We prove the correctness of CR and show an O(T^(-1/2)) dependence of MCCR's exploitability on the computation time. Furthermore, we present an empirical comparison of MCCR with incremental tree building to Online Outcome Sampling and Information-set MCTS on several domains.

Responsible for this page: Ing. Mgr. Radovan Suk