삼목 게임을 위해 개선된 몬테카를로 트리탐색 알고리즘

이병두

논문 상세보기

삼목 게임을 위해 개선된 몬테카를로 트리탐색 알고리즘 KCI 등재

Enhanced strategic Monte-Carlo Tree Search algorithm to play the game of Tic-Tac-Toe

이병두

언어KOR
URLhttps://db.koreascholar.com/Article/Detail/328856

서비스가 종료되어 열람이 제한될 수 있습니다.

한국게임학회 논문지 (Journal of Korea Game Society)

제16권 제4호 (2016.08)
pp.79-86

한국게임학회 (Korea Game Society)

초록

몬테카를로 트리탐색은 최대우선탐색 알고리즘이며, 많은 게임 특히 바둑 게임에 성공적으로 적용되어 왔다. 삼목 게임에서 MCTS 간의 대국을 통해 성능을 평가하고자 했다. 첫 번째 대국 자는 항상 두 번째 대국자에 비해 압도적인 우위를 보였으며, 최선의 게임 결과가 무승부가 됨 에도 불구하고 첫 번째 대국자가 두 번째 대국자에 비해 우월한 이유를 찾고자 했다. MCTS는 반복적인 무작위 샘플링을 기반으로 하는 통계적 알고리즘이기 때문에, 특히 두 번째 대국자를 위해 전략을 요하는 시급한 문제를 적절히 대처하지 못한다. 이를 위해 전략적 MCTS(S-MCTS)를 제안하며, S-MCTS는 결코 삼목 게임에서 지지 않는다는 것을 보였다.

Monte-Carlo Tree Search(MCTS) is a best-first tree search algorithm and has been successfully applied to various games, especially to the game of Go. We evaluate the performance of MCTS playing against each other in the game of Tic-Tac-Toe. It reveals that the first player always has an overwhelming advantage to the second player; and we try to find out the reason why the first player is superior to the second player in spite of the fact that the best game result should be a draw. Since MCTS is a statistical algorithm based on the repeated random sampling, it cannot adequately tackle an urgent problem that needs a strategy, especially for the second player. For this, we propose a strategic MCTS(S-MCTS) and show that the S-MCTS player never loses a Tic-Tac-Toe game.

키워드

Go(바둑)Tic-Tac-Toe(삼목)MCTS(몬테카를로 트리탐색)S-MCTS(전략적 몬테카를로 트리탐색)Monte-Carlo Tree Search(몬테카를로 트리탐색)

저자

이병두(세한대학교 체육학부 바둑학과) | Byung-Doo Lee Corresponding Author

같은 권호 다른 논문