삼목 게임에 적용된 몬테카를로 트리탐색

이병두

논문 상세보기

삼목 게임에 적용된 몬테카를로 트리탐색 KCI 등재

Monte-Carlo Tree Search Applied to the Game of Tic-Tac-Toe

이병두

언어KOR
URLhttps://db.koreascholar.com/Article/Detail/277835

서비스가 종료되어 열람이 제한될 수 있습니다.

한국게임학회 논문지 (Journal of Korea Game Society)

제14권 제 3호 (2014.06)
pp.47-54

한국게임학회 (Korea Game Society)

초록

바둑 게임은 가장 오래된 게임 중의 하나이며 적어도 2,500년 전에 기원되었다. 게임프로그래밍에서 대부분의 성공적인 접근법은 평가함수를 활용한 게임트리 탐색을 사용하는 것이다. 그러나 컴퓨터바둑에서 그럴싸한 평가함수를 구축한다는 것은 매우 어렵다. 몬테카를로 트리탐색(MCTS)은 9줄 바둑에서 프로기사를 제압한 MoGo와 CrazyStone과 같은 강력한 컴퓨터바둑 프로그램을 만들어 내었다. 몬테카를로 트리탐색은 몬테카를로 시뮬레이션에 의해 계산된 승률을 근간으로 한다. 몬테카를로 트리탐색을 컴퓨터바둑에 구현하기에 앞서 삼목에서 최상의 첫 수로 중앙, 귀, 변의 세 수에 대한 각각의 승률을 측정하려고 했다. 실험 결과로 최상의 첫 수는 중앙이 우선하고, 다음은 귀, 마지막으로는 변이라는 사실이 밝혀졌다.

The game of Go is one of the oldest games and originated at least more than 2,500 years ago. In game programming the most successful approach is to use game tree searches using evaluation functions. However it is really difficult to construct feasible evaluation function in computer Go. Monte-Carlo Tree Search(MCTS) has created strong computer Go programs such as MoGo and CrazyStone which defeated human Go professionals played on the 9⨯9 board. MCTS is based on the winning rate estimated by Monte-Carlo simulation. Prior to implementing MCTS into computer Go, we tried to measure each winning rate of three positions, center, corner and side, in Tic-Tac-Toe playing as the best first move. The experimental result revealed that the center is the best, a corner the next and a side the last as the best first move.

키워드

컴퓨터바둑 몬테카를로 트리탐색 삼목 몬테카를로 트리탐색 computer Go Monte-Carlo Tree Search Tic-Tac-Toe MCTS

저자

이병두(세한대학교 체육학부 바둑학과) | Lee, Byung-Doo

같은 권호 다른 논문