The UCT algorithm applied to find the best first move in the game of Tic-Tac-Toe

Byung-Doo Lee; Dong-Soo Park; Young-Wook Choi


Korean title: 삼목 게임에서 최상의 첫 수를 구하기 위해 적용된 신뢰상한트리 알고리즘 (KCI listed)

The UCT algorithm applied to find the best first move in the game of Tic-Tac-Toe


Language: Korean
URL: https://db.koreascholar.com/Article/Detail/310904


Journal of Korea Game Society

Vol. 15, No. 5 (October 2015)
pp. 109-118

Korea Game Society

Abstract

The game of Go, which originated in ancient China, is regarded as one of the most difficult challenges in the field of AI. Over the past few years, top computer Go programs based on Monte Carlo Tree Search (MCTS) have, surprisingly, beaten professional players in handicap games. MCTS is an approach that simulates random sequences of legal moves until the game ends, and it has replaced the traditional knowledge-based approach. We applied the UCT algorithm, an MCTS variant, to the game of Tic-Tac-Toe to find the best first move, and compared its result with that of pure MCTS. In addition, to aid understanding of UCB, we introduced the epsilon-Greedy and UCB algorithms for solving the Multi-Armed Bandit problem and compared their performance.
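The comparison between epsilon-Greedy and UCB mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' experiment. The arm probabilities, step count, epsilon value, and the function names `pull`, `epsilon_greedy`, and `ucb1` are all assumptions chosen for illustration, and the UCB variant shown is the standard UCB1 rule (mean reward plus sqrt(2 ln t / n)), which the paper may or may not have used in this exact form.

```python
import math
import random


def pull(prob, rng):
    """Bernoulli reward: 1 with probability `prob`, otherwise 0."""
    return 1 if rng.random() < prob else 0


def epsilon_greedy(probs, steps, epsilon, rng):
    """With probability epsilon pick a random arm, else the best empirical mean."""
    n = len(probs)
    counts = [0] * n
    means = [0.0] * n
    total = 0
    for _ in range(steps):
        if rng.random() < epsilon:
            arm = rng.randrange(n)  # explore
        else:
            arm = max(range(n), key=lambda a: means[a])  # exploit
        reward = pull(probs[arm], rng)
        counts[arm] += 1
        means[arm] += (reward - means[arm]) / counts[arm]  # incremental mean
        total += reward
    return total, counts


def ucb1(probs, steps, rng):
    """UCB1: play each arm once, then maximise mean + sqrt(2 ln t / n_arm)."""
    n = len(probs)
    counts = [0] * n
    means = [0.0] * n
    total = 0
    for t in range(1, steps + 1):
        if t <= n:
            arm = t - 1  # initial round: try every arm once
        else:
            arm = max(
                range(n),
                key=lambda a: means[a] + math.sqrt(2.0 * math.log(t) / counts[a]),
            )
        reward = pull(probs[arm], rng)
        counts[arm] += 1
        means[arm] += (reward - means[arm]) / counts[arm]
        total += reward
    return total, counts


if __name__ == "__main__":
    # Hypothetical three-armed bandit; the last arm is the best one.
    arm_probs = [0.2, 0.5, 0.8]
    for name, result in [
        ("epsilon-greedy", epsilon_greedy(arm_probs, 5000, 0.1, random.Random(0))),
        ("UCB1", ucb1(arm_probs, 5000, random.Random(0))),
    ]:
        total, counts = result
        print(f"{name}: total reward = {total}, pulls per arm = {counts}")
```

UCT applies the same selection rule at every node of the search tree, treating each legal move as a bandit arm and each random playout result as its reward.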

Keywords

Go; Tic-Tac-Toe; MCTS (Monte Carlo Tree Search); UCT (Upper Confidence bounds applied to Trees); Multi-Armed Bandit; epsilon-Greedy; UCB (Upper Confidence Bound)

Authors

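Byung-Doo Lee (Division of Physical Education, Sehan University), corresponding author
Dong-Soo Park (Division of Physical Education, Sehan University)
Young-Wook Choi (Division of Physical Education, Sehan University)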
