미니맥스 알고리즘을 이용한 학습속도 개선을 위한 Q러닝

신용우

논문 상세보기

미니맥스 알고리즘을 이용한 학습속도 개선을 위한 Q러닝 KCI 등재

Q-learning to improve learning speed using Minimax algorithm

신용우

언어KOR
URLhttps://db.koreascholar.com/Article/Detail/358074

서비스가 종료되어 열람이 제한될 수 있습니다.

한국게임학회 논문지 (Journal of Korea Game Society)

제18권 제4호 (2018.08)
pp.99-106

한국게임학회 (Korea Game Society)

초록

보드게임에서는 많은 경우의 수의 말들과 많은 상태공간들을 가지고 있다. 그러므로 게임은 학습을 오래 하여야 한다. 본 논문에서는 Q러닝 알고리즘을 이용했다. 그러나 강화학습은 학습초기에 학습속도가 느려지는 단점이 있다. 그러므로 학습을 하는 동안에 같은 최선의 값이 있을 때, 게임트리를 고려한 문제영역의 지식을 활용한 휴리스틱을 사용하여 학습의 속도향상을 시도하였다. 기존 구현된 말과 개선하여 구현된 말을 비교하기 위하여 보드게임을 제작했다. 그래서 일방적으로 공격하는 말과 승부를 겨루게 하였다. 개선된 말은 게임트리를 고려하여 상대방 말을 공격하였다. 실험결과 개선하여 구현된 말이 학습속도적인 면에서 향상됨 것을 알수 있었다

Board games have many game characters and many state spaces. Therefore, games must be long learning. This paper used reinforcement learning algorithm. But, there is weakness with reinforcement learning. At the beginning of learning, reinforcement learning has the drawback of slow learning speed. Therefore, we tried to improve the learning speed by using the heuristic using the knowledge of the problem domain considering the game tree when there is the same best value during learning. In order to compare the existing character the improved one. I produced a board game. So I compete with one-sided attacking character. Improved character attacked the opponent's one considering the game tree. As a result of experiment, improved character’s capability was improved on learning speed.

키워드

Gonu game 고누게임 Minimax algorithm 미니맥스 알고리즘 Q Learning Q러닝 Learning speed 학습속도

요 약
ABSTRACT
1. 서 론
2. 관련연구
  2.1 Q러닝
  2.2 고누게임(Gonu game)
  2.3 미니맥스 알고리즘
  2.4 알파베타 가지치기
3. 지능형 보드게임의 구현
  3.1 제안하는 알고리즘
  3.2 이동생성
  3.3 평가함수
4. 실험 및 결과
  4.1 게임소개 및 실험환경
  4.2 실험결과
5. 결 론
REFERENCES

저자

신용우(동아방송예술대학교 창의융합교양학부) | YongWoo Shin (Division of Creative Convergence Education, Dong-Ah Institute of Media and Arts) Corresponding Author

같은 권호 다른 논문