Search results

2 results

2021.03. KCI-indexed. Free for authenticated subscribing institutions; paid for individual members.

Deep Q-Network를 이용한 준능동 제어알고리즘 개발

Development of Semi-Active Control Algorithm Using Deep Q-Network

김현수, 강주원

Journal of the Korean Association for Spatial Structures, Vol. 21, No. 1, pp. 79-86, Korean Association for Spatial Structures

The control performance of a smart tuned mass damper (TMD) depends mainly on its control algorithm. Many control strategies have been proposed for semi-active control devices, and machine learning has recently begun to be applied to the development of vibration control algorithms. In this study, reinforcement learning was employed to develop a semi-active control algorithm for a smart TMD built around a magnetorheological (MR) damper. An 11-story building structure with a smart TMD was selected to construct the reinforcement learning environment, and a time history analysis of this example structure under earthquake excitation was conducted within the reinforcement learning procedure. Among the various reinforcement learning algorithms, a deep Q-network (DQN) was used to build the learning agent. The command voltage sent to the MR damper is determined by the action produced by the DQN. Parametric studies on the DQN hyper-parameters were performed by numerical simulation. After training for an appropriate number of iterations with suitable hyper-parameters, a DQN model for controlling the seismic responses of the example structure with the smart TMD was obtained. The developed DQN model can effectively control the smart TMD to reduce the seismic responses of the example structure.
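As a rough illustration of how a DQN action can be turned into an MR damper command voltage, the sketch below pairs an epsilon-greedy action choice with a lookup into a discrete voltage set and shows the one-step Q-learning target. It is not the authors' implementation: the voltage levels, the function names, and the use of epsilon-greedy exploration are assumptions, since the abstract specifies none of them.

```python
import random

# Hypothetical discrete command voltages in volts; the abstract does not
# state the actual action set used for the MR damper.
VOLTAGE_LEVELS = [0.0, 1.0, 2.0, 3.0, 4.0, 5.0]


def select_action(q_values, epsilon, rng=random):
    """Epsilon-greedy choice over the Q-values predicted for one state."""
    if rng.random() < epsilon:
        # Explore: pick a random action index.
        return rng.randrange(len(q_values))
    # Exploit: pick the index with the largest Q-value.
    return max(range(len(q_values)), key=lambda i: q_values[i])


def action_to_voltage(action):
    """Map a discrete action index to the command voltage for the damper."""
    return VOLTAGE_LEVELS[action]


def td_target(reward, next_q_values, gamma, done):
    """One-step Q-learning target: r + gamma * max_a' Q(s', a')."""
    if done:
        return reward
    return reward + gamma * max(next_q_values)
```

In a setup like the one described, the Q-values would come from the trained network evaluated on the structural response at each time step, and the reward would penalize large seismic responses; both are left out here because the abstract does not define them.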

KRW 4,000

2019.09. KCI-indexed. Service discontinued (access restricted).

심층 큐 신경망을 이용한 게임 에이전트 구현

Deep Q-Network based Game Agents

한동기, 김명섭, 김재윤, 김정수

The Journal of Korea Robotics Society, Vol. 14, No. 3 (Serial No. 53), pp. 157-162, Korea Robotics Society

The video game Tetris is one of the most popular games, and it is well known that its rules can be modeled as a Markov decision process (MDP). This paper presents a deep Q-network (DQN) based game agent for Tetris. To this end, the state is defined as the captured image of the Tetris game board, and the reward is designed as a function of the lines cleared by the game agent. The action is defined as left, right, rotate, drop, and a finite number of their combinations. In addition, prioritized experience replay (PER) is employed to enhance learning performance. More than 500,000 episodes are used to train the network. The game agent uses the trained network to make decisions. The performance of the developed algorithm is validated not only in simulation but also with a real Tetris robot agent consisting of a camera, two Arduinos, four servo motors, and 3D-printed artificial fingers.
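To make the role of prioritized experience replay more concrete, here is a minimal sketch of the proportional variant: transitions are sampled with probability proportional to their priority raised to `alpha`, and importance-sampling weights correct for the resulting bias. It is a simplified illustration, not the paper's code. The class name, the default `alpha`, `beta`, and `eps` values, and the linear-time sampling (instead of the usual sum tree) are all assumptions.

```python
import random


class PrioritizedReplay:
    """Proportional prioritized replay with O(N) sampling (no sum tree)."""

    def __init__(self, capacity, alpha=0.6, eps=1e-3):
        self.capacity = capacity
        self.alpha = alpha      # how strongly priorities skew sampling
        self.eps = eps          # keeps every priority strictly positive
        self.items = []
        self.priorities = []
        self.pos = 0            # next slot to overwrite once full

    def add(self, transition):
        # New transitions get the current maximum priority so they are
        # likely to be sampled soon after insertion.
        p = max(self.priorities, default=1.0)
        if len(self.items) < self.capacity:
            self.items.append(transition)
            self.priorities.append(p)
        else:
            self.items[self.pos] = transition
            self.priorities[self.pos] = p
        self.pos = (self.pos + 1) % self.capacity

    def probabilities(self):
        # P(i) = p_i^alpha / sum_k p_k^alpha
        scaled = [p ** self.alpha for p in self.priorities]
        total = sum(scaled)
        return [s / total for s in scaled]

    def sample(self, batch_size, beta=0.4, rng=random):
        probs = self.probabilities()
        n = len(self.items)
        idx = rng.choices(range(n), weights=probs, k=batch_size)
        # Importance-sampling weights, normalized by the batch maximum.
        weights = [(n * probs[i]) ** (-beta) for i in idx]
        max_w = max(weights)
        weights = [w / max_w for w in weights]
        return idx, [self.items[i] for i in idx], weights

    def update(self, indices, td_errors):
        # After a training step, priorities follow the absolute TD error.
        for i, err in zip(indices, td_errors):
            self.priorities[i] = abs(err) + self.eps
```

A training loop would call `sample`, scale each transition's loss by its weight, and then pass the new TD errors to `update`, so that transitions the network predicts poorly are replayed more often.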