검색결과

검색조건

좁혀보기

검색필터 CLOSE

검색결과 12건

2025.06 KCI 등재 구독 인증기관 무료, 개인회원 유료

다중에이전트 강화학습 기반 복근보 자동설계 모델 개발

Development of Multi-Agent RL-Based Automated Design Model for Doubly Reinforced Beams

김현수, 신수진, 김수찬, 김태웅, 안시현

한국공간구조학회지 제25권 제2호 통권100호 pp.45-52 한국공간구조학회

Reinforcement learning (RL) is successfully applied to various engineering fields. RL is generally used for structural control cases to develop the control algorithms. On the other hand, a machine learning (ML) is adopted in various research to make automated structural design model for reinforced concrete (RC) beam members. In this case, ML models are developed to produce results that are as similar to those of training data as possible. The ML model developed in this way is difficult to produce better results than the training data. However, in reinforcement learning, an agent learns to make decisions by interacting with an environment. Therefore, the RL agent can find better design solution than the training data. In the structural design process (environment), the action of RL agent represent design variables of RC beam. Because the number of design variables of RC beam section is many, multi-agent DQN (Deep Q-Network) was used in this study to effectively find the optimal design solution. Among various versions of DQN, Double Q-Learning (DDQN) that not only improves accuracy in estimating the action-values but also improves the policy learned was used in this study. American Concrete Institute (318) was selected as the design codes for optimal structural design of RC beam and it was used to train the RL model without any hand-labeled dataset. Six agents of DDQN provides actions for beam with, beam depth, bottom rebar size, number of bottom rebar, top rebar size, and shear stirrup size, respectively. Six agents of DDQN were trained for 5,000 episodes and the performance of the multi-agent of DDQN was evaluated with 100 test design cases that is not used for training. Based on this study, it can be seen that the multi-agent RL algorithm can provide successfully structural design results of doubly reinforced beam.

4,000원

2024.12 KCI 등재 구독 인증기관 무료, 개인회원 유료

Agent-based Dispatching System for a Multi-area Manufacturing System

다중구역 제조시스템을 위한 에이전트 기반 디스패칭 시스템

Minjung Kim, Moonsoo Shin

한국산업경영시스템학회지 Vol.47 No.4 pp.12-21 한국산업경영시스템학회

Recently, in the manufacturing industry, changes in various environmental conditions and constraints appear rapidly. At this time, a dispatching system that allocates work to resources at an appropriate time plays an important role in improving the speed or quality of production. In general, a rule-based static dispatching method has been widely used. However, this static approach to a dynamic production environment with uncertainty leads to several challenges, including decreased productivity, delayed delivery, and lower operating rates, etc. Therefore, a dynamic dispatching method is needed to address these challenges. This study aims to develop a reinforcement learning-based dynamic dispatching system, in which dispatching agents learn optimal dispatching rules for given environmental states. The state space represents various information such as WIP(work-in-process) and inventory levels, order status, machine status, and process status. A dispatching agent selects an optimal dispatching rule that considers multiple objectives of minimizing total tardiness and minimizing the number of setups at the same time. In particular, this study targets a multi-area manufacturing system consisting of a flow-shop area and a cellular-shop area. Thus, in addition to the dispatching agent that manages inputs to the flow-shop, a dispatching agent that manages transfers from the flow-shop to the cellular-shop is also developed. These two agents interact closely with each other. In this study, an agent-based dispatching system is developed and the performance is verified by comparing the system proposed in this study with the existing static dispatching method.

4,000원

2024.06 KCI 등재 구독 인증기관 무료, 개인회원 유료

Improving Dynamic Missile Defense Effectiveness Using Multi-Agent Deep Q-Network Model

멀티에이전트 기반 Deep Q-Network 모델을 이용한 동적 미사일 방어효과 개선

Min Gook Kim, Dong Wook Hong, Bong Wan Choi, Ji Hoon Kyung

한국산업경영시스템학회지 Vol.47 No.2 pp.74-83 한국산업경영시스템학회

The threat of North Korea's long-range firepower is recognized as a typical asymmetric threat, and South Korea is prioritizing the development of a Korean-style missile defense system to defend against it. To address this, previous research modeled North Korean long-range artillery attacks as a Markov Decision Process (MDP) and used Approximate Dynamic Programming as an algorithm for missile defense, but due to its limitations, there is an intention to apply deep reinforcement learning techniques that incorporate deep learning. In this paper, we aim to develop a missile defense system algorithm by applying a modified DQN with multi-agent-based deep reinforcement learning techniques. Through this, we have researched to ensure an efficient missile defense system can be implemented considering the style of attacks in recent wars, such as how effectively it can respond to enemy missile attacks, and have proven that the results learned through deep reinforcement learning show superior outcomes.

4,000원

2023.12 KCI 등재 구독 인증기관·개인회원 무료

A Study on the Influence of Platform Design in Level Design by utilizing Multi-agent Reinforcement Learning

강화학습을 이용한 대전 슈팅 게임의 플랫폼 형태에 따른 레벨 디자인 영향 분석

Jun Ho KIM, Hanul Sung

한국컴퓨터게임학회 논문지 제36권 제4호 pp.236-29 한국컴퓨터게임학회

다중 에이전트 강화학습의 발전과 함께 게임 분야에서 강화학습을 레벨 디자인에 적용하려는 연구가 계속되 고 있다. 플랫폼의 형태가 레벨 디자인의 중요한 요소임에도 불구하고 지금까지의 연구들은 플레이어의 스킬 수준이나, 스킬 구성 등 플레이어의 매트릭에 초첨을 맞춰 강화학습을 활용하였다. 따라서 본 논문에서는 레 벨 디자인에 플랫폼의 형태가 사용될 수 있도록 시각 센서의 가시성과 구조물의 복잡성을 고려하여 플랫폼 이 플레이 경험에 미치는 영향을 연구한다. 이를 위해Unity ML-Agents Toolkit과MA-POCA 알고리즘, Self-play 방식을 기반으로2vs2 대전 슈팅 게임 환경을 개발하였으며 다양한 플랫폼의 형태를 구성하였다. 분석을 통해 플랫폼의 형태에 따른 가시성과 복잡성의 차이가 승률 밸런스에는 크게 영향을 미치지 않으나 전체 에피소 드 수, 무승부 비율, Elo의 증가폭에 유의미한 영향을 미치는 것을 확인했다.

2023.06 KCI 등재 구독 인증기관 무료, 개인회원 유료

다중 에이전트 강화학습을 이용한 RC보 최적설계 기술개발

Development of Optimal Design Technique of RC Beam using Multi-Agent Reinforcement Learning

강주원, 김현수

한국공간구조학회지 제23권 제2호 pp.29-36 한국공간구조학회

Reinforcement learning (RL) is widely applied to various engineering fields. Especially, RL has shown successful performance for control problems, such as vehicles, robotics, and active structural control system. However, little research on application of RL to optimal structural design has conducted to date. In this study, the possibility of application of RL to structural design of reinforced concrete (RC) beam was investigated. The example of RC beam structural design problem introduced in previous study was used for comparative study. Deep q-network (DQN) is a famous RL algorithm presenting good performance in the discrete action space and thus it was used in this study. The action of DQN agent is required to represent design variables of RC beam. However, the number of design variables of RC beam is too many to represent by the action of conventional DQN. To solve this problem, multi-agent DQN was used in this study. For more effective reinforcement learning process, DDQN (Double Q-Learning) that is an advanced version of a conventional DQN was employed. The multi-agent of DDQN was trained for optimal structural design of RC beam to satisfy American Concrete Institute (318) without any hand-labeled dataset. Five agents of DDQN provides actions for beam with, beam depth, main rebar size, number of main rebar, and shear stirrup size, respectively. Five agents of DDQN were trained for 10,000 episodes and the performance of the multi-agent of DDQN was evaluated with 100 test design cases. This study shows that the multi-agent DDQN algorithm can provide successfully structural design results of RC beam.

4,000원

2012.08 KCI 등재 구독 인증기관 무료, 개인회원 유료

유고상황 시 MatSIM을 활용한 도시부 도로네트워크 운영 분석

Application of Multi-Agent Transport Simulation for Urban Road Network Operation in Incident Case

김주영, 유연승, 이승재, 허혜정, 성정곤

한국도로학회논문집 제14권 제4호 pp.163-173 한국도로학회

PURPOSES : The purpose of this study is to check the possibilities of traffic pattern analysis using MatSIM for urban road network operation in incident case. METHODS : One of the stochastic dynamic models is MatSIM. MatSIM is a transportation simulation tool based on stochastic dynamic model and activity based model. It is an open source software developed by IVT, ETH zurich, Switzerland. In MatSIM, various scenario comparison analyses are possible and analyses results are expressed using the visualizer which shows individual vehicle movements and traffic patterns. In this study, trip distribution in 24-hour, traffic volume, and travel speed using MatSIM are similar to those of measured values. Therefore, results of MatSIM are reasonable comparing with measured values. Traffic patterns are changed according to incident from change of individual behavior. RESULTS : The simulation results and the actual measured values are similar. The simulation results show reasonable ranges which can be used for traffic pattern analysis. CONCLUSIONS : The change of traffic pattern including trip distribution, traffic volumes and speeds according to various incident scenarios can be used for traffic control policy decision to provide effective operation of urban road network.

4,200원

2006.12 KCI 등재 구독 인증기관 무료, 개인회원 유료

컴퓨터 게임 환경에서의 효율적인 멀티 에이전트 공간탐사

Efficient Multi-Agent Exploration in Computer Game Environments

최은미, 김인철

한국컴퓨터게임학회 논문지 제9호 pp.26-36 한국컴퓨터게임학회

4,200원

2017.09 KCI 등재 서비스 종료(열람 제한)

다중 에이전트 모바일 로봇 대형제어를 위한 유한시간 슬라이딩 모드 제어기 설계

Finite-Time Sliding Mode Controller Design for Formation Control of Multi-Agent Mobile Robots

박동주, 문정환, 한성익

로봇학회논문지 제12권 제3호 (통권 제45호) pp.339-349 한국로봇학회

In this paper, we present a finite-time sliding mode control (FSMC) with an integral finitetime sliding surface for applying the concept of graph theory to a distributed wheeled mobile robot (WMR) system. The kinematic and dynamic property of the WMR system are considered simultaneously to design a finite-time sliding mode controller. Next, consensus and formation control laws for distributed WMR systems are derived by using the graph theory. The kinematic and dynamic controllers are applied simultaneously to compensate the dynamic effect of the WMR system. Compared to the conventional sliding mode control (SMC), fast convergence is assured and the finite-time performance index is derived using extended Lyapunov function with adaptive law to describe the uncertainty. Numerical simulation results of formation control for WMR systems shows the efficacy of the proposed controller.

2007.12 KCI 등재 서비스 종료(열람 제한)

조정 에이전트를 이용한 작업 할당 최적화 기법

An Optimization Strategy of Task Allocation using Coordination Agent

백재현, 엄기현, 조경은

한국게임학회 논문지 제7권 제 4호 pp.93-104 한국게임학회

게임과 같은 실시간이며 복잡한 다중 에이전트 환경에서는 시스템의 효율성을 극대화하기 위해 반복적으로 작업 할당이 수행된다. 본 논문에서는 실시간 다중 에이전트 구조에 적합하며 최적화된 작업 할당이 가능한 방안으로 A* 알고리즘을 적용한 조정 에이전트를 제안한다. 제안하는 조정 에이전트는 수행 가능한 에이전트와 할당 가능한 작업으로 정제된 모든 에이전트와 작업의 조합으로 상태 그래프를 생성하고, A* 알고리즘을 이용한 평가함수를 적용하여 최적화된 작업 할당을 수행한다. 또한 실시간 재 할당에 따른 지연을 방지하기 위해 그리디 방식을 선택적으로 사용함으로써 재할당 요구에 대한 빠른 처리가 가능하다. 마지막으로 모의실험을 통해 조정 에이전트를 통한 최적화된 작업 할당 결과가 그리디 방식의 작업 할당보다 성능이 25%향상되었음을 입증한다.

10.

2007.09 KCI 등재 서비스 종료(열람 제한)

확장충돌맵의 수학적 분석을 이용한 다개체의 충돌탐지

Conflict Detection for Multi-agent Motion Planning using Mathematical Analysis of Extended Collision Map

윤영환, 최정식, 이범희

로봇학회논문지 제2권 제3호 pp.234-241 한국로봇학회

Effective tools which can alleviate the complexity and computational load problem in collision-free motion planning for multi-agent system have steadily been demanded in robotics field. To reduce the complexity, the extended collision map (ECM) which adopts decoupled approach and prioritization is already proposed. In ECM, the collision regions which represent the potential collision of robots are calculated using the computational power; the complexity problem is not resolved completely. In this paper, we propose a mathematical analysis of the extended collision map; as a result, we formulate the collision region as an equation with 5–8 variables. For mathematical analysis, we introduce realistic assumptions as follows; the paths of robots can be approximated to a straight line or an arc and the robots move with uniform velocity or constant acceleration near the intersection between paths. Our result reduces the computational complexity in comparison with the previous result without losing optimality, because we use simple but exact equations of the collision regions. This result can be widely applicable to coordinated multi-agent motion planning.

11.

2006.09 KCI 등재 서비스 종료(열람 제한)

우선 순위 기반 쌍방향 다개체 동작 계획 방법

A Priority-based Interactive Approach to Multi-agent Motion Planning

지상훈, 정연수, 이범희

로봇학회논문지 제1권 제1호 pp.46-57 한국로봇학회

It is well known that mathematical solutions for multi-agent planning problems are very difficult to obtain due to the complexity of mutual interactions among multi-agent. Most of the past research results thus are based on the probabilistic completeness. However, the practicality and effectiveness of the solution from the probabilistic completeness is significantly reduced by heavy computational burden. In this paper, we propose a practically applicable solution technique for multi-agent planning problems, which assures a reasonable computation time and a real world application for more than 3 multi-agents for the case of general shaped paths in agent movement. First, to reduce the computation time, a collision map is utilized for detecting potential collisions and obtaining collision-free solutions for multi-agents. Second, to minimize the maximum of multi-agent task execution time, a method is developed for selecting an optimal priority order. Simulations are finally provided for more than 20 agents to emphasize the effectiveness of the proposed interactive approach to multi-agent planning problems.

12.

2005.08 KCI 등재 서비스 종료(열람 제한)

자동화 컨테이너 터미널을 위한 멀티에이전트 기반의 운영시스템 모델링

Multi-Agent based Operation System Modeling for Automated Container Terminals

강경원, 유선영, 모수종, 임재홍

Journal of Korean Navigation and Port Reserch Vol.29 No.6 pp.567-572 한국항해항만학회

세계무역기구(WTO : World Trade Organization)를 설립된 이후 무역은 세계화가 되고, WTO에서 무역 장벽을 낮춰 국가 간의 경제 교류가 점점 증가하면서 국제적인 물류 시스템이 필요하게 되었다. 원가를 절감하기 위해 대랑 수송 수판으로 컨테이너선을 이용하면서 대형 컨테이너 선사들은 국제적인 물류 시스템의 대안으로 기업에게 화물추적 정보시스템의 제공이나 장비, 기기 관리를 위한 정보시스템 네트워크를 구축하여 자동화 시스템을 도입했다. 컨테이너 터미널 자동화를 위해 본 논문에서는 수시로 변경되는 정보를 인식하여 에이전트간의 정보교환을 위해 유동적으로 대처할 수 있는 XML(eXtensive Markup Language)과 JMS(Java Message Service)를 이용한 멀티에이전트간의 통신모델을 제안했다. 이 논문은 기존의 자동화한 컨테이너 터미널 시스템 사례와 자동화 시스템을 개발하는데 어려움, 컨테이너 터미널 시스템이 요구하는 통신과 자동화에 대하여 분석하였다.