검색결과 - koreascholar

1.

2025.06 KCI 등재 구독 인증기관 무료, 개인회원 유료

Development of Multi-Agent RL-Based Automated Design Model for Doubly Reinforced Beams

한국공간구조학회지 제25권 제2호 통권100호 pp.45-52 한국공간구조학회

Reinforcement learning (RL) is successfully applied to various engineering fields. RL is generally used for structural control cases to develop the control algorithms. On the other hand, a machine learning (ML) is adopted in various research to make automated structural design model for reinforced concrete (RC) beam members. In this case, ML models are developed to produce results that are as similar to those of training data as possible. The ML model developed in this way is difficult to produce better results than the training data. However, in reinforcement learning, an agent learns to make decisions by interacting with an environment. Therefore, the RL agent can find better design solution than the training data. In the structural design process (environment), the action of RL agent represent design variables of RC beam. Because the number of design variables of RC beam section is many, multi-agent DQN (Deep Q-Network) was used in this study to effectively find the optimal design solution. Among various versions of DQN, Double Q-Learning (DDQN) that not only improves accuracy in estimating the action-values but also improves the policy learned was used in this study. American Concrete Institute (318) was selected as the design codes for optimal structural design of RC beam and it was used to train the RL model without any hand-labeled dataset. Six agents of DDQN provides actions for beam with, beam depth, bottom rebar size, number of bottom rebar, top rebar size, and shear stirrup size, respectively. Six agents of DDQN were trained for 5,000 episodes and the performance of the multi-agent of DDQN was evaluated with 100 test design cases that is not used for training. Based on this study, it can be seen that the multi-agent RL algorithm can provide successfully structural design results of doubly reinforced beam.

4,000원

2.

2025.05 구독 인증기관·개인회원 무료

볼트 제조라인에 대한 강화학습 기반 스케줄링 사례 연구

김유진, 이채민, 장혜린, 하수영, 이동호

한국산업경영시스템학회 학술대회 우리나라 방위산업의 미래 p.313 한국산업경영시스템학회

3.

2025.05 구독 인증기관 무료, 개인회원 유료

셋업 시간 및 납기 지연 최소화를 위한 강화학습 기반 동적 스케줄링 모델

김란, 신문수

한국산업경영시스템학회 학술대회 우리나라 방위산업의 미래 pp.38-46 한국산업경영시스템학회

4,000원

4.

2024.11 구독 인증기관·개인회원 무료

강화학습 기반 코일 단위 작업 스케줄링

박형준, 위현곤

한국산업경영시스템학회 학술대회 에너지 산업의 현황과 전망 p.284 한국산업경영시스템학회

5.

2024.11 구독 인증기관·개인회원 무료

한밭 스마트팩토리 테스트베드를 위한 강화학습 기반 동적 라우팅

김란, 백종호, 신문수

한국산업경영시스템학회 학술대회 에너지 산업의 현황과 전망 p.273 한국산업경영시스템학회

6.

2024.11 구독 인증기관 무료, 개인회원 유료

다중 에이전트 강화학습 기반 공급망 재고 관리 및 배송 최적화

박형준, 최동구, 민대기

한국산업경영시스템학회 학술대회 에너지 산업의 현황과 전망 pp.193-204 한국산업경영시스템학회

4,300원

7.

2024.11 구독 인증기관 무료, 개인회원 유료

강화학습 기반 반도체 소자 배치 및 회로 배선 최적화

박형준, 최동구

한국산업경영시스템학회 학술대회 에너지 산업의 현황과 전망 pp.169-180 한국산업경영시스템학회

4,300원

8.

2024.10 KCI 등재 구독 인증기관 무료, 개인회원 유료

다양한 상황 최적 전이기법 패턴 점검 및 강화학습 기반 최적 전이시간 선택모형 연구

Reinforcement Learning and Selection Patterns for Optimal Traffic Signal Transition Techniques

조용빈, 김진태

한국도로학회논문집 제26권 제5호 통권127호 pp.113-122 한국도로학회

PURPOSES : In this study, the existence of an optimal pattern among transition methods applied during changes in traffic signal timing was investigated. We aimed to develop this pattern into an artificial intelligence reinforcement-learning model to assess its effectiveness METHODS : By developing various traffic signal transition scenarios and considering 19 different traffic signal transition situations that can be applied to these scenarios, a simulation analysis was performed to identify patterns through statistical analysis. Subsequently, a reinforcement-learning model was developed to select an optimal transition time model suitable for various traffic conditions. This model was then tested by simulating a virtual experimental center environment and conducting performance comparison evaluations on a daily basis. RESULTS : The results indicated that when the change in the traffic signal cycle length was less than 50% in the negative direction, the subtraction method was efficient. In cases where the transition was less than 15% in the positive direction, the proposed center method for traffic signal transition was found to be advantageous. By applying the proposed optimal transition model selection, we observed that the transition time decreased by approximately 70%. CONCLUSIONS : The findings of this study provide guidance for the next level of traffic signal transitions. The importance of traffic signal transition will increase in future AI-based traffic signal control methods, requiring ongoing research in this field.

4,000원

9.

2024.06 KCI 등재 구독 인증기관 무료, 개인회원 유료

A Reinforcement Learning Model for Dispatching System through Agent-based Simulation

에이전트 기반 시뮬레이션을 통한 디스패칭 시스템의 강화학습 모델

Minjung Kim, Moonsoo Shin

한국산업경영시스템학회지 Vol.47 No.2 pp.116-123 한국산업경영시스템학회

In the manufacturing industry, dispatching systems play a crucial role in enhancing production efficiency and optimizing production volume. However, in dynamic production environments, conventional static dispatching methods struggle to adapt to various environmental conditions and constraints, leading to problems such as reduced production volume, delays, and resource wastage. Therefore, there is a need for dynamic dispatching methods that can quickly adapt to changes in the environment. In this study, we aim to develop an agent-based model that considers dynamic situations through interaction between agents. Additionally, we intend to utilize the Q-learning algorithm, which possesses the characteristics of temporal difference (TD) learning, to automatically update and adapt to dynamic situations. This means that Q-learning can effectively consider dynamic environments by sensitively responding to changes in the state space and selecting optimal dispatching rules accordingly. The state space includes information such as inventory and work-in-process levels, order fulfilment status, and machine status, which are used to select the optimal dispatching rules. Furthermore, we aim to minimize total tardiness and the number of setup changes using reinforcement learning. Finally, we will develop a dynamic dispatching system using Q-learning and compare its performance with conventional static dispatching methods.

4,000원

10.

2024.06 구독 인증기관·개인회원 무료

자율운항선박의 지역경로 생성을 위한 역강화학습 시나리오 구현

Implementation of Inverse Reinforcement Learning Scenarios for Local Path Planning of Autonomous Ships

이재용, 남궁호, 김주성, 이진석, 장다운

해양환경안전학회 학술대회 논문집 2024년도 춘계학술발표회 p.42 해양환경안전학회

11.

2024.05 구독 인증기관·개인회원 무료

강화학습을 이용한 한밭 스마트팩토리 내 팔레트의 이동 경로 최적화

백종호, 김란, 홍시원, 백수정

한국산업경영시스템학회 학술대회 디지털 전환시대의 중소기업 혁신전략 p.452 한국산업경영시스템학회

12.

2024.05 구독 인증기관 무료, 개인회원 유료

산업 최적화 문제의 정책 구조를 활용하는 강화학습 기법

박형준, 최동구, 민대기

한국산업경영시스템학회 학술대회 디지털 전환시대의 중소기업 혁신전략 pp.375-391 한국산업경영시스템학회

5,100원

13.

2024.05 구독 인증기관 무료, 개인회원 유료

에이전트 기반 시뮬레이션을 통한 디스패칭 시스템의 강화학습 모델

A Reinforcement Learning Model for Dispatching System through Agent-based Simulation

김민정, 신문수

한국산업경영시스템학회 학술대회 디지털 전환시대의 중소기업 혁신전략 pp.189-197 한국산업경영시스템학회

4,000원

14.

2024.05 구독 인증기관 무료, 개인회원 유료

심층 강화학습을 이용한 미사일 방어체계 연구 - 멀테이에전트 기반의 개선된 DQN을 활용하여 -

Research on Missile Defense Systems Using Deep Reinforcement Learning

김민국, 최봉완, 경지훈

한국산업경영시스템학회 학술대회 디지털 전환시대의 중소기업 혁신전략 pp.34-40 한국산업경영시스템학회

4,000원

15.

2023.12 KCI 등재 구독 인증기관 무료, 개인회원 유료

이종 병렬설비에서 총납기지연 최소화를 위한 강화학습 기반 일정계획 알고리즘

Scheduling Algorithm, Based on Reinforcement Learning for Minimizing Total Tardiness in Unrelated Parallel Machines

이태희, 김재곤, 유우식

대한안전경영과학회지 제25권 제4호 pp.131-140 대한안전경영과학회

This paper proposes an algorithm for the Unrelated Parallel Machine Scheduling Problem(UPMSP) without setup times, aiming to minimize total tardiness. As an NP-hard problem, the UPMSP is hard to get an optimal solution. Consequently, practical scenarios are solved by relying on operator's experiences or simple heuristic approaches. The proposed algorithm has adapted two methods: a policy network method, based on Transformer to compute the correlation between individual jobs and machines, and another method to train the network with a reinforcement learning algorithm based on the REINFORCE with Baseline algorithm. The proposed algorithm was evaluated on randomly generated problems and the results were compared with those obtained using CPLEX, as well as three scheduling algorithms. This paper confirms that the proposed algorithm outperforms the comparison algorithms, as evidenced by the test results.

4,000원

16.

2023.12 구독 인증기관·개인회원 무료

픽업 대기 시간의 감소를 위한 강화 학습 기반의 승차 공유 서비스의 매칭 및 요금 책정 알고리즘

Matching and Pricing Algorithm in a Ride-hailing Service Based on Reinforcement Learning to Reduce Pickup Waiting Time

정다운

한국도로학회지 제25권 제4호 pp.117-118 한국도로학회

17.

2023.12 구독 인증기관·개인회원 무료

강화학습을 활용한 도시부 도로의 Speed Zoning Model 개발 연구

Development of a Speed Zoning Model on Urban Roads Using Reinforcement Learning

강민지

한국도로학회지 제25권 제4호 pp.115-116 한국도로학회

18.

2023.12 KCI 등재 구독 인증기관·개인회원 무료

A Study on the Influence of Platform Design in Level Design by utilizing Multi-agent Reinforcement Learning

강화학습을 이용한 대전 슈팅 게임의 플랫폼 형태에 따른 레벨 디자인 영향 분석

Jun Ho KIM, Hanul Sung

한국컴퓨터게임학회 논문지 제36권 제4호 pp.236-29 한국컴퓨터게임학회

다중 에이전트 강화학습의 발전과 함께 게임 분야에서 강화학습을 레벨 디자인에 적용하려는 연구가 계속되 고 있다. 플랫폼의 형태가 레벨 디자인의 중요한 요소임에도 불구하고 지금까지의 연구들은 플레이어의 스킬 수준이나, 스킬 구성 등 플레이어의 매트릭에 초첨을 맞춰 강화학습을 활용하였다. 따라서 본 논문에서는 레 벨 디자인에 플랫폼의 형태가 사용될 수 있도록 시각 센서의 가시성과 구조물의 복잡성을 고려하여 플랫폼 이 플레이 경험에 미치는 영향을 연구한다. 이를 위해Unity ML-Agents Toolkit과MA-POCA 알고리즘, Self-play 방식을 기반으로2vs2 대전 슈팅 게임 환경을 개발하였으며 다양한 플랫폼의 형태를 구성하였다. 분석을 통해 플랫폼의 형태에 따른 가시성과 복잡성의 차이가 승률 밸런스에는 크게 영향을 미치지 않으나 전체 에피소 드 수, 무승부 비율, Elo의 증가폭에 유의미한 영향을 미치는 것을 확인했다.

19.

2023.12 KCI 등재 구독 인증기관 무료, 개인회원 유료

A Player-Like StarCraft II AI for Enhanced Fun and Diversity using Reinforcement Learning

향상된 유닛 생산과 게임 플레이 경험을 위한 스타크래프트 II 강화학습 AI

Kyo Seoung KOO, Hanul Sung

한국컴퓨터게임학회 논문지 제36권 제4호 pp.31-36 한국컴퓨터게임학회

기존의 스타크래프트II 내장AI는 미리 정의한 행동 패턴을 따르기 때문에 사용자가 전략을 쉽게 파악할 수 있어 사용자의 흥미를 오랫동안 유지시키기 힘들다. 이를 해결하기 위해, 많은 강화학습 기반의 스타크래프 트II AI 연구가 진행되었다. 그러나 기존의 강화학습AI는 승률에만 중점을 두고 에이전트를 학습시킴으로써 소수의 유닛을 사용하거나 정형화 된 전략만을 사용하여 여전히 사용자들이 게임의 재미를 느끼기에 한계가 존재한다. 본 논문에서는 게임의 재미를 향상시키기 위하여, 강화학습을 활용하여 실제 플레이어와 유사한 AI을 제안한다. 에이전트에게 스타크래프트II의 상성표를 학습시키고, 정찰한 정보로 보상을 부여해 유동적 으로 전략을 변경하도록 한다. 실험 결과, 사용자가 느끼는 재미와 난이도, 유사도 부분에서 고정된 전략을 사용하는 에이전트보다 본 논문에서 제안하는 에이전트가 더 높은 평가를 받았다..

4,000원

20.

2023.11 구독 인증기관·개인회원 무료

강화학습기반 인명안전피난 알고리즘 개발 가능성

Possibility of developing a reinforcement learning-based human safety evacuation algorithm

황광일, 김별

해양환경안전학회 학술대회 논문집 2023년도 추계학술발표회 p.82 해양환경안전학회