Search results

15 results

2025.04 KCI-listed; free for subscribing institutions, paid for individual members

A Study on Frost/Fog-Induced Black Ice Prediction and Contributing Atmospheric Factors Using Explainable Machine Learning Models: A Focus on Random Forest and XGBoost

설명 가능 인공지능 모델을 이용한 서리/안개 블랙아이스 예측 및 블랙아이스 영향 대기기상 요소 분석(랜덤 포레스트 및 XGBoost 모형을 중심으로)

Jang Jinhwan

한국도로학회논문집, Vol. 27, No. 2 (Serial No. 130), pp. 1-11, 한국도로학회

Given the hazards posed by black ice, it is crucial to investigate the conditions that contribute to its formation. Two ensemble machine learning algorithms, Random Forest (RF) and Extreme Gradient Boosting (XGBoost), were employed to forecast the occurrence of black ice using atmospheric data. Additionally, explainable artificial intelligence techniques, including Feature Importance (FI) and Partial Dependence Plot (PDP), were utilized to identify atmospheric conditions that significantly increase the likelihood of black ice formation. The machine learning algorithms achieved a forecasting accuracy of 90%, demonstrating reliable performance. FI analysis revealed distinct key predictors between the algorithms: relative humidity was the most critical for RF, whereas wind speed was paramount for XGBoost. The PDP analysis identified the specific atmospheric conditions under which black ice was likely to form. This study provides detailed insights into the atmospheric precursors of frost/fog-induced black ice formation. These findings enable road managers to implement proactive winter road maintenance strategies, such as optimizing anti-icing patrol routes and displaying warnings on various message signs, thereby enhancing road safety.

4,200 KRW
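
The partial dependence analysis named in the abstract above can be illustrated with a minimal sketch. This is not the authors' code: `toy_predict`, the feature names, and every number below are hypothetical stand-ins for a fitted RF or XGBoost model and real atmospheric observations. The idea is to force one feature to each grid value in every row, leave the other features as observed, and average the model's predictions.

```python
from statistics import mean


def partial_dependence(predict, rows, feature, grid):
    """Average prediction when `feature` is forced to each grid value."""
    curve = []
    for value in grid:
        # Overwrite the chosen feature in every row; keep the others as observed.
        modified = [{**row, feature: value} for row in rows]
        curve.append(mean(predict(r) for r in modified))
    return curve


# Hypothetical stand-in for a fitted classifier: a made-up black-ice
# probability that rises with humidity and falls with wind speed.
def toy_predict(row):
    score = 0.01 * row["humidity"] - 0.05 * row["wind_speed"]
    return min(1.0, max(0.0, score))


# Made-up observations (relative humidity in %, wind speed in m/s).
rows = [
    {"humidity": 60, "wind_speed": 2.0},
    {"humidity": 80, "wind_speed": 1.0},
    {"humidity": 95, "wind_speed": 0.5},
]
grid = [50, 70, 90]
pdp = partial_dependence(toy_predict, rows, "humidity", grid)
```

Plotting `pdp` against `grid` gives the partial dependence curve for humidity; with this toy model the curve rises with humidity by construction.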

2025.04 Free for subscribing institutions and individual members

Frankliniella occidentalis monitoring using Random forest algorithm

Taechul Park, SoEun Eom, Jung-Joon Park

한국응용곤충학회 Conference Proceedings, 2025 Joint Spring Conference of 한국곤충학회 and 한국응용곤충학회, p. 138, 한국응용곤충학회

2025.03 Free for subscribing institutions and individual members

Random Forest Classification Model 기반 국내 결빙사고 위험구간 추정

Estimation of Black Ice Accident-Prone Sections in South Korea Using a Random Forest Classification Model

김세호, 진현호, 손석철, 송기영, 박주연, 정진훈

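한국도로학회 Conference Abstracts, 2025 한국도로학회 Spring Conference Abstracts, pp. 79-80, 한국도로학회

Black ice is a thin film of ice, similar in color to the road surface, that forms when moisture or snow seeps into cracks and other defects in the pavement surface and combines with tire dust and exhaust gases from passing vehicles (Cho et al., 2021). When the road surface is icy, the mean skid resistance coefficient drops sharply to about 30% of that of a dry surface (Lee et al., 2024). Moreover, because black ice resembles the road surface in color, drivers have difficulty recognizing the surface condition immediately and therefore have little time to brake or take evasive action. The fatality rate (deaths per 100 crashes) for crashes on frosty or icy road surfaces over the past five years was 2.69, roughly twice that of crashes on dry surfaces and 1.3 times that of crashes on wet surfaces (KoROAD, 2024). In view of this risk, the Ministry of Land, Infrastructure and Transport designated 403 sections of national expressways and general and delegated national highways as icing-vulnerable sections in 2020, later expanded the list to 464 sites, and has been managing icing crashes intensively by adding safety facilities such as automatic brine-spraying systems, grooving, and icing warning signs (MOLIT, 2020; BAI, 2021). Nevertheless, the number of icing crashes was 524 in 2020, 1,204 in 2021, and 1,042 in 2022, showing an increasing trend, which has raised the need to review the adequacy and effectiveness of the evaluation of icing-vulnerable sections (KoROAD, 2024). In this study, icing crashes on national expressways over the past 10 years and their contributing factors were analyzed with the Random Forest algorithm to assess the icing-crash risk of each road section. Based on the node-link system of the National Transport Information Center, contributing factors such as winter weather, road geometry, and traffic volume were collected for each expressway section nationwide. Each section was classified as a crash section or a non-crash section using the 10-year icing-crash data. A training dataset was built with the factors collected for each section as independent variables and crash occurrence as the dependent variable, and SMOTE (Synthetic Minority Oversampling Technique), an oversampling technique, was applied to address the class imbalance. Finally, a Random Forest classification model was trained, and hyperparameter tuning was used to select the model with the best predictive performance for crash sections. By assessing the section-level icing-crash risk of national expressways and analyzing the variable importance of each contributing factor, the study is expected to improve the reliability of the method for evaluating icing-vulnerable sections.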
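
The SMOTE step described in this abstract can be sketched as follows. This is a simplified, pure-Python illustration of the interpolation idea only, not the authors' pipeline and not the implementation in any particular library; the function name `smote`, the two features, and the sample values are all hypothetical. Each synthetic point is placed at a random position on the segment between a minority-class sample and one of its k nearest minority-class neighbours.

```python
import math
import random


def smote(minority, n_new, k=2, seed=0):
    """Create n_new synthetic points by interpolating between a minority
    sample and one of its k nearest minority-class neighbours."""
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # k nearest neighbours of `base` among the other minority samples.
        others = [p for p in minority if p is not base]
        neighbours = sorted(others, key=lambda p: math.dist(base, p))[:k]
        neighbour = rng.choice(neighbours)
        gap = rng.random()  # position along the segment, in [0, 1)
        synthetic.append(
            tuple(b + gap * (n - b) for b, n in zip(base, neighbour))
        )
    return synthetic


# Hypothetical minority class: (road gradient in %, winter mean temperature
# in deg C) for sections with a recorded icing crash. Values are made up.
crash_sections = [(1.0, -2.0), (1.5, -3.0), (2.0, -2.5), (3.0, -4.0)]
new_points = smote(crash_sections, n_new=6)
```

Because every synthetic point lies between two existing minority samples, oversampling of this kind is normally applied to the training split only, so that no interpolated copies of test data leak into model fitting.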

2025.02 KCI-listed; free for subscribing institutions, paid for individual members

Random Forest 모형과 SHAP 기법을 활용한 개인형 이동장치 교통사고 유형별 사고심각도 영향요인 분석

Analysis of Factors Influencing the Severity of Traffic Crashes by Type of Traffic Crashes on Personal Mobility Using Random Forest Model and SHAP Technique

양윤철, 박수진, 이동윤, 정경옥

한국도로학회논문집, Vol. 27, No. 1 (Serial No. 129), pp. 89-99, 한국도로학회

In this study, we aim to classify personal mobility (PM)-related traffic crash data into four categories: PM-to-vehicle, PM-to-pedestrian, PM-single, and vehicle-to-PM crashes, and analyze the factors influencing the severity of each crash type. To overcome the limitations of existing studies in explaining the impact of independent variables on ordinal dependent variables, a random forest model was combined with the Shapley additive explanation technique. This approach visualizes the influence of independent variables on a dependent variable, providing clearer insights and enhancing interpretability. The analysis of PM traffic accidents, categorized into at-fault, single-vehicle, and victim accidents, revealed distinct key factors for each type. The main contributors to the severity of crashes caused by PM are traffic violations by teenagers and collisions with elderly pedestrians. Single-vehicle accidents were predominantly caused by overturn incidents, with inadequate driving skills among PM users aged 40 years and older significantly increasing severity. Victim accidents primarily occur at intersections, where the behavior of the at-fault driver and the age of the PM user are critical factors influencing the severity. We identified various factors influencing the severity of PM crashes by type, highlighting the need for tailored policy measures. Proposed policies include physically separating bicycle–pedestrian shared spaces and strictly regulating illegal PM sidewalk riding, introducing PM licenses for teenagers to ensure compliance with traffic rules, and implementing regular safety education programs for all age groups. Although this study applied a new analytical technique, it relied on limited crash data, thus limiting the results to estimates.

4,200 KRW

2024.10 Free for subscribing institutions and individual members

Random Forest 알고리즘을 통한 국내 결빙사고 영향인자 상관성 분석

Correlation Analysis of Factors Influencing Black-Ice Accidents in Korea Using the Random Forest Algorithm

김세호, 김효원, 진현호, 최문규, 손석철, 송기영, 정진훈

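한국도로학회 Conference Abstracts, 2024 24th 한국도로학회 Conference Abstracts, pp. 111-112, 한국도로학회

In December 2019, a chain-reaction collision caused by road surface icing on the upbound lanes of the Sangju-Yeongcheon Expressway left 48 people dead or injured. In response, the Ministry of Land, Infrastructure and Transport established selection criteria for icing-vulnerable sections in January 2020, designated 403 such sections, and planned icing-crash prevention projects for them with a budget of KRW 169.9 billion through 2022 (BAI, 2021). However, the selection criteria were never reviewed for adequacy, so their reliability and effectiveness have not been sufficiently verified. In this study, characteristic information (facilities, alignment, weather, traffic, etc.) for national expressways and general national highways nationwide was built as GIS (Geographic Information System) data on the basis of the node-link system of the National Transport Information Center. Road sections (links) with a record of icing crashes in the past five years were identified, and the feature importance of the road characteristics with respect to icing crashes was analyzed with the Random Forest algorithm. The relationship between icing crashes and each factor identified in this way is used to revise and supplement the item scores in the "Detailed Scoring Table for Evaluating Icing-Vulnerable Sections", thereby improving the reliability of the evaluation table.

2024.10 Free for subscribing institutions and individual members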

Bemisia tabaci monitoring using random forest algorithm

Taechul Park, SoEun Eom, Ji-won Jeong, Jung-Joon Park

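한국응용곤충학회 Conference Proceedings, 한국응용곤충학회 Fall Conference Proceedings, p. 44, 한국응용곤충학회

2022.06 Free for subscribing institutions and individual members

Predicting Movie Audience Numbers During COVID-19 Using LSTM and Random Forest (COVID-19 상황에서 LSTM 기법과 Random Forest 기법을 활용한 영화 관객수 예측)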

이소현, 손영훈, 최재용, 장영관

한국산업경영시스템학회 Conference, 2022 한국산업경영시스템학회 Spring Conference, p. 903, 한국산업경영시스템학회

2022.04 KCI-listed; free for subscribing institutions, paid for individual members

로지스틱 회귀분석 방법과 랜덤포레스트 방법을 활용한 대학생의 소속 학과 만족도에 대한 영향 요인 분석

Analysis of factors influencing college students' satisfaction with their departments using logistic regression analysis method and random forest method.

하충원, 이승희

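미래교육연구, Vol. 12, No. 2, pp. 1-22, 한국미래교육학회

The purpose of this study is to use machine learning methods to identify the main factors affecting college students' satisfaction with their departments, and thereby to provide baseline research material for establishing policies and systems on career guidance and dropout prevention. To this end, 1,298 students who entered four-year universities in the Korean Education and Employment Panel (KEEP) data were analyzed using logistic regression and random forest. The main results are as follows. First, in the year of university entrance, explanatory variables related to the high school years and to career plans after high school graduation, in addition to variables on university life, accounted for a considerable share of the top 10 variables by importance; in the years other than the entrance and graduation years, variables on major study and career activities ranked high; and in the graduation year, job preparation and education and training experience recorded high importance in both the logistic regression and the random forest results. Second, the agreement between the two methods in the top 10 variables by importance for each academic year was 63.3%. Third, unlike logistic regression, the random forest analysis more often included in its top 10 explanatory variables those that respondents had answered on multi-point scales. The significance of this study is that it analyzed education panel data with two machine learning methods rather than one, derived the common factors, and compared the results.

5,800 KRW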
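
One simple way an agreement figure between two top-10 lists could be computed is the share of variables that appear in both lists. The abstract does not state the exact formula used, so the function below is an assumption, and the variable names are hypothetical.

```python
def top_k_agreement(ranking_a, ranking_b, k=10):
    """Share of the top-k variables that appear in both rankings."""
    shared = set(ranking_a[:k]) & set(ranking_b[:k])
    return len(shared) / k


# Hypothetical variable names, ordered from most to least important.
logit_rank = ["major_fit", "career_plan", "gpa", "club", "tuition"]
forest_rank = ["career_plan", "gpa", "commute", "major_fit", "part_time"]

# Three of the five top variables are shared, so the agreement is 0.6.
agreement = top_k_agreement(logit_rank, forest_rank, k=5)
```

Under this definition the order within the top k does not matter, only membership, which suits a comparison between methods whose importance scores are on different scales.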

2021.12 KCI-listed; free for subscribing institutions, paid for individual members

대청호 Chl-a 예측을 위한 random forest와 gradient boosting 알고리즘 적용 연구

A study on applying random forest and gradient boosting algorithm for Chl-a prediction of Daecheong lake

이상민, 김일규

상하수도학회지, Vol. 35, No. 6, pp. 507-516, 대한상하수도학회

This study applied machine learning, which has recently been widely used for prediction. The study site was the Chudong (CD) point, a representative point of Daecheong Lake. Chlorophyll-a (Chl-a) concentration was used as the target variable for algae prediction. To predict the Chl-a concentration, a dataset of water quality and quantity factors was compiled, and random forest and gradient boosting algorithms were run in Python. First, the correlation between Chl-a and the water quality and quantity data was analyzed, and the ten factors of highest importance were extracted. In terms of performance indices, gradient boosting achieved an RMSE of 2.72 mg/m³, an MSE of 7.40 (mg/m³)², and an R² of 0.66. In the residual analysis, gradient boosting also gave excellent results. Overall, the gradient boosting algorithm performed well, and after hyperparameter tuning it again gave excellent results, with an RMSE of 2.44 mg/m³.

4,000 KRW
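
The three performance indices reported in this abstract are related: MSE is the square of RMSE, which is consistent with the reported pair (2.72² ≈ 7.40), and R² compares the residual sum of squares with the total variance of the observations. The sketch below computes all three in plain Python; the observed and predicted Chl-a values are made up for illustration.

```python
import math


def regression_metrics(observed, predicted):
    """Return MSE, RMSE and R^2 for paired observed/predicted values."""
    n = len(observed)
    residuals = [o - p for o, p in zip(observed, predicted)]
    ss_res = sum(r * r for r in residuals)
    mse = ss_res / n
    rmse = math.sqrt(mse)
    mean_obs = sum(observed) / n
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)
    r2 = 1.0 - ss_res / ss_tot
    return {"mse": mse, "rmse": rmse, "r2": r2}


# Made-up Chl-a concentrations in mg/m^3.
observed = [4.0, 6.0, 10.0, 12.0]
predicted = [5.0, 5.0, 9.0, 13.0]
metrics = regression_metrics(observed, predicted)
```

Note that RMSE carries the unit of the target (mg/m³), whereas MSE carries its square.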

10.

2021.12 KCI-listed; free for subscribing institutions, paid for individual members

Identifying the Expression Patterns of Depression Based on the Random Forest

랜덤 포레스트 기반 우울증 발현 패턴 도출

Hyeon Jin Jeon, Chang-Ho Jihn

한국산업경영시스템학회지 Vol. 44 No. 4 pp.53-64 한국산업경영시스템학회

Depression is one of the most important psychiatric disorders worldwide. Most depression-related data mining and machine learning studies have been conducted to predict the presence of depression or to derive individual risk factors. However, since depression is caused by a combination of various factors, it is necessary to identify the complex relationship between the factors in order to establish effective anti-depression and management measures. In this study, we propose a methodology for identifying and interpreting patterns of depression expressions using the method of deriving random forest rules, where the random forest rule consists of the condition for the manifestation of the depressive pattern and the prediction result of depression when the condition is met. The analysis was carried out by subdividing into 4 groups in consideration of the different depressive patterns according to gender and age. Depression rules derived by the proposed methodology were validated by comparing them with the results of previous studies. Also, through the AUC comparison test, the depression diagnosis performance of the derived rules was evaluated, and it was not different from the performance of the existing PHQ-9 summing method. The significance of this study can be found in that it enabled the interpretation of the complex relationship between depressive factors beyond the existing studies that focused on prediction and deduction of major factors.

4,300 KRW

11.

2019.12 KCI-listed; free for subscribing institutions, paid for individual members

랜덤포레스트를 이용한 낙엽송과 편백의 적지적수도 제작: 경상남도를 대상으로

Mapping Species-Specific Optimal Plantation Sites Using Random Forest in Gyeongsangnam-do Province, South Korea

박은정, 박준형, 김형호

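농업생명과학연구, Vol. 53, No. 6, pp. 65-74, 경상국립대학교 농업생명과학연구원

The purpose of this study is to examine the applicability of the random forest technique, which has recently been used for classification and prediction, to matching tree species to suitable sites. That is, it introduces the random forest technique for judging suitable planting sites by species and assesses its applicability by producing site-suitability maps. The prediction accuracy of the random forest technique was relatively high, at 89.29% for Japanese larch and 73.89% for hinoki cypress. For both species, variable importance was highest for elevation, slope, and aspect, in that order, while topography, soil texture, and soil type had low influence. In the resulting maps, most of Gyeongsangnam-do except the central part was classified as feasible or suitable for Japanese larch, whereas the northeastern part of the province was suitable for hinoki cypress. The random forest technique is considered highly promising not only for site-suitability mapping but also for the various classification and prediction studies conducted in forestry.

4,000 KRW

12.

2019.10 Free for subscribing institutions and individual members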

Development of a RIVPACS-type model in Korean streams based on the Random Forest model

Da-Yeong Lee, Young-Seuk Park

한국응용곤충학회 Conference Proceedings, 2019 Fall International Conference of KSAE, p. 88, 한국응용곤충학회

13.

2019.04 Free for subscribing institutions and individual members

Improvement of Random-Forest OBIA Algorithm for Tree Anomaly Detection in UAV Imagery: Focused on the Birobong-Peak Area of Sobaeksan National Park

유병혁, 박홍철, 이승민

한국환경생태학회 Conference Proceedings, Vol. 29, No. 1, p. 54, 한국환경생태학회

14.

2014.04 Free for subscribing institutions and individual members

Hazard Rating of Coastal Disaster Prevention Pine Forests for a Black Pine Bast Scale Through Self-Organizing Map (SOM) and Random Forest Approaches

Youngwoo Nam, Sang-Hyun Koh, Sung-Jae Jeon, Ho-Joong Youn, Young-Seuk Park, Won Il Choi

한국응용곤충학회 Conference Proceedings, 곤충학의 창조적 융합 (Entomology-Creative Convergence), p. 208, 한국응용곤충학회

This study examined the effects of environmental factors on the abundance of black pine bast scale (BPBS), Matsucoccus thunbergianae Miller and Park, in coastal disaster prevention forest stands composed mostly of Japanese black pine. Geographical factors, soil conditions and forest stand conditions were measured to evaluate the hazard rating for the occurrence of BPBS from 35 plots in the coastal forest stands. To assess the hazard rating, a combination of a self-organizing map (SOM), which classified the samples according to their characteristics, and a random forest model, which predicted the probability of the occurrence of BPBS from SOM results, was used in this study. Our results showed that major factors determining the abundance of BPBS were climate, tree size, and tree health. BPBS was more common in low latitude coastal forests, suggesting that warmer conditions were favorable to BPBS population buildup. Tree size also influenced the abundance of BPBS, which was higher in forests composed of larger trees (greater DBH). Finally, BPBS was also more abundant in areas with high soil salinity and clay-loam soil, and north-facing slopes where tree vigor was lower.

15.

2018.10 KCI-listed; service ended (access restricted)

Assessment of climate change impact on aquatic ecology health indices in Han river basin using SWAT and random forest

SWAT 및 random forest를 이용한 기후변화에 따른 한강유역의 수생태계 건강성 지수 영향 평가

Woo So Young, Jung Chung Gil, Kim Jin Uk, Kim Seong Joon

한국수자원학회 논문집 Vol. 51 No. 10 pp.863-874 한국수자원학회

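In this study, the SWAT model and random forest were used to evaluate aquatic ecosystem health in the Han River basin (34,148 km²) under future climate change. The Trophic Diatom Index (TDI), Benthic Macroinvertebrate Index (BMI), and Fish Assessment Index (FAI), monitored by the National Institute of Environmental Research in spring (April to June) over eight years (2008 to 2015), are scored from 0 to 100 and graded A to E, and were used in this study. Water quality (T-N, NH4, NO3, T-P, PO4) and water temperature were selected as the variables affecting aquatic ecosystem health. An inverse relationship was found: health scores were widely distributed when water pollution was low but fell when pollution was high. Classifying the grades of the three health indices with a random forest model, a machine learning classification technique, yielded precision, recall, and F1-score all at or above 0.81. In the future SWAT hydrology and water quality results under the Korea Meteorological Administration's HadGEM3-RA RCP 4.5 and 8.5 scenarios, nitrogen-series concentrations increased by up to 43.2% relative to the baseline year owing to increased baseflow, while phosphorus-series pollution decreased by up to 18.9% owing to reduced surface runoff. Future FAI and BMI grades tended to improve, whereas TDI grades deteriorated. From this, TDI was judged to be sensitive to nitrogen-series water quality, and FAI and BMI more sensitive to phosphorus-series water quality.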
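
The precision, recall, and F1-score reported for the grade classification can be computed per grade from the true and predicted labels. The sketch below is a plain-Python illustration with made-up grades; it is not the authors' evaluation code, and the function name `per_class_scores` is hypothetical.

```python
def per_class_scores(y_true, y_pred, label):
    """Precision, recall and F1 for one class in a multi-class problem."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == label and p == label for t, p in pairs)
    fp = sum(t != label and p == label for t, p in pairs)
    fn = sum(t == label and p != label for t, p in pairs)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


# Made-up health grades for six monitoring sites.
y_true = ["A", "A", "B", "B", "C", "C"]
y_pred = ["A", "B", "B", "B", "C", "A"]

# Grade B: 2 true positives, 1 false positive, 0 false negatives.
precision_b, recall_b, f1_b = per_class_scores(y_true, y_pred, "B")
```

A single summary figure for all grades is usually obtained by averaging the per-class scores, either unweighted or weighted by class frequency; the abstract does not say which averaging was used.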