머신러닝을 이용한 안개 예측 시 목측과 시정계 계측 방법에 따른 모델 성능 차이 비교

박창현; 이순환

논문 상세보기

머신러닝을 이용한 안개 예측 시 목측과 시정계 계측 방법에 따른 모델 성능 차이 비교 KCI 등재

Comparison of Machine Learning Model Performance based on Observation Methods using Naked-eye and Visibility-meter

박창현, 이순환

언어KOR
URLhttps://db.koreascholar.com/Article/Detail/421736

구독 기관 인증 시 무료 이용이 가능합니다. 4,600원

한국지구과학회지 (The Journal of The Korean Earth Science Society)

제44권 제2호 (2023.04)
pp.105-118

한국지구과학회 (The Korean Earth Science Society)

초록

본 연구에서는 2 016년부터 2 02 0년까지 내륙 관측소 중 안개 최다발 지역인 안동을 대상으로 XGBoost-DART 머신러닝 알고리즘을 이용하여 1 시간 후 안개 유무를 예측하였다. 기상자료, 농업관측자료, 추가 파생자료와 각 자료 를 오버 샘플링한 확장자료, 총 6개의 데이터 세트를 사용하였다. 목측으로 획득한 기상현상번호와 시정계 관측으로 측 정된 시정거리 자료를 각각 안개 유[1]무[0]로 이진 범주화하였다. 총 12개의 머신러닝 모델링 실험을 설계하였고, 안개 가 사회와 지역사회에 미치는 유해성을 고려하여 모델의 성능은 재현율과 AUC-ROC를 중심으로 평가하였다. 전체적으 로, 오버샘플링한 기상자료와 기상현상번호 기반의 예측 목표를 조합한 실험이 최고 성능을 보였다. 이 연구 결과는 머 신러닝 알고리즘을 활용한 안개 예측에 있어서, 목측으로 획득한 기상현상번호의 중요성을 암시한다.

In this study, we predicted the presence of fog with a one-hour delay using the XGBoost DART machine learning algorithm for Andong, which had the highest occurrence of fog among inland stations from 2016 to 2020. We used six datasets: meteorological data, agricultural observation data, additional derived data, and their expanded data. The weather phenomenon numbers obtained through naked-eye observations and the visibility distances measured by visibility meters were classified as fog [1] or no-fog [0]. We set up twelve machine learning modeling experiments and used data from 2021 for model validation. We mainly evaluated model performance using recall and AUC-ROC, considering the harmful effects of fog on society and local communities. The combination of oversampled meteorological data features and the target induced by weather phenomenon numbers showed the best performance. This result highlights the importance of naked-eye observations in predicting fog using machine learning algorithms.

키워드

익스트림 그레디언트 부스팅안개 예측시정머신러닝기상현상번호 XGBoostfog predictionvisibilitymachine learningweather phenomenon number

Abstract
요 약
1. 서 론
2. 연구 방법
    2.1. 데이터
    2.2. 모델링 구성
    2.3. 모델 성능 평가 지표
3. 연구 결과
    3.1. 현상번호(Phen) 목표 예측
    3.2. 시정거리 범주(Vis) 목표 예측
4. 요약 및 결론
사 사
참고문헌

저자

박창현(부산대학교 환경연구원) | Changhyoun Park (Institute of Environmental Studies, Pusan National University)
이순환(부산대학교 지구과학교육과) | Soon-hwan Lee (Department of Earth Science Education, Pusan National University) Corresponding author

같은 권호 다른 논문