검색결과 - koreascholar

1.

2026.04 구독 인증기관·개인회원 무료

Comprehensive phylogenetics of the Pseudoscorpiones using backbone and data-mining

Kyeong-Hoon Jeong, Hee Han, Jinsung Park, Jiseung Kim, Jithin Johnson, Hsiang-Yun Lin, Danilo Harms, Sora Kim

한국응용곤충학회 학술대회논문집 2026 Spring Conference of KSAE & ESK p.145 한국응용곤충학회

2.

2025.12 KCI 등재 구독 인증기관 무료, 개인회원 유료

스포츠 전공 청년들을 위한 일자리 추천모형 개발: 데이터마이닝 기반 추천 알고리즘 적용

Development of Job Recommendation Model for Youth Majoring in Sport : Applying Recommendation Algorithms based on Data Mining

신진호

한국응용과학기술학회지 Vol.42 No.6 pp.972-979 한국응용과학기술학회(구 한국유화학회)

본 연구는 준거집단(취직자)들의 활동 데이터 뱅크를 생성하여 예비 취업자(고등교육기관의 체 육계열 전공자)들이 현재까지 활동했던 데이터를 데이터마이닝 기반 추천 알고리즘을 적용해 예비 취업자 들에게 가장 적합한 직업군을 추천해주는 스포츠 일자리 추천모형을 개발하고 검증하는 것이다. 따라서 평 가지표를 구성하고, 준거집단을 대상으로 인터뷰 및 조사를 통해 데이터 뱅크를 생성했다. 또한 비확률 표 본추출법 중 할당표본 추출법과 눈덩이표본 추출법을 적용해 예비 취업자 조사를 실시했으며, 총 921명의 자료를 통해 스포츠 일자리 추천모형 개발과 유사도를 통해 모형을 검증했다. 즉, 본 결과는 다음과 같다. 첫째, 준거집단과 예비 취업자의 평가지표를 구성했다. 둘째, 준거집단의 데이터 뱅크를 생성했다. 셋째, 스 포츠 전공 청년들을 위한 일자리 추천모형을 개발하고, 유사도를 통해 모형을 검증했다.

4,000원

3.

2025.07 KCI 등재 구독 인증기관 무료, 개인회원 유료

로컬콘텐츠 중점대학에 관한 온라인 담론 구조와 실행 네트워크 분석 : 소셜 빅데이터 기반 텍스트마이닝 접근

Discourse and Network Structures of Local Content-Focused Universities : A Text Mining Approach Using Social Big Data

노성여

한국과 세계 제7권 4호 pp.193-235 한국국회학회

본 연구는 중소벤처기업부가 추진하는 정책사업인 로컬콘텐츠 중점대 학을 중심으로, 최근 2년간 온라인에서 형성된 담론의 구조와 사회적 수 용 양상을 소셜 빅데이터 분석을 통해 규명하는 데 목적이 있다. 이를 위해 네이버와 다음의 블로그, 뉴스, 웹문서 등 다양한 채널에서 수집한 데이터를 기반으로 텍스트 마이닝(단어빈도, TF-IDF, N-gram), 개체명 인식, 2-mode 매트릭스 분석, 감성 분석, CONCOR 분석, LDA 토픽모 델링 및 의미기반 클러스터링을 수행하였다. 분석 결과, ‘대학’, ‘콘텐츠’, ‘창업’, ‘지역’, ‘지원’ 등 핵심어를 중심으로 한 의미구조가 형성되어 있 었으며, 담론은 대체로 긍정적 정서를 포함하고 있었다. 또한 대학, 지역 기관, 중소기업 간 협력 네트워크가 주체별로 상이한 양상을 보이며 다 층적 실행 구조를 보여주었다. 본 연구는 로컬콘텐츠 중점대학의 사회적 인식과 정책적 함의를 담론 기반으로 조망함으로써, 향후 제도 설계 및 지역혁신전략 수립에 기초자료를 제공하고자 한다.

9,500원

4.

2024.03 KCI 등재 구독 인증기관 무료, 개인회원 유료

中国社交媒体对埃隆·马斯克人物形象认知的特点分析 : 基于微博博文数据的挖掘

Analysis of the Characteristics of Elon Musk's Image on Chinese Social Media : Centered on Data Mining of Weibo Articles

涂波

한국과 세계 제6권 2호 pp.913-928 한국국회학회

Elon·Musk is a business man who attracts the world’s most attention, not only because of its unusual business mind, advanced challenging consciousness and legendary entrepreneurial experience which made him the world's richest man, but also because he is good at using the trend of social network society (SNS) platform to achieve social interaction. This study uses python 3.11 software to capture and filter Musk's Weibo articles on August 18th, 2023, and makes logical analysis based on the chronological related events, so as to extract Musk’s cognitive characteristics of Chinese social media. This paper finds that Chinese social media builds Musk's image cognition through reporting and judging his career development and hot issues, the cognition varies with the dynamic changes of character events; Chinese social media focuses on fields of Tesla intelligent driving, spaceship and brain neural technology, as well as social media; Weibo articles’ cognitive characteristics of Musk's image are extreme, where the extremely positive proportion accounts for more than 60%, and the extremely negative proportion accounts for more than 10%.

4,900원

5.

2023.10 구독 인증기관·개인회원 무료

Data mining large-volume photometric galaxy catalog

Duho Kim

천문학회보 제48권 2호 pp.126-127 한국천문학회

6.

2023.09 KCI 등재 구독 인증기관 무료, 개인회원 유료

Application and Comparison of Data Mining Technique to Prevent Metal-Bush Omission

메탈부쉬 누락예방을 위한 데이터마이닝 기법의 적용 및 비교

Sang-Hyun Ko, Dongju Lee

한국산업경영시스템학회지 Vol. 46 No. 3 pp.139-147 한국산업경영시스템학회

The metal bush assembling process is a process of inserting and compressing a metal bush that serves to reduce the occurrence of noise and stable compression in the rotating section. In the metal bush assembly process, the head diameter defect and placement defect of the metal bush occur due to metal bush omission, non-pressing, and poor press-fitting. Among these causes of defects, it is intended to prevent defects due to omission of the metal bush by using signals from sensors attached to the facility. In particular, a metal bush omission is predicted through various data mining techniques using left load cell value, right load cell value, current, and voltage as independent variables. In the case of metal bush omission defect, it is difficult to get defect data, resulting in data imbalance. Data imbalance refers to a case where there is a large difference in the number of data belonging to each class, which can be a problem when performing classification prediction. In order to solve the problem caused by data imbalance, oversampling and composite sampling techniques were applied in this study. In addition, simulated annealing was applied for optimization of parameters related to sampling and hyper-parameters of data mining techniques used for bush omission prediction. In this study, the metal bush omission was predicted using the actual data of M manufacturing company, and the classification performance was examined. All applied techniques showed excellent results, and in particular, the proposed methods, the method of mixing Random Forest and SA, and the method of mixing MLP and SA, showed better results.

4,000원

7.

2023.08 KCI 등재 구독 인증기관 무료, 개인회원 유료

군사학의 학술연구 동향과 인식에 관한 연구 : 빅데이터를 활용한 텍스트마이닝을 중심으로

A Study on the Academic Research Tendency and Perception of the Military Science : Focusing on text mining employing big data

김동훈, 김법헌

한국과 국제사회 제7권 4호 pp.87-106 한국정치사회연구소

군사학은 급변하는 안보환경과 국제정세의 변화, 4차산업혁명시대의 무기체계 발전과 저출산에 따른 병역제도 등의 사회적 관심이 증대되 고 있다. 따라서 본 연구는 빅데이터를 활용한 텍스트마이닝 기법으로 군사학의 학술연구 동향과 사회적 인식을 분석하여 시사점을 제시하는 데 있다. 연구 결과 학술연구 동향은 주변국 관계, 무기체계, 방위산업, 인공지능 등이 중점을 이루었지만, 사회적 인식은 대학교와 군사학과, 장교 등의 관심으로 차이점을 보였다. 군사학 발전을 위해 연구 중심의 역량과 환경을 구축하고, 융·복합적 연구와 지역사회와 연계한 산학협 력 체계구축 및 국민 참여를 통한 학술 세미나 및 통합연구 등이 요구 되었다.

5,500원

8.

2023.02 KCI 등재 구독 인증기관 무료, 개인회원 유료

코로나19 발생 후 지역농산물 이용 간편식에 대한 시장 이슈 변화: 온라인 빅데이터의 텍스트마이닝

Change in Market Issues on HMR (Home Meal Replacements) Using Local Foods after the COVID-19 Outbreak: Text Mining of Online Big Data

주유정, 변우진, 윤지현

韓國食生活文化學會誌 제38권 제1호 pp.1-14 한국식생활문화학회

This study was conducted to explore the change in the market issues on HMR (Home Meal Replacements) using local foods after the COVID-19 outbreak. Online text data were collected from internet news, social media posts, and web documents before (from January 2016 to December 2019) and after (from January 2020 to November 2022) the COVID- 19 outbreak. TF-IDF analysis showed that ‘Trend’, ‘Market’, ‘Consumption’, and ‘Food service industry’ were the major keywords before the COVID-19 outbreak, whereas ‘Wanju-gun’, ‘Distribution’, ‘Development’, and ‘Meal-kit’ were main keywords after the COVID-19 outbreak. The results of topic modeling analysis and categorization showed that after the COVID-19 outbreak, the ‘Market’ category included ‘Non-face-to-face market’ instead of ‘Event,’ and ‘Delivery’ instead of ‘Distribution’. In the ‘Product’ category, ‘Marketing’ was included instead of ‘Trend’. Additionally, in the ‘Support’ category, ‘Start-up’ and ‘School food service’ appeared as new topics after the COVID-19 outbreak. In conclusion, this study showed that meaningful change had occurred in market issues on HMR using local foods after the COVID-19 outbreak. Therefore, governments should take advantage of such market opportunity by implementing policy and programs to promote the development and marketing of HMR using local foods.

4,600원

9.

2022.12 KCI 등재 구독 인증기관 무료, 개인회원 유료

궤적 데이터 마이닝 연구 동향: 응용 분야와 분석 방법론을 중심으로

Research Trend Analysis on Trajectory Data Mining: Focusing on Applications and Methods

김지연, 이도현, 이지윤, 조주연, 강영옥

한국지도학회지 제22권 제3호 pp.37-57 한국지도학회

최근 GPS에 기반한 위치 수집 기술의 발전과 스마트폰과 같은 GPS를 탑재한 디바이스의 폭발적인 증가로 사람, 차량, 선박, 항공체와 같은 움직이는 물체의 지리적 위치에 대한 엄청난 양의 데이터가 실시간으로 수집되고 있다. 이는 사물의 움직임과 관련된 중요한 학문적 및 실용적 가치를 가지고 있다. 이와 같은 데이터를 분석하기 위한 데이터 마이닝 방법 또한 함께 발전하고 있으며 연구자들은 궤적 데이터를 활용하여 도시에서 일어나는 이동 현상과 도시를 구성하는 장소 간의 관계 등을 탐색함으로써 다양한 도시 문제에 대한 해결방안을 제시하고 있다. 궤적은 다양한 물체의 움직임을 추적할 수 있는 만큼 그 활용 분야와 목적 역시 매우 다양하여 도시 계획, 교통, 행동생태학, 공공안전, 이상 및 위반 탐지, 감시 등과 같은 분야에서 널리 활용되고 있다. 특히 최근 데이터 마이닝 방법론과 딥러닝 기술의 발전으로 궤적 데이터 분석에 다양한 분석방법이 융합적으로 접목되어 의미 있는 연구결과 도출되고 있어 이에 대한 체계적 분석이 필요하다. 이러한 배경하에 본 연구는 궤적 데이터를 활용한 국내외 약 150여 편의 연구를 응용분야 및 활용방법론 별로 구분하고, 응용분야별, 궤적 데이터 분석 방법론별 최근 동향을 분석하였다. 이는 향후 궤적 데이터에 적용가능한 방법론 탐색, 궤적 데이터 분석과 관련된 구체적 사례 탐색, 궤적 데이터를 활용한 응용서비스 도출의 자료로 활용될 수 있을 것으로 사료된다.

5,700원

10.

2022.11 구독 인증기관·개인회원 무료

AIS 기반의 시공간 데이터마이닝을 이용한 해상교통 공간패턴 분석에 관한 연구

Study on Maritime Traffic Spatial Pattern Analysis using AIS-based Spatial-temporal Data Mining

김태현, 김영도, 박세정, 임올렉, 이준화, 김윤지, 이정석, 조익순

해양환경안전학회 학술대회 논문집 2022년도 추계학술발표회 p.113 해양환경안전학회

11.

2021.09 KCI 등재 구독 인증기관 무료, 개인회원 유료

터널시설물 점검진단 데이터의 텍스트마이닝 분석을 통한 유형별･지역별 중점 유지관리요소의 이해

Understanding Facility Management on Tunnel through Text Mining of Precision Safety Diagnosis Data

서정은, 오진탁

한국대공간건축 논문집(구 한국공간구조학회지) 제21권 제3호 pp.85-92 한국공간구조학회

The purpose of this paper is to understand the key factors for efficient maintenance of rapidly aging facilities. Therefore, the safety inspection/diagnosis reports accumulated in the unstructured data were collected and preprocessed. Then, the analysis was performed using a text mining analysis method. The derived vulnerabilities of tunnel facilities can be used as elements of inspections that take into account the characteristics of individual facilities during regular inspections and daily inspections in the short term. In addition, if detailed specification information and other inspection results(safety, durability, and ease of use) are used for analysis, it provides a stepping stone for supporting preemptive maintenance decision-making in the long term.

4,000원

12.

2021.06 구독 인증기관 무료, 개인회원 유료

점검진단 데이터의 텍스트마이닝 분석을 통한 터널시설물 중점 유지관리요소의 이해

Understanding Facility Management on Tunnel through Text Mining of Precision Safety Diagnosis Data

서정은, 오진탁

복합신소재구조학회지 제12권 제2호 pp.55-61 한국복합신소재구조학회

4,000원

13.

2021.06 구독 인증기관 무료, 개인회원 유료

국내 금융교육 학술적 담론에 대한 텍스트 마이닝(LDA) 분석

The Academic Discourse of Financial Education in Korea - Data Using Text Mining

이주형

금융교육연구 제6권 pp.49-70 한국금융교육학회

국내에서 연구된 금융교육 유관 학술논문을 보다 객관적으로 이해하고자 논문 초록에서 추출된 키워드를 중심으로 주요 토픽을 추론하여 포괄적인 담론들을 알아보고자 한다. 연구의 효율성을 높이고 반복될 수 있는 후속과제 연구를 위하여 빅 데이터 분석기법(텍스트 마이닝 - LDA)을 활용하였고 주요토픽에 대한 단어들을 추출하였다. 총 208건의 유관된 학술 논문을 전 처리 한 후에 추출된 명사 32,523건 중 상위빈도 1,201건에 대하여 LDA 토픽모델링을 실시한 결과 16개의 토픽 군이 형성되었다. 최다 빈도의 단어는 “금융이해력” 이었고 다음은 “학생”, “금융소비자” 순이었는데 추론 된 토픽들의 공통적인 주요 요소에는 학교와 학생들에게 교육을 제공하거나 공급하는 과정에 관심을 가지고 있다는 것이었다. 핵심 텍스트와 토픽을 정의하면서 피교육자의 사회적 요구와 성인을 위한 금융교육 관심도가 미흡하여 향후 지속적인 연구영역 확대 가 필요하다는 시사점을 발견하게 되었다.

5,800원

14.

2021.06 구독 인증기관·개인회원 무료

데이터 마이닝 기법을 활용한 부산신항 컨테이너선의 입항패턴 분석

Analysis of Container Ship Trajectories in Busan New Port using Data Mining Technique

이형탁, 이정석, 양현, 조익순

해양환경안전학회 학술대회 논문집 2021년도 공동학술대회 해양환경안전학회 춘계학술발표회 p.43 해양환경안전학회

15.

2020.12 KCI 등재 구독 인증기관 무료, 개인회원 유료

4차 산업혁명이 주목한 Z세대의 스포츠 소비 스타일 탐색: 데이터마이닝 기반 의사결정 나무 분석 적용

Exploring Sport Consumption Style of Generation Z that the 4th Industrial revolution paid attention to: Applying Decision Tree Analysis based on Data Mining

신진호, 임영삼, 김지선

한국응용과학기술학회지 Vol. 37 No. 5 pp.1208-1221 한국응용과학기술학회(구 한국유화학회)

본 연구는 데이터 마이닝 기반 의사결정 나무 분석을 적용해 Z세대 스포츠 소비 스타일을 탐색 하여 Z세대가 주도할 스포츠 소비 시장을 예측하기 위한 기초자료를 제공하고자 했다. 따라서 Z세대 중 만 19세 이상 남성 및 여성을 표본으로 선정해 본 조사를 실시했으며, 총 429명의 자료를 최종 분석에 사용했다. 자료처리는 SPSS statistics(ver. 21.0) 프로그램을 이용하여 빈도분석, 탐색적 요인분석, 재검사 신 뢰도 및 신뢰도 분석, 의사결정 나무 분석을 실시했다. 본 연구의 주요 결과는 다음과 같다. 첫째, 합리 효율성 지수가 높고, 심미적 소비 지수가 낮을 경우 여성 집단으로 분류될 확률이 96.8%로 나타났다. 반면에 합리 효율성과 가격 지향 지수가 낮을 경우 남성 집단으로 분류될 확률이 100%로 나타났다. 둘째, 브랜드 지향, 가격 지향, 합리 효율성 지수가 높을 경우 수도권 집단으로 분류될 확률이 97.3%로 나타났다. 앞서 제시한 결과와는 상반적으로 브랜드 지향, 기념 의례, 지위 상징 지수가 낮을 경우 이외 지역 집단으로 분 류될 확률이 82.1%로 나타났다. 셋째, 지위 상징, 유행 지향 지수가 높으며, 기능성 지수가 낮을 경우 일상 생활 및 패션 집단으로 분류될 확률이 77.6%로 나타났다. 이와 반대로 지위 상징 지수가 낮고, 소속감 유지, 소비 향유 지수가 높을 경우 운동 및 경기 집단으로 분류될 확률이 81.0%로 나타났다.

4,600원

16.

2020.06 구독 인증기관 무료, 개인회원 유료

Application on Electronic Chart Data Management based on Data Mining Technology

Haoran Song

International Journal of e-Navigation and Maritime Economy Volume 14 pp.68-74 국제이네비해양경제학회

After decades of vigorous development, data mining technology has achieved fruitful theoretical and application results. As a highly applicable subject, data mining technology has penetrated into various fields of the national economy, and has aroused great attention from academia and industry. A large amount of chart data is stored in the electronic chart database, and its application is very extensive, providing a valuable decision basis for managers in all walks of life. It is of great significance to establish a complete data management mechanism based on data mining technology. The traditional data analogy extraction technology, because of the data association index and the poor ability of data association, leads to the difference between the extraction data and the target data. Therefore, the application of data mining technology on electronic chart data management is studied. Data mining technology uses rough set to obtain the basic information of electronic chart data management according to similarity function, mining electronic chart data management association rules; through the comprehensive evaluanon data system of electronic chart data management, building rule base, setting up the evaluation index of electronic chart data management, achieving the similarity evaluation of the mining results. Experimental test results: compared with the traditional data analogy extraction technology, the results obtained by data mining technology have higher similarity with the target data and meet the requirements of electronic chart data management acquisition. It can be seen that this technology is more suitable for the application of electronic chart data management

4,000원

17.

2018.12 KCI 등재 구독 인증기관 무료, 개인회원 유료

Unstructured Data Quantification Scheme Based on Text Mining for User Feedback Extraction

사용자 의견 추출을 위한 텍스트 마이닝 기반 비정형 데이터 정량화 방안

Jung-Heum Jo, Yong-Taek Chung, Seong-Wook Choi, Changsoo Ok

한국산업경영시스템학회지 Vol. 41 No. 4 pp.131-137 한국산업경영시스템학회

People write reviews of numerous products or services on the Internet, in their blogs or community bulletin boards. These unstructured data contain important emotions and opinions about the author's product or service, which can provide important information for future product design or marketing. However, this text-based information cannot be evaluated quantitatively, and thus they are difficult to apply to mathematical models or optimization problems for product design and improvement. Therefore, this study proposes a method to quantitatively extract user’s opinion or preference about a specific product or service by utilizing a lot of text-based information existing on the Internet or online. The extracted unstructured text information is decomposed into basic unit words, and positive rate is evaluated by using existing emotional dictionaries and additional lists proposed in this study. This can be a way to effectively utilize unstructured text data, which is being generated and stored in vast quantities, in product or service design. Finally, to verify the effectiveness of the proposed method, a case study was conducted using movie review data retrieved from a portal website. By comparing the positive rates calculated by the proposed framework with user ratings for movies, a guideline on text mining based evaluation of unstructured data is provided.

4,000원

18.

2018.12 KCI 등재 구독 인증기관 무료, 개인회원 유료

궤적 데이터 마이닝을 통한 서울방문 관광객의 이동 특성 분석

Analysis of Travel Patterns of Seoul Tourists by Trajectory Data Mining

이주윤, 강영옥, 김나연, 김동은, 박예림

한국지도학회지 제18권 제3호 pp.117-129 한국지도학회

본 연구는 소셜 네트워크 서비스 중 한 유형인 플리커를 이용하여 궤적 데이터를 생성하고, 서울을 방문한 관광객의 이동 특성을 분석하였다. 연구에는 2015년 1월 1일 부터 2017년 12월 31일까지 서울을 방문한 1,476명 관광객이 게시한 플리커 사진 39,157건을 활용하였다. 연구기간 내 서울을 방문한 관광객은 1회 방문시 평균 5.12일을 체류하며, 약 1.27회 방문한 것으로 나타났다. 서울방문 관광객의 첫 방문지는 종로･남산, 신촌･홍대, 이태원 순으로 나타났으며, 주 목적지는 종로･남산이며 주로 인접 지역으로 이동하는 것으로 나타났다. 본 연구에서 활용한 데이터와 방법론은 관광행태 분석을 효율화하고, 다각적 분석을 가능하게 하는데 기여할 것으로 판단된다.

4,500원

19.

2018.08 KCI 등재 구독 인증기관 무료, 개인회원 유료

데이터 마이닝 기법을 이용한 차량용 반도체의 불량률 예측 연구

Prediction of field failure rate using data mining in the Automotive semiconductor

윤경식, 정희운, 박승범

기술혁신연구 26권 3호 pp.37-68 기술경영경제학회

본 논문에서는 차량용 반도체가 제품 출하 후 사용 환경에 따라 발생되는 불량률을 데이터 마이닝 기법을 이용하여 분석하였다. 20세기 이후 가장 보편적인 이동 수단인 자동차는 전자 컨트롤 장치와 자동차용 반도체의 사용량이 급격히 증가하면서 매우 빠른 속도로 진화하고 있다. 자동차용 반도체는 차량용 전자 컨트롤 장치 중 핵심 부품으로 소비자들에게 안정성, 연료 사용의 효율성, 운전의 안정감을 제공하기 위해 사용되고 있다. 자동차용 반도체는 가솔린엔진, 디젤 엔진, 전기 모터를 컨트롤하는 기술, 헤드업 디스플레이, 차선 유지 시스템 등 많은 부분에 적용되고 있다. 이와 같이 반도체는 자동차를 구성하는 거의 모든 전자 컨트롤 장치에 적용되고 있으며 기계적인 장치를 단순히 조합한 이상의 효과를 만들어 내고 있다. 자동차용 반도체는 10년 이상의 자동차 사용 기간을 고려하여 높은 신뢰성, 내구성, 장기공급 등의 특성을 요구하고 있다. 자동차용 반도체의 신뢰성은 자동차의 안전성과 직접적으로 연결되기 때문이다. 반도체업계에서는 JEDEC과 AEC 등의 산업 표준 규격을 이용하여 자동차용 반도체의 신뢰성을 평가하고 있다. 또한 자동차 산업에서 표준으로 제시한 신뢰성 실험 방법과 그 결과를 이용하여 개발 초기 단계 및 제품 양산 초기 단계에서 제품의 수명을 예측 하고 있다. 하지만 고객의 다양한 사용 조건 및 사용 시간 등 여러 변수들에 의해 발생되는 불량률을 예측하는 데는 한계가 있다. 이러한 한계점을 극복하기 위하여 학계와 산업계에서 많은 연구가 있어왔다. 그 중 데이터 마이닝 기법을 이용한 연구가 다수의 반도체 분야에서 진행되고 있지만, 아직 자동 차용 반도체에 대한 적용 및 연구는 미비한 상태이다. 이러한 관점에서 본 연구는 데이터 마이닝 기법을 이용하여 반도체 조립(Assembly) 과 패키지 테스트(Package test) 공정 중 발생 된 데이터들간의 연관성을 규명하고, 고객 불량 데이터를 이용하여 잠재 불량률 예측에 적합한 데이터 마이닝 기법을 검증하였다.

7,800원

20.

2018.03 KCI 등재 구독 인증기관 무료, 개인회원 유료

빅데이터 분석 기반의 오피니언 마이닝을 이용한 정보화 사업 평가 분석

An Analysis of IT Proposal Evaluation Results using Big Data-based Opinion Mining

김홍삼, 김종수

한국산업경영시스템학회지 Vol. 41 No. 1 pp.1-10 한국산업경영시스템학회

Current evaluation practices for IT projects suffer from several problems, which include the difficulty of self-explanation for the evaluation results and the improperly scaled scoring system. This study aims to develop a methodology of opinion mining to extract key factors for the causal relationship analysis and to assess the feasibility of quantifying evaluation scores from text comments using opinion mining based on big data analysis. The research has been performed on the domain of publicly procured IT proposal evaluations, which are managed by the National Procurement Service. Around 10,000 sets of comments and evaluation scores have been gathered, most of which are in the form of digital data but some in paper documents. Thus, more refined form of text has been prepared using various tools. From them, keywords for factors and polarity indicators have been extracted, and experts on this domain have selected some of them as the key factors and indicators. Also, those keywords have been grouped into into dimensions. Causal relationship between keyword or dimension factors and evaluation scores were analyzed based on the two research models-a keyword-based model and a dimension-based model, using the correlation analysis and the regression analysis. The results show that keyword factors such as planning, strategy, technology and PM mostly affects the evaluation result and that the keywords are more appropriate forms of factors for causal relationship analysis than the dimensions. Also, it can be asserted from the analysis that evaluation scores can be composed or calculated from the unstructured text comments using opinion mining, when a comprehensive dictionary of polarity for Korean language can be provided. This study may contribute to the area of big data-based evaluation methodology and opinion mining for IT proposal evaluation, leading to a more reliable and effective IT proposal evaluation method.

4,000원