
Extracting Specific Information in Web Pages Using Machine Learning (KCI-listed)

Korean title: 머신러닝을 이용한 웹페이지 내의 특정 정보 추출

  • Language: KOR
  • URL: https://db.koreascholar.com/Article/Detail/364064
한국산업경영시스템학회지 (Journal of Society of Korea Industrial and Systems Engineering)
한국산업경영시스템학회 (Society of Korea Industrial and Systems Engineering)
Abstract

With the advent of the digital age, the production and distribution of web pages have grown explosively. Internet users frequently need to extract specific information from this vast number of web pages, but finding it takes considerable time and effort. Commonly used search engines return web pages that contain the information users are looking for, yet additional time and effort are still required to locate the specific information within the extensive search results. It is therefore necessary to develop algorithms that can automatically extract specific information from web pages. Every year, thousands of international conferences are held all over the world. Each conference has a website that provides general information such as the event dates, the venue, a greeting, the abstract submission deadline, and the registration date. It is not easy for researchers to identify the abstract submission deadline quickly because it is displayed in different formats from conference to conference and is frequently updated. This study focuses on extracting abstract submission deadlines from international conference websites. We use three machine learning models, namely support vector machines (SVM), decision trees, and artificial neural networks, to develop algorithms that extract the abstract submission deadline from an international conference website. The performance of the proposed algorithms is evaluated using 2,200 conference websites.
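The abstract does not say how candidate dates are located or which features are fed to the three models. As a minimal sketch under that caveat, the following Python code shows one plausible preprocessing step: find date strings in the page text and build a small binary feature vector from the words just before each date. The names `CUE_WORDS`, `DATE_PATTERN`, `candidate_features`, and the `window` parameter are hypothetical and are not taken from the paper.

```python
import re

# Hypothetical cue words; the abstract does not state the authors' features.
CUE_WORDS = ("abstract", "submission", "deadline", "due")

MONTHS = (
    "January|February|March|April|May|June|July|August|"
    "September|October|November|December"
)
# Covers only two formats: "March 15, 2019" and "15 March 2019".
DATE_PATTERN = re.compile(
    rf"\b(?:(?:{MONTHS})\s+\d{{1,2}},?\s+\d{{4}}"
    rf"|\d{{1,2}}\s+(?:{MONTHS})\s+\d{{4}})\b"
)


def candidate_features(text, window=40):
    """Return (date_string, feature_vector) for each date found in text.

    Each feature is 1 if the corresponding cue word occurs in the
    `window` characters that precede the date, otherwise 0.
    """
    candidates = []
    for match in DATE_PATTERN.finditer(text):
        start = max(0, match.start() - window)
        context = text[start:match.start()].lower()
        features = [1 if cue in context else 0 for cue in CUE_WORDS]
        candidates.append((match.group(), features))
    return candidates


page_text = (
    "Important dates. Abstract submission deadline: March 15, 2019. "
    "Registration opens on 1 April 2019. "
    "The conference will be held on June 10, 2019 in Seoul."
)
candidates = candidate_features(page_text)
for date_string, features in candidates:
    print(date_string, features)
```

In this toy input, only the first date has cue words in its preceding context. Feature vectors of this kind, paired with labels marking which candidate is the true deadline, are the sort of input a classifier such as an SVM, a decision tree, or a neural network could be trained on. Real conference pages use many more date formats than the two handled here.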

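Table of Contents
1. Introduction
2. Data
  2.1 Data Collection
  2.2 Data Preprocessing
3. Machine Learning
  3.1 SVM Model
  3.2 Decision Tree Model
  3.3 Artificial Neural Network Model
4. Experimental Results
5. Conclusion
References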
Authors
  • Joung-Yun Lee(Industrial and Management Engineering, Incheon National University) | 이정윤
  • Jae-Gon Kim(Industrial and Management Engineering, Incheon National University) | 김재곤 Corresponding Author