Extracting Specific Information in Web Pages Using Machine Learning

Joung-Yun Lee; Jae-Gon Kim

논문 상세보기

Extracting Specific Information in Web Pages Using Machine Learning KCI 등재

머신러닝을 이용한 웹페이지 내의 특정 정보 추출

Joung-Yun Lee, Jae-Gon Kim

언어KOR
URLhttps://db.koreascholar.com/Article/Detail/364064

구독 기관 인증 시 무료 이용이 가능합니다. 4,000원

한국산업경영시스템학회지 (Journal of Society of Korea Industrial and Systems Engineering)

Vol. 41 No. 4 (2018.12)
pp.189-195

한국산업경영시스템학회 (Society of Korea Industrial and Systems Engineering)

초록

With the advent of the digital age, production and distribution of web pages has been exploding. Internet users frequently need to extract specific information they want from these vast web pages. However, it takes lots of time and effort for users to find a specific information in many web pages. While search engines that are commonly used provide users with web pages containing the information they are looking for on the Internet, additional time and efforts are required to find the specific information among extensive search results. Therefore, it is necessary to develop algorithms that can automatically extract specific information in web pages. Every year, thousands of international conference are held all over the world. Each international conference has a website and provides general information for the conference such as the date of the event, the venue, greeting, the abstract submission deadline for a paper, the date of the registration, etc. It is not easy for researchers to catch the abstract submission deadline quickly because it is displayed in various formats from conference to conference and frequently updated. This study focuses on the issue of extracting abstract submission deadlines from International conference websites. In this study, we use three machine learning models such as SVM, decision trees, and artificial neural network to develop algorithms to extract an abstract submission deadline in an international conference website. Performances of the suggested algorithms are evaluated using 2,200 conference websites.

키워드

Data ExtractionMachine LearningSVMDecision TreeNeural Network

1. 서 론
2. 데이터
  2.1 데이터 수집
  2.2 데이터 전처리
3. 머신러닝
  3.1 SVM 모델
  3.2 의사결정나무 모델
  3.3 인공신경망 모델
4. 실험결과
5. 결 론
References

저자

Joung-Yun Lee(Industrial and Management Engineering, Incheon National University) | 이정윤
Jae-Gon Kim(Industrial and Management Engineering, Incheon National University) | 김재곤 Corresponding Author

같은 권호 다른 논문