This study aims to develop a comprehensive predictive model for Digital Quality Management (DQM) and to analyze the impact of various quality activities on different levels of DQM. By employing the Classification And Regression Tree (CART) methodology, we are able to present predictive scenarios that elucidate how varying quantitative levels of quality activities influence the five major categories of DQM. The findings reveal that the operation level of quality circles and the promotion level of suggestion systems are pivotal in enhancing DQM levels. Furthermore, the study emphasizes that an effective reward system is crucial to maximizing the effectiveness of these quality activities. Through a quantitative approach, this study demonstrates that for ventures and small-medium enterprises, expanding suggestion systems and implementing robust reward mechanisms can significantly improve DQM levels, particularly when the operation of quality circles is challenging. The research provides valuable insights, indicating that even in the absence of fully operational quality circles, other mechanisms can still drive substantial improvements in DQM. These results are particularly relevant in the context of digital transformation, offering practical guidelines for enterprises to establish and refine their quality management strategies. By focusing on suggestion systems and rewards, businesses can effectively navigate the complexities of digital transformation and achieve higher levels of quality management.
본 연구는 서울교육청 교육연구정보원의 「서울교육종단연구(SELS)」에 서 수집된 자료를 활용하여, 고등학생 3학년인 9차(2018년) 자료에서 학 생 2,793명을 연구 대상자로 정하였다. 청소년의 학교만족도와 관련한 예측요인을 확인하기 위해 SPSS 26.0을 사용하여 의사결정나무모형 분 석을 실시하였다. 연구결과를 살펴보면, 첫째, 청소년의 학교만족도의 분 류에서 개인적인 요인으로는 성별, 자아개념, 자기평가, 사회적 관계 요 인으로 보호자, 학교교사, 학교 특성/문화 요인으로는 학교에 대한 평가, 학교풍토가 유의한 변인으로 확인되었다. 둘째, 학교만족도 분류에 영향 을 주는 변인들 중에서는 학교에 대한 평가가 가장 영향력을 가진 변인 으로 나타났다. 셋째, 학교교사 수치가 높은 집단에서는 학교풍토, 자아 개념이 분류의 중요한 의미 있는 변인이었고, 학교교사 수치가 낮은 집 단에서는 자기평가, 학교풍토, 학교에 대한 평가가 영향력 있는 변인이었 다. 넷째, 학교에 대한 평가 수준 및 학교풍토가 바람직하고 좋으면 학교 만족도가 긍정적으로 상승하는 것으로 확인되었다. 본 연구결과는 청소 년의 학교만족도 증진을 위한 방안 모색, 교육정책 수립 및 프로그램 운 영에 도움이 될 것으로 사료된다.
This paper analyzed the correlation between injection molding factors through correlation analysis. In addition, the decision-tree model, which is a white box model with excellent explanatory power, was used to obtain optimal molding conditions that satisfy multiple constraint conditions. First, 243 data to be used in the experiment were created through a full factorial design. Second, a correlation analysis was conducted to understand the correlation. Third, to verify the decision-tree model, the prediction performance was evaluated using RMSE. As a result, good prediction performance was confirmed. A decision-tree experiment analysis was conducted. As a result of the progress, the same results as the correlation analysis were derived. Based on the previous analysis results, optimal molding conditions were applied to CAE. As a result, the amount of deformation in the multi-cavity could be improved by about 1.1% and 2.72% while satisfying the constraint.
Most of the open-source decision tree algorithms are based on three splitting criteria (Entropy, Gini Index, and Gain Ratio). Therefore, the advantages and disadvantages of these three popular algorithms need to be studied more thoroughly. Comparisons of the three algorithms were mainly performed with respect to the predictive performance. In this work, we conducted a comparative experiment on the splitting criteria of three decision trees, focusing on their interpretability. Depth, homogeneity, coverage, lift, and stability were used as indicators for measuring interpretability. To measure the stability of decision trees, we present a measure of the stability of the root node and the stability of the dominating rules based on a measure of the similarity of trees. Based on 10 data collected from UCI and Kaggle, we compare the interpretability of DT (Decision Tree) algorithms based on three splitting criteria. The results show that the GR (Gain Ratio) branch-based DT algorithm performs well in terms of lift and homogeneity, while the GINI (Gini Index) and ENT (Entropy) branch-based DT algorithms performs well in terms of coverage. With respect to stability, considering both the similarity of the dominating rule or the similarity of the root node, the DT algorithm according to the ENT splitting criterion shows the best results.
본 연구는 데이터 마이닝 기반 의사결정 나무 분석을 적용해 Z세대 스포츠 소비 스타일을 탐색 하여 Z세대가 주도할 스포츠 소비 시장을 예측하기 위한 기초자료를 제공하고자 했다. 따라서 Z세대 중 만 19세 이상 남성 및 여성을 표본으로 선정해 본 조사를 실시했으며, 총 429명의 자료를 최종 분석에 사용했다. 자료처리는 SPSS statistics(ver. 21.0) 프로그램을 이용하여 빈도분석, 탐색적 요인분석, 재검사 신 뢰도 및 신뢰도 분석, 의사결정 나무 분석을 실시했다. 본 연구의 주요 결과는 다음과 같다. 첫째, 합리 효율성 지수가 높고, 심미적 소비 지수가 낮을 경우 여성 집단으로 분류될 확률이 96.8%로 나타났다. 반면에 합리 효율성과 가격 지향 지수가 낮을 경우 남성 집단으로 분류될 확률이 100%로 나타났다. 둘째, 브랜드 지향, 가격 지향, 합리 효율성 지수가 높을 경우 수도권 집단으로 분류될 확률이 97.3%로 나타났다. 앞서 제시한 결과와는 상반적으로 브랜드 지향, 기념 의례, 지위 상징 지수가 낮을 경우 이외 지역 집단으로 분 류될 확률이 82.1%로 나타났다. 셋째, 지위 상징, 유행 지향 지수가 높으며, 기능성 지수가 낮을 경우 일상 생활 및 패션 집단으로 분류될 확률이 77.6%로 나타났다. 이와 반대로 지위 상징 지수가 낮고, 소속감 유지, 소비 향유 지수가 높을 경우 운동 및 경기 집단으로 분류될 확률이 81.0%로 나타났다.
As a non-parametric data mining method, decision tree classification has performed well in many applications. The complexity of the model increases as the decision tree algorithm proceeds to grow the decision tree as the rule of decision making. While the increase of the complexity enhances the accuracy, it degrades the generalization which predicts the unseen data. This phenomenon is called as overfitting. To avoid the overfitting, pruning has been introduced. Pruning enables to make the generalization better, reduces the complexity, and avoids the overfitting. Although various pruning methods have been proposed, selecting the best pruning methods or making balance between complexity and generalization with pruning is not a simple problem. In this paper, we explore the methods of pruning and analyze them to suggest the optimal approach for applications.
PURPOSES: This study was initiated to analyze the characteristics of bus traffic accidents, by bus types, using the decision tree in order to establish customized safety alternatives by bus types, including the intra-city bus, rural area bus, and inter-city bus.
METHODS: In this study, the major elements involved in bus traffic accidents were identified using decision trees and CHAID algorithm. The decision tree was used to identify the characteristics of major elements influencing bus traffic accidents. In addition, the CHAID algorithm was applied to branch the decision trees.
RESULTS : The number of casualties and severe injuries are high in bus accidents involving pedestrians, bicycles, motorcycles, etc. In the case of light injury caused by bus accidents, different results are found. In the case of intra-city bus accidents, the probability of light injury is of 77.2% when boarding a non-owned car and breaching of duty to drive safely are involved. In the case of rural area bus accidents, the elements showing the highest probability of light injury are boarding an owned car, vehicle-to-vehicle accidents, and breaching of duty to drive safely. In the case of intra-city bus accidents, boarding owned car, streets, and vehicle-to-vehicle accidents work as the critical elements.
CONCLUSIONS: In this study, the bus accident data were categorized by bus types, and then the influential elements were identified using decision trees. As a result, the characteristics of bus accidents were found to be different depending on bus types. The findings in this study are expected to be utilized in establishing effective alternatives to reduce bus accidents.
In the manufacturing industry fields, thousands of quality characteristics are measured in a day because the systems of process have been automated through the development of computer and improvement of techniques. Also, the process has been monitored in database in real time. Particularly, the data in the design step of the process have contributed to the product that customers have required through getting useful information from the data and reflecting them to the design of product. In this study, first, characteristics and variables affecting to them in the data of the design step of the process were analyzed by decision tree to find out the relation between explanatory and target variables. Second, the tolerance of continuous variables influencing on the target variable primarily was shown by the application of algorithm of decision tree, C4.5. Finally, the target variable, loss, was calculated by a loss function of Taguchi and analyzed. In this paper, the general method that the value of continuous explanatory variables has been used intactly not to be transformed to the discrete value and new method that the value of continuous explanatory variables was divided into 3 categories were compared. As a result, first, the tolerance obtained from the new method was more effective in decreasing the target variable, loss, than general method. In addition, the tolerance levels for the continuous explanatory variables to be chosen of the major variables were calculated. In further research, a systematic method using decision tree of data mining needs to be developed in order to categorize continuous variables under various scenarios of loss function.
Self-efficacy (one's perceptions of their capability to perform a task) plays an important role in work-related performance and motivation. For example, self-efficacy is known to have much influence on job performance, job satisfaction, motivation, etc. As such it is important to know what factors collectively enhance the selfefficacy of employees, so that injured workers contribute to the organization they belong to after they come back to their workplace. The aim of this study is to identify such industrial accident-related factors and extract rules among the factors in order to establish self-efficacy enhancement strategies for injured workers. In this study, a binary decision tree model for self-efficacy prediction was built using a panel data provided from Korea Workers’ Compensation & Welfare Service. As a result, eight variables with the largest influence on self-efficacy were selected in the prediction model, and it correctly classified 70.1% of instances. The result suggests social support during the treatment period and offering paid time off such as vacation leave, sick leave and bereavement leave are important factors to enhance self-efficacy that will improve the work performance of injured workers.
Currently, R&D investment of government is increased dramatically. However, the budget of the government is different depend- ing on the size of ministry and priorities, and then it is difficult to obtain consensus on the budget. They did not establish decision support systems to evaluate and execute R&D budget. In this paper, we analyze factors affecting research funds by linear regression and decision tree analysis in order to increase investment efficiency in national research project. Moreover, we suggested strategies that budget is estimated reasonably.
본 연구는 불특정 다수의 도로이용자들이 경로우회 시 갖는 의사결정과정속에 내포된 비선형성과 불확실성을 고려한 정도 있는 모형구축으로 주요 우회결정요인을 분석하는 것이 주요 목적이다. 이를 위하여 고속도로 및 국도를 이용하는 운전자를 대상으로 우회여부에 관련된 SP조사를 실시하였고, 조사결과에 대하여 의사결정나무와 신경망이론의 결합된 모형을 구축하여 운전자 우회결정요인을 분석하였다. 분석결과 운전자 우회여부결정에 영향을 미치는 요인은 우회도로 인지여부, 교통정보 신뢰도 및 이용빈도, 경로전환빈도, 나이순으로 나타났다. 또한 오분류표를 통한 기존 모형과의 예측력의 비교결과 결합된 모형의 오분류율이 8.7%로 기존 모형인 로짓모형 12.8%, 의사결정나무 단독 모형 13.8%와 비교했을 때 가장 예측력이 높은 것으로 나타나 운전자 우회결정요인 분석에 관한 모형의 적용 타당성을 확인할 수 있었다. 본 연구의 결과는 향후 교통량 분산효과와 도로망 효율 증대를 위한 효과적인 우회관리전략 수립 시 기초 자료로 활용가능하리라 사료된다.
This paper proposes a methodology to measure the risk level in real-time for Business Activity Monitoring (BAM). A decision-tree methodology was employed to analyze the effect of process attributes on the result of the process execution. In the course of
의사결정나무 알고리즘은 데이터마이닝 기법중 하나인데 관심이 되는 데이터들에 대하여 분류 및 예측을 가능하게 해준다. 이 기법은 데이터 형태의 특성을 분석할 수 있고 산업재해 형태의 차이점을 찾아내는데 사용될 수 있다. 본 연구에서는 산업재해 데이터의 특성을 파악하고자 C4.5 알고리즘을 사용하였다. 본 연구에서 분석을 위하여 사용된 데이터는 강원도에서 발생한 2년 동안의 산업재해 관련 데이터로서 연구에 적용된 데이터의 수는 19,909개로 구성되어 있다
The consequences of rapid industrial advancement, diversified types of business and unexpected industrial accidents have caused a lot of damage to many unspecified persons both in a human way and a material way Although various previous studies have been analyzed to prevent industrial accidents, these studies only provide managerial and educational policies using frequency analysis and comparative analysis based on data from past industrial accidents. The main objective of this study is to find an optimal algorithm for data analysis of industrial accidents and this paper provides a comparative analysis of 4 kinds of algorithms including CHAID, CART, C4.5, and QUEST. Decision tree algorithm is utilized to predict results using objective and quantified data as a typical technique of data mining. Enterprise Miner of SAS and AnswerTree of SPSS will be used to evaluate the validity of the results of the four algorithms. The sample for this work chosen from 19,574 data related to construction industries during three years (2002~2004) in Korea.
Due to the convenience of use and rapid access to their information to find, many Internet users utilize the search engines or portal sites to find their information and web sites. However, many researches related with web site evaluation pointed out that many factors have to be considered to increase the usefulness of the site and the degree of user's preference about the site. In this research, based on the previous research, preference factors are derived to evaluate the portal sites. And then, five portal sites are evaluated by the questionnaire. CHAID, a decision tree technique, is used to analyze the results of survey and the relationships of preference factors. This research can be an indicator when we analyze the preference factors of portal site and other kinds of web sites using decision tree.
Classification is an important area in a data mining. There are various ways in classification methodologies : the decision tree and the neural network, etc. Recently, Rough set theory has been presented as a method for classification. Rough set theory is a new approach in decision making in the presence of uncertainty and vagueness. In the process of constructing the tree, appropriate attributes have to be selected as nodes of the tree. In this paper, we present a new approach to selection of attributes for the construction of decision tree using the Rough set theory. The suggested method makes more simple classification rules in the decision tree and reduces the volume of the data to be treated.
Expectation and interest about e-CRM are rising for more efficient customer management in on-line including electronic commerce. The decision-making tree can be used usefully as the data mining technology for e-CRM. In this paper, the representative decision making techniques, CART, C4.5, CHAID analyzed the differences in personalization point of view with actuality customer data through an experiment. With these analysis data, it is proposed a new decision-making tree system that has big advantage in personalization techniques. Through new system, it can get following advantage. First, it can form superior model more qualitatively in personalization by adding individual's weight value. Second it can supply information personalized more to customer. Third, it can have high position about customer's loyalty than other site of similar types of business. Fourth, it can reduce expense that cost marketing and decision-making. Fifth, it becomes possible that know that customer through smooth communication with customer who use personalized service wants and make from goods or service's quality to more worth thing.