The Study on the Rater Reliability of Three Scoring Methods in Assessing Argumentative Essays: Holistic, Analytic, and Multiple-Trait Scoring Methods

  • Language: ENG
  • URL: https://db.koreascholar.com/Article/Detail/272854
Foreign Language Education Research (외국어교육연구)
Foreign Language Education Research Institute, Seoul National University (서울대학교 외국어교육연구소)
Abstract

Various studies have sought to minimize subjectivity and increase accuracy in the assessment of written texts. The present study focuses on scoring rubrics, which provide the basic criteria for evaluating writing. Three scoring methods (holistic, analytic, and multiple-trait) were compared in evaluating argumentative essays written by Korean high school students, with the aim of investigating the rater reliability of each method. The scores that five raters assigned using the three methods were compared. First, there were significant mean differences among the three scoring methods: raters gave relatively low scores when they used holistic scoring. Second, all three scoring methods showed an acceptable level of inter-rater reliability, above .70, with the highest reliability found when raters used the multiple-trait scoring rubric. Third, high correlations were found among the components of the analytic and multiple-trait scoring methods, indicating that the multiple-trait scoring rubric can replace the analytic scoring rubric. Finally, raters expressed a preference for multiple-trait scoring. The results suggest some implications for writing assessment in Korean secondary English classes.

Table of Contents
Abstract
I. INTRODUCTION
 1. The Background and Purpose of the Study
 2. Research Hypothesis
II. LITERATURE REVIEW
 1. Approaches to Scoring
 2. Previous Studies
III. METHODOLOGY
 1. Participants
 2. Materials and Procedure
 3. Data Analysis
IV. RESULTS AND DISCUSSION
 1. The Differences in the Mean Scores
 2. The Inter-Rater Reliability of the Three Scoring Methods
 3. The Correlation among Components of Analytic and Multiple-trait Scoring Methods
 4. Raters’ Attitudes
V. CONCLUSION
 1. Major Findings and Pedagogical Implications
 2. Limitations and Suggestions for the Further Research
References
Appendix 1. Multiple-Trait Scoring Rubric for an Argumentative Essay
Author
  • Jonggeum Park (Seoul National University)