The present study aimed to validate a 70-item Korean bilingual version of the Vocabulary Size Test (VST) using Rasch modeling. The goal was to assess the applicability of this Korean version of the VST for Korean learners of English in an English as a foreign language (EFL) context by examining validity evidence based on Messick’s framework. Specifically, the study focused on the content, substantive, and external aspects of construct validity. However, the findings provided weak evidence supporting the utility of the VST as a measure of receptive vocabulary for Korean EFL learners. The test was deemed too easy and lacked the ability to effectively differentiate among varying levels of second language proficiency, with many test items exhibiting unexpected behavior. Additionally, the VST showed a weak correlation with another measure of second language proficiency. In light of these findings, the study offers specific recommendations for improving the test's validity and usefulness.