Utility and Limitations of AI-based Topic Modeling in Archaeological Research Trend Analysis: A Comparative Study Focusing on Bronze Age Stone Tool Research

Jin Youngmin

KCI-listed article

Language: Korean (KOR)
URL: https://db.koreascholar.com/Article/Detail/449832


韓國靑銅器學報 (한국청동기학보)

Vol. 38 (2026.04)
pp. 148-182

한국청동기학회 (Society for Korean Bronze Culture)

Abstract

This study applies AI-based topic modeling (BERTopic), which has emerged as a new research method in the era of big data, to the analysis of archaeological research trends and assesses its practical utility and limitations. Text mining was performed on 75 research papers on Korean Bronze Age stone tools published between 2006 and 2013. Existing big-data trend analyses tend to concentrate on macroscopic tendencies and, in doing so, erase the logical context of individual studies. To overcome this limitation, the dataset was deliberately restricted (a "Controlled Dataset") to a size that permits qualitative expert review, so that the fine-grained validity of the AI results could be checked closely. The results were then compared directly with a research-history review (Son 2013) that examined the scholarship of the same period both quantitatively and qualitatively. The analysis showed that AI was highly effective at rapidly identifying the macroscopic research landscape within a large body of literature, dividing it into ‘typology/chronology-centered’ and ‘production/subsistence-centered’ studies, and at demonstrating numerically the methodological contexts latent behind the text (e.g., natural-scientific analysis). At the microscopic level, however, it revealed three critical limitations: ‘context erasure,’ in which a critical tone of argument goes undetected; ‘factual distortion,’ in which heterogeneous spatiotemporal data are combined mechanically; and ‘absence of valuation,’ in which the qualitative weight of individual studies cannot be distinguished. The author therefore cautions against blind reliance on AI's computational capabilities and proposes an ‘Expert-Mediated Integrated Analysis Model’ in which the researcher's empirical insight complements AI's mechanical objectivity. Under this model, AI is entrusted with primary data processing and mapping, while final authority over ‘Fact Verification,’ ‘Contextual Calibration’ (restoring logical context), and ‘Qualitative Valuation’ (assigning historiographical value) rests with the human researcher. The study concludes that the future of digital archaeology lies not in being overwhelmed by the quantitative expansion of data, but in researchers' professional expertise giving that data scholarly life.
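The division of labor described in the abstract can be sketched as a small review gate. This is only an illustration under stated assumptions, not the author's implementation: the class `TopicAssignment`, the check names in `EXPERT_CHECKS`, and the paper identifiers are hypothetical, and the machine label is a plain string standing in for topic-model output.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional

# The three expert checks named in the abstract (identifiers are hypothetical).
EXPERT_CHECKS = (
    "fact_verification",
    "contextual_calibration",
    "qualitative_valuation",
)


@dataclass
class TopicAssignment:
    """One paper's machine-proposed topic plus the expert's review record."""

    paper_id: str                      # hypothetical identifier
    machine_topic: str                 # label proposed by the topic model
    checks: Dict[str, bool] = field(default_factory=dict)
    expert_topic: Optional[str] = None  # set when the expert overrides the label

    def record(self, check: str, passed: bool) -> None:
        """Store the expert's verdict for one of the three checks."""
        if check not in EXPERT_CHECKS:
            raise ValueError(f"unknown check: {check}")
        self.checks[check] = passed

    def final_topic(self) -> Optional[str]:
        """Return a label only once every expert check has been recorded."""
        if any(name not in self.checks for name in EXPERT_CHECKS):
            return None  # review incomplete: the machine label is not final
        if all(self.checks.values()):
            # Every check passed: keep the machine label unless overridden.
            return self.expert_topic or self.machine_topic
        # At least one check failed: only an expert-supplied label is accepted.
        return self.expert_topic
```

In this sketch the machine label never becomes final on its own: `final_topic()` returns `None` until all three checks are recorded, and a failed check can be resolved only by setting `expert_topic`.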

Keywords

Text Mining, BERTopic, Research Trends, Bronze Age Stone Tools, Digital Archaeology, AI

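‖Contents‖
‖Abstract‖
I. Introduction
II. Research Subject and Methods
    1. Trends in Text Mining Research in the Humanities and Social Sciences
    2. Research Subject and Data Collection
    3. Research Methods
III. Research Trend Analysis Using BERTopic
    1. Analysis Overview and Topic Formation
    2. Interpretation of Results by Topic
    3. Analysis of Inter-Topic Similarity and Distance through Visualization
    4. Time-Series Analysis and Hierarchical Structure
IV. Comparing Traditional Literature Review with AI-based Computational Text Analysis
    1. Agreement in Macroscopic Trends: ‘Correspondence between the Three Major Categories and the Large Groups’
    2. The Light and Shadow of Embedding Technology: ‘Capturing Methodological Context beyond Surface Vocabulary, and Its Limits’
    3. Limits of Contextual Reading: ‘The Keyword Trap and the Erasure of Logical Context’
    4. Errors of Probabilistic Clustering: ‘The Risk of Factual Distortion and the Need for Qualitative Verification’
    5. Absence of Qualitative Evaluation: ‘The Mismatch between Frequency and Value’
V. Proposal of an Expert-Mediated Integrated Analysis Model
    1. Stage 1: Verifying Factual Consistency
    2. Stage 2: Restoring Logical Context
    3. Stage 3: Assigning Historiographical Value
VI. Conclusion
References
Utility and Limitations of AI-based Topic Modeling in Archaeological Research Trend Analysis: A Comparative Study Focusing on Bronze Age Stone Tool Research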

Author

Jin Youngmin (진영민), Research Professor, Institute of Cultural Heritage Convergence, Korea University
