Network Analysis of Disease Relationships Using Large Language Model Embeddings

Yonggwan Shin; Seyoung Kim; Bonggyun Ko

논문 상세보기

Network Analysis of Disease Relationships Using Large Language Model Embeddings KCI 등재

LLM 임베딩 기반 질병 네트워크 분석: 대규모 언어 모델을 활용한 질병 간 연관성 탐구

Yonggwan Shin, Seyoung Kim, Bonggyun Ko

언어KOR
URLhttps://db.koreascholar.com/Article/Detail/440345

구독 기관 인증 시 무료 이용이 가능합니다. 4,000원

한국산업경영시스템학회지 (Journal of Society of Korea Industrial and Systems Engineering)

Vol.48 No.1 (2025.03)
pp.1-8

한국산업경영시스템학회 (Society of Korea Industrial and Systems Engineering)

초록

This study presents a novel methodology for analyzing disease relationships from a network perspective using Large Language Model (LLM) embeddings. We constructed a disease network based on 4,489 diseases from the International Classification of Diseases (ICD-11) using OpenAI’s text-embedding-3-small model. Network analysis revealed that diseases exhibit small-world characteristics with a high clustering coefficient (0.435) and form 16 major communities. Notably, mental health-related diseases showed high centrality in the network, and a clear inverse relationship was observed between community size and internal density. The embedding-based relationship analysis revealed meaningful patterns of disease relationships, suggesting the potential of this methodology as a novel tool for studying disease associations. Results suggest that mental health conditions play a more central role in disease relationships than previously recognized, and disease communities show distinct organizational patterns. This approach shows promise as a valuable tool for exploring large-scale disease relationships and generating new research hypotheses.

키워드

Disease Network Large Language Models Embedding Analysis Network Science Disease Relationships

1. 서 론
2. 연구 방법
    2.1 데이터 수집 및 전처리
    2.2 네트워크 구축
    2.3 네트워크 중심성
3. 연구 결과
    3.1 네트워크의 전역적 특성
    3.2 질병 중심성 분석
    3.3 커뮤니티 구조
    3.4 커뮤니티 분석 및 핵심 질병 식별
    3.5 네트워크 구조 분석
4. 결론 및 고찰
Acknowledgement
References

저자

Yonggwan Shin(R&D Center, XRAI Inc.) | 신용관 (㈜엑스알에이아이 기업부설연구소)
Seyoung Kim(Department of Mathematics and Statistics, Chonnam National University) | 김세영 (전남대학교 수학, 통계학과)
Bonggyun Ko(Department of Mathematics and Statistics, Chonnam National University) | 고봉균 (전남대학교 수학, 통계학과) Corresponding author

같은 권호 다른 논문