논문 상세보기

유전 변이 추출에서 염기 정확도 재보정을 위한 데이터베이스 분석

  • 언어KOR
  • URLhttps://db.koreascholar.com/Article/Detail/360226
모든 회원에게 무료로 제공됩니다.
한국산업경영시스템학회 (Society of Korea Industrial and Systems Engineering)
초록

The base quality score recalibration (BQSR) is an important step in the variant calling from high-throughput sequence data. Motivated by the fact that BQSR necessarily requires a database of known variants such as the dbSNP, we present an extensive analysis on BQSR results for human and rice genome. We showed that the recalibration results depended on the size of the database. The more variants are there in the database, the larger averaged value of the recalibrated quality scores is obtained. This implies that the recalibrated quality score is lower than it should be when the number of variants in the database is not large enough. Based on the finding that the size of the database should play a crucial role in BQSR, we proposed a method to create a database when the size of a database is not large enough for BQSR results to be reliable. We demonstrated that, in the case of human, the database constructed by the proposed method generated almost the same results as the human dbSNP. In the case of rice, however, we showed that the proposed database is more reasonable than the rice dbSNP by illustrating how the proposed method is effective.

저자
  • 김선희(공주대학교 산업시스템공학과)
  • 이동주(공주대학교 산업시스템공학과)
  • 이창용(공주대학교 산업시스템공학과) Corresponding author