With the development of next generation sequencing (NGS) technology, the variation of sequences represented as SNP between cultivars becomes available at genome level. The major domestic cultivars with high yield have been developed by breeding of indica and japonica, it is important to localize the region of origin according to the genotype for further characterization of unique features of cultivars. For the localization of SNP at genome level, the paired end sequences of 6 major domestic rice cultivars, Ilmi, Ilpoom, Sulgaeng, Bakjinju, Hwayoung and Woonkwang were compared against Japonica and Indica Rice Genomes as reference genomes. The genomic DNAs were prepared from callus tissues and paired-end of the fragments were sequenced with NGS Sequencer, Illumina HISeq. About 50x coverage of paired-end sequences were trimmed according to the quality of the sequences, and errors were corrected with statistical analysis of kmers of 15. The trim-corrected sequences were mapped and variants were analyzed against reference genomes. The overall change rate of Ilmi against Nipponbare IRGSP 1.0 and Indica BGI 93-11 reference genomes were 0.92 base/1kb (1/1,079 base) and 8.09 base/1kb (1 base/123 bases), respectively. Among 6 cultivars, overall rate of Bakjinju showed the lowest overall change rate of 0,53 base/1kb, and Hwayoung showed highest frequency of 0.92 base/1kb. Compared to high level in the range of change rate of 7.0-9.3 base/1kb against indica, domestic cultivars showed lower range of change rate 0.2-3.3 base/1kb with unique local high peak against japonica genome depend on the chromosomes. Compared to assembly of genome sequences, the variation of nucleotides compared to reference sequences is much faster and simple to characterize the genotype. The types of variation and the effect on functional categories will be presented.
Rice is one of the most important crop in the world and the genome sequences of a rice cultivar, Nipponbare has been used not only for rice research, but also as the model reference genome sequences in monocotyledon species among the crops. With the development of the next generation sequencing(NGS) techniques producing high order of coverage and with the need for epigenomic analysis, the need for resequencing of the domestic rice cultivars with the reference “Nipponbare” genome sequence at the de novo level.
According the simulation of Nipponbare reference genomes with Eulerian methods, the target size of maximal contig and N50 of ones of the domestic cutivar rice were estimated to be 10M and 1M, respectively. To achieve the target size, various mate-paired libraries were constructed and sequenced. With the high order of coverage obtained from domestic rice cultivar, Ilmi, with NGS technology, the effect of trimming and error correction on reads quality profiles and size distribution of contigs were analyzed. Also the computational parameters for validation of assembled contigs were analyzed.
For the analysis of epigenomic methylation modification of the genome sequences, the methyl binding microarray technology was developed. The various methyl-CG binding proteins were characterized. The binding and scanning of methyl-CG domain to promoter binding microarray and their downstream genes were analyzed. Also rice mutants related to methylation were selected to understand the effect of methylation on gene expresson and its effect on phenotypes.