One Major Challenge of Sequencing Large Plant Genomes Is to Know How Big They Really Are

Doležel, J., Čížková, J., Šimková, H., Bartoš, J.

Klíčová slova: flow cytometry; genome size; nuclear DNA content; reference genome assembly; standardization
Abstrakt: Any project seeking to deliver a plant or animal reference genome sequence must address the question as to the completeness of the assembly. Given the complexity introduced particularly by the presence of sequence redundancy, a problem which is especially acute in polyploid genomes, this question is not an easy one to answer. One approach is to use the sequence data, along with the appropriate computational tools, the other is to compare the estimate of genome size with an experimentally measured mass of nuclear DNA. The latter requires a reference standard in order to provide a robust relationship between the two independent measurements of genome size. Here, the proposal is to choose the human male leucocyte genome for this standard: its 1C DNA amount (the amount of DNA contained within unreplicated haploid chromosome set) of 3.50 pg is equivalent to a genome length of 3.423 Gbp, a size which is just 5% longer than predicted by the most current human genome assembly. Adopting this standard, this paper assesses the completeness of the reference genome assemblies of the leading cereal crops species wheat, barley and rye.
DOI: 10.3390/ijms19113554
Fulltext: kontaktujte autory z ÚEB
Autoři z ÚEB: Jan Bartoš, Jana Čížková, Jaroslav Doležel, Hana Šimková