Towards the Complete Goat Pan-Genome by Recovering Missing Genomic Segments From the Reference Genome

Li, Ran and Fu, Weiwei and Su, Rui and Tian, Xiaomeng and Du, Duo and Zhao, Yue and Zheng, Zhuqing and Chen, Qiuming and Gao, Shan and Cai, Yudong and Wang, Xihong and Li, Jinquan and Jiang, Yu (2019) Towards the Complete Goat Pan-Genome by Recovering Missing Genomic Segments From the Reference Genome. Frontiers in Genetics, 10. ISSN 1664-8021

[thumbnail of pubmed-zip/versions/1/package-entries/fgene-10-01169.pdf] Text
pubmed-zip/versions/1/package-entries/fgene-10-01169.pdf - Published Version

Download (2MB)

Abstract

It is broadly expected that next generation sequencing will ultimately generate a complete genome as is the latest goat reference genome (ARS1), which is considered to be one of the most continuous assemblies in livestock. However, the rich diversity of worldwide goat breeds indicates that a genome from one individual would be insufficient to represent the whole genomic contents of goats. By comparing nine de novo assemblies from seven sibling species of domestic goat with ARS1 and using resequencing and transcriptome data from goats for verification, we identified a total of 38.3 Mb sequences that were absent in ARS1. The pan-sequences contain genic fractions with considerable expression. Using the pan-genome (ARS1 together with the pan-sequences) as a reference genome, variation calling efficacy can be appreciably improved. A total of 56,657 spurious SNPs per individual were repressed and 24,414 novel SNPs per individual on average were recovered as a result of better reads mapping quality. The transcriptomic mapping rate was also increased by ∼1.15%. Our study demonstrated that comparing de novo assemblies from closely related species is an efficient and reliable strategy for finding missing sequences from the reference genome and could be applicable to other species. Pan-genome can serve as an improved reference genome in animals for a better exploration of the underlying genomic variations and could increase the probability of finding genotype-phenotype associations assessed by a comprehensive variation database containing much more differences between individuals. We have constructed a goat pan-genome web interface for data visualization (http://animal.nwsuaf.edu.cn/panGoat).

Item Type: Article
Subjects: Asian STM > Medical Science
Depositing User: Managing Editor
Date Deposited: 02 Feb 2023 11:02
Last Modified: 30 Dec 2023 13:25
URI: http://journal.send2sub.com/id/eprint/547

Actions (login required)

View Item
View Item