The Genome of a Diploid Cotton Gossypium raimondii

Working group session: 
Comparative Genomics and Bioinformatics
Presentation type: 
oral
Authors: 
Yu, Shuxun; Yu, Shuxun
Presenter: 
Yu, Shuxun; Yu, Shuxun
Correspondent: 
Yu, Shuxun; Yu, Shuxun
Abstract: 
We sequenced and assembled the draft genome of Gossypium raimondii, which is widely known as a contributor of the D-subgenome to the economically important natural textile fiber producer, G. hirsutum. Next-generation Illumina pair-end (PE) sequencing strategies were performed to obtain 103.6-fold genome coverage of cleaned DNA sequence from various shotgun libraries with insert sizes ranging from 170 bp to 40 kbp. Over 73% of the assembled sequences were anchored on 13 G. raimondii chromosomes or linkage groups. The genome was predicted to contain 40,976 protein-coding genes with 92.2% of them further confirmed by transcriptome data. We observed two whole genome duplication (WGD) events, one occurred at about 56~63 and the other at about 13~20 million years ago, and identified 2355 synteny blocks in the G. raimondii genome. About 40% of the gene models are present in more than one block, suggesting that the G. raimondii genome has undergone substantial chromosome rearrangements. Nearly 57% of the genome is composed of transposable elements, most of which may come from the expansion of long terminal repeats (LTRs) since 4 MYA until now. The G. raimondii genome not only provides a major source of candidate genes for cotton research, but also it may serve as a platform for assembly of the tetraploid G. hirsutum genome.