Genome data from sweet and grain sorghum (<em>Sorghum bicolor</em>).

View Dataset
Zheng, Lei, Ying;Guo, Xiao, Sen;He, Bing;Sun, Lian, Jun;Peng, Yao;Dong, Shanshan;Liu, Teng, Fei;Jiang, Shuye;Ramachandran, Srinivasan;Liu, Chun, Ming;Jing, Hai, Chun

Description

Sorghum is produced globally as a source of food, feed, fiber, and fuel. Grain and sweet sorghums differ in a number of important traits including stem sugar and juice accumulation, plant height, and the production of grain and biomass. The first sorghum whole-genome sequences are now available for analysis, but additional genomic sequences will be required to study genome-wide and intraspecific variation for dissecting the genetic basis of these important traits and for tailor-designed breeding of this important C4 crop.In a joint effort with scientists from the Institute of Botany of Chinese Academy of Sciences (Beijing) and Temasek Life Sciences Laboratory (Singapore), BGI resequenced two sweet and one grain sorghum inbred lines: E-Tian, Ji2731, and Keller. E-Tian (literally meaning Russian Sweet in Chinese) is a sweet sorghum line introduced into China in the early 1970s. Ji2731 is a Chinese kaoliang grain sorghum that is well adapted to Northeast China. Keller is an American-bred elite sweet sorghum line shown to perform well across a wide range of environmental conditions.Using the re-sequencing data, a set of nearly 1,500 genes differentiating sweet and grain sorghum were identified. These genes fall into 10 major metabolic pathways involved in sugar and starch metabolisms, lignin and coumarin biosynthesis, nucleic acid metabolism, stress responses and DNA damage repair. In addition, 1,057,018 SNPs, 99,948 indels of 1-10bp in length and 16,487 presence/absence variations were uncovered, and 17,111 CNVs were detected. The majority of the SNPs, large-effect SNPs, indels and presence/absence variations resided in genes containing leucine rich repeats, PPR repeats and disease resistance R genes possessing diverse biological functions or under diversifying selection, but were absent in genes which are essential for life.This is the first publically available data that allows the identification of genome-wide patterns of genetic variation in sorghum. The high-density SNP and indel markers presented here will be a valuable resource for future genotype and phenotype studies and the molecular breeding of this important crop and for related species.

Citations (9)

Mentions (0)

Metrics

Dataset Index

4.9

FAIR Score

31%

Citations

9

Mentions

0

Metrics Over Time

Publication Details

DOI

Publisher

GigaScience

Assigned Domain

Subfield

Plant Science

Field

Agricultural and Biological Sciences

Domain

Life Sciences

Confidence Score

54%

Source

Scholar Data Model

Keywords

Genomic

Normalization Factors

FT

15.38

CTw

1.00

MTw

1.00