This file has suggested templates for:
Copy/paste what you need into a file in the appropriate subdirectory with a file name like this display name---YYYY-MM-DD.md
- create the template here
create templates here for
- genomes
- variation data
- pathways
- expression data
- tools
Sorghum bicolor (L.) Moench subsp. bicolor, is a widely grown cereal crop, particularly in Africa, ranking 5th in global cereal production (FAOSTAT 2008; http://www.fao.org/in-action/inpho/crop-compendium/cereals-grains/). It is a C4 grass also used for sugar production, brewing, feedstock, and as a biofuel crop. Its diploid genome (~730 Mbp) has a haploid chromosome number of 10. The inbred variety ‘BTx623’ is the current reference genome for sorghum. It has short stature and an early maturing genotype used primarily to produce grain sorghum hybrids. It is a line susceptible to sugarcane aphid and sensitive to low nitrogen, and therefore often used in functional comparative studies.
BTx623 (GRIN: PI 564163) is part of the following population panels:
- Sorghum Association Panel (SAP) - 407 accessions (Casa et al, 2008)
- Sorghum Bioenergy Association Panel (BAP) - 386 accessions (Brenton et al, 2016)
The genome assembly of Sorghum bicolor cv. Moench was published in 2009 (Paterson et al, 2009). The present assembly corresponds to v3.1.1 at the US Department of Energy Joint Genome Institute (JGI) described in (McCormick et al, 2018), and is also known as the NCBIv3 assembly. Sequencing by the JGI's Community Sequencing Program in collaboration with the Plant Genome Mapping Laboratory at the University of Georgia, followed a whole-genome shotgun strategy reaching 8X coverage with scaffolds -where possible- being assigned to the genetic map. JGI did two additional rounds of improvements. The most recent update of release v3.0 included ~351 Mb of finished sorghum sequence. A total of 349 clones were manually inspected, then finished and validated using a variety of technologies including Sanger, 454 and Illumina. They were integrated into chromosomes by aligning to v1.0 assembly. As a result, 4,426 gaps were closed, and a total of 4.96 Mb of sequence was added to the assembly. Overall contiguity (contig N50) increased by a factor of 5.8X from 204.5 Kb to 1.2 Mb. For more details, see Phytozome.
NCBI accession: GCA_000003195.3
Gene predictions resulted from combining homology-based and ab initio methods with expressed sequences from sorghum, maize and sugarcane, using the JGI annotation pipeline (Goodstein et al, 2012). The SorghumBase browser presents data from the current JGI v3.1.1 release, which comprises the v3.0.1 assembly and v3.1.1 gene set (February 2017). Read more at Phytozome.
This is a modern annotation using resources used in the original v1.0 release (Sbi1 assembly and Sbi1.4 gene set) and geneAtlas RNA-seq data. The main genome is in 10 chromosomes with small unmapped pieces, some of which contain annotated genes. The NCBIv3 release (Phytozome v3.1.1) is essentially the same as Phytozome v3.1 except for 82 genes/loci that were inactivated due to 4 scaffolds entirely present in chromosome(s) that were removed.
Assembly information | |
---|---|
Assembly name | Sorghum_bicolor_NCBIv3 |
Assembly date | June 2017 |
Assembly accession | GCA_000003195.3 |
WGS accession | ABXC00000000 |
Assembly provider | |
Sequencing description | |
Sequencing technologies | Sanger; Illumina |
Sequencing method | |
Genome coverage | 8x |
Assembly description | ARACHNE_modified v. 200721016 |
Publication | Paterson et al (2009); McCormick et al (2018) |
Repeats were annotated with the Ensembl Genomes repeat feature pipeline (Aken et al, 2016), which uses six classes of repeats loaded from ENA.
Repeat feature | Frequency | Coverage (Mb) | % of the genome covered |
---|---|---|---|
Low complexity (Dust) features | 685,783 | 29 | 4 |
RepeatMasker (with RepBase library) | 455,749 | 451 | 62.1 |
RepeatMasker (with REdat library) | 392,778 | 409 | 56.2 |
Tandem repeat (TRF) features | 245,654 | 41 | 5.7 |
Genome-wide epigenetic marks related to root system morphology and architecture in sorghum BTx623 deposited under BioProject PRJNA454504 (Gladman et al, manuscript in prep.). Three repetitions available for each of seven samples: leaf lower (vegetative whorl), leaf upper (growing anthesis), panicle (upper anthesis), root bottom (anthesis), root top (vegetative), stem 1 cm (vegetative) and stem mid internode (anthesis).
For an example of two samples with very similar patterns of epigenetic variation for the msd2 gene region, click here.
Baseline gene expression data from seven sorghum BTx623 datasets curated and processed by the EMBL-EBI Expression Atlas (Emms et al, 2016; Makita et al, 2015; Davidson et al, 2012; Turco et al, 2017; Wang et al, 2018; Olson et al, 2014) and BioProject PRJNA293229 SRP062564 by Thurber et al (2015).
Click here for an example of baseline gene expression for the msd2 gene.
Variation in SorghumBase is available for short variants (genetic variation, which in turn may be naturally occurring or chemically induced), longer than 50 nucleotides (structural variants) and QTL variants associated with physical traits.
Genetic variation data for a sorghum gene is available graphically and in tabular form, and for each variant, a Variant page provides more detailed information. Below are examples of each of these data representations.
- Image of genetic variants mapping to the smaller transcript of the msd2 gene.
- Table of the genetic variants mapping to the smaller transcript of the msd2 gene.
- Summary page of a chemically induced variant (tmp_6_47487513_G_A).
Single Nucleotide Polymorphisms (SNPs). Currently in SorghumBase, there are two SNPs data sets for sorghum BTx623:
- The Morris SNP dataset (Morris et al, 2013) consists of ~265,000 SNPs genotyped in 378 accessions from the US sorghum association panel (SAP).
- The Mace SNP dataset (Mace et al, 2013) includes almost 6.5 million SNPs genotyped in 45 Sorghum bicolor lines including the BTx623 reference genome plus 2 S. propinquum lines reported by Mace and colleagues (Mace et al, 2013). The data were obtained by resequencing the genomes of the 44 S. bicolor lines representing the primary gene pool and spanning dimensions of geographic origin, end-use and taxonomic group (i.e., major races of cultivated S. bicolor, landraces, improved inbreds, progenitors, wild and weedy), and the first resequenced genome of S. propinquum, all of which were mapped to the BTx623 S. bicolor reference genome.
Ethyl methanesulfonate (EMS)-induced mutations. Currently in SorghumBase, there is one collection of EMS-induced mutant lines. EMS is a chemical commonly used to cause point mutations, that is, to change single nucleotides in the DNA of a plant seed.
- The Xin EMS dataset (Jiao et al, 2016) includes almost 1.5 million EMS-induced G/C to A/T transition mutations annotated from 252 M3 families selected from the 6,400 sorghum mutant library in BTx623 background described by Xin and colleagues (Xin et al, 2008). Genomic DNA used for sequencing was pooled from 20 M3 plants per M2 family (Jiao et al, 2016).
Data for structural variation for the sorghum BTx623 reference has been imported from the Database of Genomic Variants Archive (dGVA) from a single study containing around 28 thousand structural variations described by (Zheng et al, 2011). Here is an example.
Quantitative Trait Locus (QTLs). Data corresponding to 5,843 QTL features for 220 sorghum traits were imported from Sorghum QTL Atlas and are provided with predicted syntenic locations in maize and rice.
Example region with QTLs associated with multiple traits including greenbug resistance, fresh biomass, and flag leaf height. Hint: For additional regions with QTL data in the current sorghum assembly (v.3), use the physical or genetic (cM) coordinates kindly provided by the Sorghum QTL Atlas team.
Aken, Bronwen L., Sarah Ayling, Daniel Barrell, Laura Clarke, Valery Curwen, Susan Fairley, Julio Fernandez Banet, et al. 2016. "The Ensembl Gene Annotation System." Database: The Journal of Biological Databases and Curation. PMID: 27337980. https://doi.org/10.1093/database/baw093.
Brenton, Zachary W., Elizabeth A. Cooper, Mathew T. Myers, Richard E. Boyles, Nadia Shakoor, Kelsey J. Zielinski, Bradley L. Rauh, William C. Bridges, Geoffrey P. Morris, and Stephen Kresovich. 2016. "A Genomic Resource for the Development, Improvement, and Exploitation of Sorghum for Bioenergy." Genetics 204 (1): 21–33. PMID: 27356613. https://doi.org/10.1534/genetics.115.183947.
Casa, Alexandra M., Gael Pressoir, Patrick J. Brown, Sharon E. Mitchell, William L. Rooney, Mitchell R. Tuinstra, Cleve D. Franks, and Stephen Kresovich. 2008. "Community Resources and Strategies for Association Mapping in Sorghum." Crop Science 48 (1): 30–40. https://doi.org/10.2135/cropsci2007.02.0080.
Davidson, Rebecca M., Malali Gowda, Gaurav Moghe, Haining Lin, Brieanne Vaillancourt, Shin-Han Shiu, Ning Jiang, and C. Robin Buell. 2012. "Comparative Transcriptomics of Three Poaceae Species Reveals Patterns of Gene Expression Evolution." The Plant Journal: For Cell and Molecular Biology 71 (3): 492–502. PMID: 22443345. https://doi.org/10.1111/j.1365-313X.2012.05005.x.
Emms, David M., Sarah Covshoff, Julian M. Hibberd, and Steven Kelly. 2016. "Independent and Parallel Evolution of New Genes by Gene Duplication in Two Origins of C4 Photosynthesis Provides New Insight into the Mechanism of Phloem Loading in C4 Species." Molecular Biology and Evolution 33 (7): 1796–1806. PMID: 27016024. https://doi.org/10.1093/molbev/msw057.
Goodstein, David M., Shengqiang Shu, Russell Howson, Rochak Neupane, Richard D. Hayes, Joni Fazo, Therese Mitros, et al. 2012. "Phytozome: A Comparative Platform for Green Plant Genomics." Nucleic Acids Research 40 (Database issue): D1178–86. PMID: 22110026. https://doi.org/10.1093/nar/gkr944.
Jiao, Yinping, John J. Burke, Ratan Chopra, Gloria Burow, Junping Chen, Bo Wang, Chad Hayes, Yves Emendack, Doreen Ware, and Zhanguo Xin. 2016. "A Sorghum Mutant Resource as an Efficient Platform for Gene Discovery in Grasses." The Plant Cell. PMID: 27354556. https://doi.org/10.1105/tpc.16.00373.
McCormick, Ryan F., Sandra K. Truong, Avinash Sreedasyam, Jerry Jenkins, Shengqiang Shu, David Sims, Megan Kennedy, et al. 2018. "The Sorghum Bicolor Reference Genome: Improved Assembly, Gene Annotations, a Transcriptome Atlas, and Signatures of Genome Organization." The Plant Journal: For Cell and Molecular Biology 93 (2): 338–54. PMID: 29161754. https://doi.org/10.1111/tpj.13781.
Mace, Emma S., Shuaishuai Tai, Edward K. Gilding, Yanhong Li, Peter J. Prentis, Lianle Bian, Bradley C. Campbell, et al. 2013. "Whole-Genome Sequencing Reveals Untapped Genetic Potential in Africa’s Indigenous Cereal Crop Sorghum." Nature Communications 4: 2320. PMID: 23982223. http://doi.org/10.1038/ncomms3320.
Makita, Yuko, Setsuko Shimada, Mika Kawashima, Tomoko Kondou-Kuriyama, Tetsuro Toyoda, and Minami Matsui. 2015. "MOROKOSHI: Transcriptome Database in Sorghum Bicolor." Plant & Cell Physiology 56 (1): e6. PMID: 25505007. https://doi.org/10.1093/pcp/pcu187.
Morris, Geoffrey P., Punna Ramu, Santosh P. Deshpande, C. Thomas Hash, Trushar Shah, Hari D. Upadhyaya, Oscar Riera-Lizarazu, et al. 2013. "Population Genomic and Genome-Wide Association Studies of Agroclimatic Traits in Sorghum." Proceedings of the National Academy of Sciences of the United States of America 110 (2): 453–58. PMID: 23267105. https://doi.org/10.1073/pnas.1215985110.
Olson, Andrew, Robert R. Klein, Diana V. Dugas, Zhenyuan Lu, Michael Regulski, Patricia E. Klein, and Doreen Ware. 2014. "Expanding and Vetting Sorghum Bicolor Gene Annotations through Transcriptome and Methylome Sequencing." The Plant Genome 7 (2): plantgenome2013.08.0025. https://doi.org/10.3835/plantgenome2013.08.0025.
Paterson, A. H., J. E. Bowers, R. Bruggmann, I. Dubchak, J. Grimwood, H. Gundlach, G. Haberer, et al. 2009. "The Sorghum Bicolor Genome and the Diversification of Grasses." Nature 457 (7229): 551–56. PMID: 19189423. https://doi.org/10.1038/nature07723.
Turco, Gina M., Kaisa Kajala, Govindarajan Kunde-Ramamoorthy, Chew-Yee Ngan, Andrew Olson, Shweta Deshphande, Denis Tolkunov, et al. 2017. "DNA Methylation and Gene Expression Regulation Associated with Vascularization in Sorghum Bicolor." The New Phytologist 214 (3): 1213–29. PMID: 28186631. https://doi.org/10.1111/nph.14448.
Xin, Zhanguo, Ming Li Wang, Noelle A. Barkley, Gloria Burow, Cleve Franks, Gary Pederson, and John Burke. 2008. "Applying Genotyping (TILLING) and Phenotyping Analyses to Elucidate Gene Function in a Chemically Induced Sorghum Mutant Population." BMC Plant Biology. PMID: 18854043. https://doi.org/10.1186/1471-2229-8-103.
Wang, Bo, Michael Regulski, Elizabeth Tseng, Andrew Olson, Sara Goodwin, W. Richard McCombie, and Doreen Ware. 2018. "A Comparative Transcriptional Landscape of Maize and Sorghum Obtained by Single-Molecule Sequencing." Genome Research 28 (6): 921–32. PMID: 29712755 https://doi.org/10.1101/gr.227462.117.
Zheng, Lei-Ying, Xiao-Sen Guo, Bing He, Lian-Jun Sun, Yao Peng, Shan-Shan Dong, Teng-Fei Liu, et al. 2011. "Genome-Wide Patterns of Genetic Variation in Sweet and Grain Sorghum (Sorghum Bicolor)." Genome Biology 12 (11): R114. PMID: 22104744. http://dx.doi.org/10.1186/gb-2011-12-11-r114.
Image source: The GRIN database
General information about this species can be found in Wikipedia