- What is CottonGen?
- What data and information does CottonGen contain?
- How is CottonGen supported?
- How is CottonGen structured?
- How do I cite CottonGen?
- How can I contact CottonGen?
- Can I submit my data to CottonGen?
- Can I download data from CottonGen?
- Who should I contact if I think I have found a mistake in the database?
- Who contributes data and/or assistance to CottonGen?
- How can I find what’s new in CottonGen?
- How do I know what’s coming next in CottonGen
- Do I need a CottonGen account to access data?
- What types of markers does CottonGen house?
- Is there a tutorial or help section on how to find marker information?
- What types of maps does CottonGen house?
- Can I view a list of all the maps with detailed information, such as genome, population size, etc?
- Is there a tutorial or help section on how to use the comparative map (CMap) viewer?
- Is there a tutorial or help section on how to use the Genome Browser (GBrowse) viewer?
- What genome data is available on CottonGen GBrowse?
CottonGen is a cotton community genomics, genetics and breeding database for facilitating basic, translational and applied research in cotton. It is built using the open-source Tripal database infrastructure. CottonGen consolidates and expands the data from CottonDB and the Cotton Marker Database, providing enhanced tools for easy querying, visualization and downloading of research data.
The current major data components of CottonGen are:
- Colleagues: contact information of researchers whose published data is included in CottonGen
- Feature sequences (contig, EST, gene, mRNA, region)
- Genome assemblies and annotations
- Genes and transcripts
- Genotype of SNP and SSR Markers
- Genetic maps
- Genetic markers
- Phenotypic trait evaluations
- Publications of relevant cotton research
- Quantitative Trait Loci (QTL)
More information on searching these data types can be found in the CottonGen User Manual: Data Searches. For details on data numbers, please see the Data Overview Page.
The CottonGen project is funded by:
- Cotton Incorporated
- The USDA-ARS Crop Germplasm Research Unit at College Station, TX
- Bayer CropScience
- Corteva Agriscience
- The Southern Association of Agricultural Experiment Station Directors
- The USDA funded National Research Support Project 10
CottonGen uses Tripal, a toolkit for construction of online genomic and genetic databases. Tripal provides a web-front end for the Generic Database schema Chado which houses biological data. Tripal also integrates with Drupal, a Content Management System (CMS) that enables non-technical users to provide content, provides user management and social capabilities, and provides site developers with an interface that facilitates easier construction of websites.
If you use CottonGen data or tools in your research, please use the following citation to reference us (not the database URL):
Yu J, Jung S, Cheng CH, Ficklin SP, Lee T, Zheng P, Jones D, Percy R, Main D (2014) CottonGen: a genomics, genetics and breeding database for cotton research. Nucleic Acids Research 42(D1), D1229-D1236.
To cite original data sources, look for the "Publication" link in the left-hand menu of CottonGen feature pages.
- How can I contact CottonGen?
The CottonGen staff can be reached via this contact form, or by selecting "Contact Us" under Help in the header menu.
We accept data that is peer reviewed via manuscript submission. Data submission templates are available for uploading map, marker and germplasm evaluation data. If you have any questions please contact us using this contact form and we will work with you to input your data. If you have expression or genomic data, please contact us before attempting to submit.
Yes. There are different ways to download data from CottonGen:
- Via the Download page for genome annotations and other large static bulk data sets
- Via the MainLab FTP repository
- CottonGen chado data can be downloaded through CottonGen-related search pages. After completing a search you have the option to download the matching data (refer to the CottonGen User Manual: Data Searches for details).
- CottonGen CMap data can be downloaded via the CMap Map Set Info page
- CottonGen genome alignment data and sequences under a specific chromosome and region can be downloaded via CottonGen JBrowse (refer to this JBrowse video tutorial for details).
Please contact us via this contact form, or by selecting "Contact Us" under Help in the header menu.
CottonGen has several collaborators and many contributors. See our Data Contributor page for details. If you are looking for a specific individual, you can search using their name here.
You can find what has recently been accomplished on the Work Completed page, found under General in the header menu.
You can find work in progress at CottonGen through the Work in Progress page, found under General in the header menu.
No, you do not need an account to view publicly available data. CottonGen also houses private breeding data for which breeders need an account to access and transfer.
CottonGen currently contains data on the following marker types:
- Amplified fragment length polymorphism (AFLP)
- Bacterial artificial chromosome end sequences (BAC-end)
- Cleaved amplified polymorphic sequences (CAPS)
- Diversity Arrays Technology Markers (DArT)
- Exon targeted intron-exon splice junction (ET-ISJ)
- Intron targeted intron-exon splice junction (IT-ISJ)
- Insertion or deletion of bases (InDel)
- Morphological markers
- Restriction site associated DNA markers (RAD)
- Random amplification of polymorphic DNA (RAPD)
- Retrotransposon-microsatellite amplified polymorphism (REMAP)
- Restriction fragment length polymorphism (RFLP)
- Resistance gene analog polymorphism (RGAP)
- Reverse transcription (RT)
- Quantitative reverse transcription PCR (RT-qPCR)
- Single-nucleotide polymorphism (SNP)
- Sequence related amplified polymorphism (SRAP)
- Simple sequence repeat (SSR)
- Sequence-tagged site (STS)
- Target region amplification polymorphism (TRAP)
The majority of markers in this database are SNP and SSR. All of the above marker types are searchable from the CottonGen Marker Search. View the Data Overview page for the number of each marker type.
Yes. Please see the CottonGen User Manual: Markers.
CottonGen currently has over 80 maps total, including genetic (many with QTLs), bin, consensus, and in silico maps. These can be viewed through the Map Data Summary page.
The complete list of maps available on CottonGen, including map details, is available on the Map Data Summary page.
Yes. Map Viewer User Manual shows how to use CottonGen Map Viewer and CMap User Tutorial shows how to use Comparative Map (CMap)
Yes. Please see this JBrowse video tutorial.
The following genome assemblies are available on CottonGen JBrowse :
- Gossypium arboreum A2 genome CGP-BGI assembly v2.0 (annotation v1.0)
- Gossypium barbadense AD2 genome HAU-SGI assembly v1.0 (annotation v1.0)
- Gossypium hirsutum AD1 genome Tx-JGI assebly v1.1
- Gossypium hirsutum AD1 genome CGP-BGI assembly v1.0 (annotation v1.0)
- Gossypium hirsutum AD1 genome NAU-NBI assembly v1.1 (annotation v1.0)
- Gossypium raimondii D5 genome JGI assembly v2.0 (annotation v2.1)
- Gossypium raimondii D5 genome CGP-BGI assembly v1.0
- Gossypium arboreum chloroplast (A genome)
- Gossypium barbadense chloroplast (AD genome)
- Gossypium hirsutum chloroplast (AD genome)
- Gossypium raimondii chloroplast (D genome)
CottonGen BIMS (Breeding Information Management System) is a Tripal module being developed by Mainlab Bioinformatics which allows users to explore the publically available breeding data in CottonGen while also providing a data management solution for private breeding programs.
The BIMS user manual is available here.