Topics
- What is CottonGen?
- What data and information does CottonGen contain?
- How is CottonGen supported?
- How is CottonGen structured?
- How do I cite CottonGen?
- How can I contact CottonGen?
Data Curation, Submission and Download
- Can I submit my data to CottonGen?
- Can I download data from CottonGen?
- Who should I contact if I think I have found a mistake in the database?
- Who contributes data and/or assistance to CottonGen?
- How can I find what’s new in CottonGen?
- How do I know what’s coming next in CottonGen
- Do I need a CottonGen account to access data?
- What types of markers does CottonGen house?
- Is there a tutorial or help section on how to find marker information?
- What types of maps does CottonGen house?
- Can I view a list of all the maps with detailed information, such as genome, population size, etc?
- Is there a tutorial or help section on how to use the comparative map (CMap) viewer?
- Is there a tutorial or help section on how to use the Genome Browser (GBrowse) viewer?
- What genome data is available on CottonGen GBrowse?
CottonGen BIMS (Breeding Information Management System)
About CottonGen
CottonGen is a cotton community genomics, genetics and breeding database facilitating basic, translational and applied research in cotton. It is built using the open-source Tripal database infrastructure. CottonGen consolidates and expands the data from CottonDB and the Cotton Marker Database, providing enhanced tools for easy querying, visualization and downloading of research data.
The current major data components of CottonGen are:
- Colleagues: contact information of researchers whose published data is included in CottonGen
- Feature sequences (contig, EST, gene, mRNA, region)
- Genome assemblies and annotations
- Genes and transcripts
- Genotype of SNP and SSR Markers
- Germplasms
- Genetic maps
- Genetic markers
- Phenotypic trait evaluations
- Publications of relevant cotton research
- Quantitative Trait Loci (QTL)
More information on searching these data types can be found in the CottonGen User Manual: Data Searches. For details on data numbers, please see the Data Overview Page.
The CottonGen project is funded by:
- Cotton Incorporated
- The USDA-ARS Crop Germplasm Research Unit at College Station, TX
- Bayer CropScience
- Corteva Agriscience
- Monsanto
- The Southern Association of Agricultural Experiment Station Directors
- The USDA funded National Research Support Project 10
CottonGen uses Tripal, a toolkit for construction of online genomic and genetic databases. Tripal provides a web-front end for the Generic Database schema Chado which houses biological data. Tripal also integrates with Drupal, a Content Management System (CMS) that enables non-technical users to provide content, provides user management and social capabilities, and provides site developers with an interface that facilitates easier construction of websites.
If you use CottonGen data or tools in your research, please use the following citation to reference us (not the database URL):
Yu J, Jung S, Cheng CH, Ficklin SP, Lee T, Zheng P, Jones D, Percy R, Main D (2014) CottonGen: a genomics, genetics and breeding database for cotton research. Nucleic Acids Research 42(D1), D1229-D1236.
To cite original data sources, look for the "Publication" link in the left-hand menu of CottonGen feature pages.
- How can I contact CottonGen?
The CottonGen staff can be reached via this contact form, or by selecting "Contact Us" under Help in the header menu.
Data Curation, Submission and Download
We accept data that is peer reviewed via manuscript submission. Data submission templates are available for uploading map, marker and germplasm evaluation data. If you have any questions please contact us using this contact form and we will work with you to input your data. If you have expression or genomic data, please contact us before attempting to submit.
Yes. There are different ways to download data from CottonGen:
- Via the Download page for genome annotations and other large static bulk data sets
- Via the MainLab FTP repository
- CottonGen chado data can be downloaded through CottonGen-related search pages. After completing a search you have the option to download the matching data (refer to the CottonGen User Manual: Data Searches for details).
- CottonGen Map data can be downloaded via the Map Data Summary page: Click on the map name to go the map you want to download, then click on 'Map Loci' and 'Map QTL', then copy and paste to same the map data.
- CottonGen genome alignment data and sequences under a specific chromosome and region can be downloaded via CottonGen JBrowse (refer to this JBrowse video tutorial for details).
Please contact us via this contact form, or by selecting "Contact Us" under Help in the header menu.
CottonGen has several collaborators and many contributors. See our Data Contributor page for details. If you are looking for a specific individual, you can search using their name here.
You can find what has recently been accomplished on the Work Completed page, found under General in the header menu.
You can find work in progress at CottonGen through the Work in Progress page, found under General in the header menu.
CottonGen Data
No, you do not need an account to view publicly available data. CottonGen also houses private breeding data for which breeders need an account to access and transfer.
CottonGen currently contains data on the following marker types:
- Amplified fragment length polymorphism (AFLP)
- Bacterial artificial chromosome end sequences (BAC-end)
- Cleaved amplified polymorphic sequences (CAPS)
- Diversity Arrays Technology Markers (DArT)
- Exon targeted intron-exon splice junction (ET-ISJ)
- Intron targeted intron-exon splice junction (IT-ISJ)
- Insertion or deletion of bases (InDel)
- Isoenzyme
- Morphological markers
- Restriction site associated DNA markers (RAD)
- Random amplification of polymorphic DNA (RAPD)
- Retrotransposon-microsatellite amplified polymorphism (REMAP)
- Restriction fragment length polymorphism (RFLP)
- Resistance gene analog polymorphism (RGAP)
- Reverse transcription (RT)
- Quantitative reverse transcription PCR (RT-qPCR)
- Single-nucleotide polymorphism (SNP)
- Sequence related amplified polymorphism (SRAP)
- Simple sequence repeat (SSR)
- Sequence-tagged site (STS)
- Target region amplification polymorphism (TRAP)
The majority of markers in this database are SNP and SSR. All of the above marker types are searchable from the CottonGen Marker Search. View the Data Overview page for the number of each marker type.
Yes. Please see the CottonGen User Manual: Markers.
CottonGen currently has over 112 maps total, including genetic (many with QTLs), bin, consensus, and in silico maps. These can be viewed through the Map Data Summary page.
The complete list of maps available on CottonGen, including map details, is available on the Map Data Summary page.
Yes. Map Viewer User Manual shows how to use CottonGen Map Viewer and how to use Comparative Map
Yes. Please see this JBrowse video tutorial.
The page CottonGen JBrowse lists all current whole genome sequences. Click on genome name from the list to learn about details of the assembly.
Breeding Information Management System (BIMS)
CottonGen BIMS (Breeding Information Management System) is a Tripal module being developed by Mainlab Bioinformatics which allows users to explore the publically available breeding data in CottonGen while also providing a data management solution for private breeding programs.
The BIMS user manual is available here.