Maximizing Utility of Community-based Resources: CottonSNP Arrays

Working group session: 
Comparative Genomics and Bioinformatics
Presentation type: 
15 minute Oral
Hulse-Kemp, Amanda M
Yu, Jing
Scheffler, Jodi
Main, Dorrie
Scheffler, Brian
Author Affliation: 
USDA-ARS, Genomics and Bioinformatics Research Unit, Raleigh, NC; North Carolina State University, Raleigh, NC;
Washington State University, Pullman, WA
USDA-ARS, Crop Genetics Research Unit, Stoneville, MS
Washington State University, Pullman, WA
USDA-ARS, Genomics and Bioinformatics Research Unit, Stoneville, MS
The amount of data being produced and stored in databases has increased exponentially as the speed of data generation continues to increase. Utilization of data sets is dependent on excellent data stewardship and management by the crop community. Initiation of these practices has been suggested and monitored by the CottonGen database, but the potential utility of the database through inclusion of samples ultimately falls on the individual researchers within the community. Arrays are specifically dependent, as their utilization in communities (including the cotton community) has been steadily increasing but they fall outside of traditional data required for input into a standardized database such as NCBI. As collection of array data from the Cotton SNP63K (public) and other arrays (CottonSNP80K) increases, utility of the collected data can be maximized in the community by good data stewardship and a community-based standard for inclusion of both raw and processed data into the public domain for maximizing utility of the community-based resource. We will discuss data stewardship and good practices for inclusion and utilization of array based data in coordination with the CottonGen database.