Amplicon pipelines are used to identify bacterial or fungal communities based on specific genes. These workflows support both single-end and paired-end reads.
The Sequence Read Archive (SRA) is a publicly accessible database of raw sequencing data hosted NCBI. It contains high-throughput sequencing data from numerous research projects worldwide, covering a wide range of study types and organisms, including metagenomic and microbiome studies. Researchers deposit their sequencing reads in SRA to make data publicly available in accordance with scientific publication practices, fostering transparency and reproducibility in science. This is typically referenced in the paper with a BioProject number in the Data Availability section.
Why Import SRA Data for Microbiome Analysis?Access to Publicly Available Data: Researchers can leverage publicly available SRA datasets to replicate studies, validate findings, or perform meta-analyses against their own data.Cost Efficiency: Using existing SRA datasets reduces the need for sequencing of new samples when performing secondary analyses or meta-analyses. By importing previously published data, users can rapidly evaluate different datasets or integrate multiple studies without the need to collect and sequence new samples.Extended Contextual Analysis: By analyzing previously published microbiome datasets from SRA, researchers can add context to their own data. This can provide valuable insights by comparing microbiomes across different populations, environments, diseases, or experimental conditions.Data Reuse for Novel Insights: Many research questions can be answered by revisiting older datasets with new hypotheses or more advanced analysis techniques. The Cosmos-Hub allows users to explore data from fresh perspectives using its comprehensive suite of microbiome analysis tools, potentially leading to novel discoveries.Be sure to leverage the SRA Run Selector to export tables of SRA accession IDs as well as relevant metadata. Typically, original manuscripts need to be referenced for primer sequences.
After upload, choose a workflow that matches your data type. The Cosmos-Hub will only show valid options based on your selections.
Do not close your browser while uploading. Progress is shown in the bottom-right. Allow up to 24 hours for processing. Results and statuses will appear in the Cohorts and Metadata section.
Occasionally a sample will fail to upload. Here are a number of possible causes:
Poor internet connection
Internet connection interruption
Browser closed while uploading
Incorrect primer pairs
Corrupted or low quality data files
Other notes:
If a sample fails to upload, you can retry it
If a sample is not in the proper format or fails to run for another reason, the status will show “failed” and you will not be charged credits.