Skip to content

Commit 53d67a2

Browse files
committed
start ncbi genome submission tutorial
1 parent b078a8b commit 53d67a2

36 files changed

+53
-3
lines changed

dataWrangling/NCBI_Data_Submission.md

+1-3
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
---
2-
title: "Introduction to Data Wrangling"
2+
title: "How to submit your sequencing data to NCBI"
33
layout: single
44
author: Rick Masonbrink
55
author_profile: true
@@ -8,8 +8,6 @@ header:
88
overlay_image: /assets/images/dna.jpg
99
---
1010

11-
# How to submit your sequencing data to NCBI
12-
1311
Submitting sequencing data is a requirement for nearly every journal, and thus getting it in order as soon as your sequencing is received will make this process go much faster. This is mostly because the details of your sequencing will be fresh in your mind, and with the variability in BioSample/metadata submissions by type, you may not record sufficient enough detail to submit at a later date.
1412

1513
First you'll have to create an account on ncbi before submission. Here is a current link --10/04/2018
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,52 @@
1+
---
2+
title: "How to submit a genome and gene annotation NCBI"
3+
layout: single
4+
author: Rick Masonbrink
5+
author_profile: true
6+
header:
7+
overlay_color: "444444"
8+
overlay_image: /assets/images/dna.jpg
9+
---
10+
11+
Depositing a genome and its gene annotation into NCBI ensures that it is accessible to the scientific community, is properly archived, and links the relative metadata. Submission typically involves preparing the sequence data, genome, gene annotation, and assembled organelle genomes.
12+
13+
# The most efficient order for submitting a genome to NCBI
14+
1. Create a Bioproject - This serves as an umbrella for your study, linking multiple datasets together.
15+
2. Create Biosamples - This describes each of the biological samples that were acquired for assembling or annotating your genome.
16+
3. Submit sequencing to SRA - Each independent sequencing sample that is not a technical replicate will need to be deposited as a separate file.
17+
4. Submit genome and annotation - Once the information for the previous submissions is complete, you should have all of the information you need to deposit your genome, annotation,assembled organellar genomes, and plasmids (if any).
18+
19+
# Bioproject submission
20+
21+
If you do not have one yet, you will need to create an account with NCBI for the submission. Once that is accomplished you can go to "My submissions" and click on Bioproject at the top left of the window.
22+
23+
The first tab to fill out is the "Submitter tab"
24+
25+
![](assets/NCBI_Genome_and_Annotation_Submission_Assets/BioprojectSubmitter.png)
26+
27+
Since I created this genome and am submitting it, my submitter information is simple. This may become more complex if submitting for a group.
28+
29+
Then you will need to choose the type of data you will be submitting. Here we assembled a genome, so checking this box will give us the required options needed for later genome submission steps later. Then the scope of your sample, which often times for genome submissions the option checked is Monoisolate. We will also be needing locus tag prefixes to submit our annotation in a later step, so check this box.
30+
31+
![](assets/NCBI_Genome_and_Annotation_Submission_Assets/BioprojectProjectType.png)
32+
33+
This takes you to the Target tab, where you will list the genus species of your organism and fill out one of the five options in the second line of boxes. The description is entirely optional, but including as much information as possible will help future users utilize your data in subsequent analyses.
34+
35+
![](assets/NCBI_Genome_and_Annotation_Submission_Assets/BioprojectTarget.png)
36+
37+
Then you will be taken to the General Info tab, where you will specify the release date of your data, add a public description, and the relevance of your genome. You'll have to answer a few questions about external links to your data, grants that funded your genome assembly/annotation, consortium associations and if you are using a data provider.
38+
39+
![](assets/NCBI_Genome_and_Annotation_Submission_Assets/BioprojectGeneralInfo.png)
40+
41+
The next tab is BIOSAMPLE, which is where you will list the previously created biosample accessions that are associated with your sequencing data used to assemble the genome. In this case, I have not created my Biosample yet, so I leave the area blank.
42+
43+
![](assets/NCBI_Genome_and_Annotation_Submission_Assets/BioprojectBiosample.png)
44+
45+
The next tab is PUBLICATIONS, where you would list any composition associated with this genome and annotation.
46+
47+
![](assets/NCBI_Genome_and_Annotation_Submission_Assets/BioprojectPublications.png)
48+
49+
The last tab is REVIEW & SUBMIT. This is your last opportunity to change any of the information in your submission, as after it is accepted you will have to email NCBI to make changes.
50+
51+
![](assets/NCBI_Genome_and_Annotation_Submission_Assets/BioprojectReviewAndSubmit.png)
52+
Loading
Loading
Loading
Loading
Loading
Loading
Loading
Loading
Loading
Loading
Loading
Loading
Loading
Loading
Loading
Loading
Loading
Loading
Loading
Loading
Loading
Loading
Loading
Loading
Loading
Loading
Loading

0 commit comments

Comments
 (0)