GenBank

Use a streamlined submission process to submit the following data types (complete or partial sequences): SARS-CoV-2, Influenza A, B, or C, Norovirus, Dengue, prokaryotic ribosomal RNA (rRNA) and/or ribosomal intergenic spacer (IGS), eukaryotic nuclear rRNA and/or internal transcribed spacer (ITS), organelle rRNA, metazoan (multicellular animal) COX1, and eukaryotic nuclear mRNA.

What You Should Expect

Overview

This submission tool is for the specific data types indicated below.

SARS-CoV-2
Prokaryotic rRNA/IGS
Eukaryotic nuclear rRNA/ITS
Eukaryotic organelle rRNA
Metazoan (multicellular animal) COX1
Eukaryotic nuclear mRNA
Influenza, Norovirus or Dengue virus

Why are certain data types highlighted here?

Prepare the following information for your GenBank submission:

General: contact details, authors, publication, data release date.
Sequencing technology information
FASTA-formatted sequences
Organism name(s)
Source metadata, such as: isolate, strain, collection date, geo_loc_name
Feature Annotation: information about genes, coding regions, and other biologically significant features in the sequence.
- Features are automatically annotated for specific submissions, including SARS-CoV-2, Influenza, Dengue virus, Norovirus, metazoan COX1, rRNA, and rRNA-ITS.
- For eukaryotic mRNA submissions, we ask you to annotate CDS features. You have the flexibility to provide CDS annotations through a web form, a 5-column tab-delimited feature table, or by providing the corresponding protein sequences.

Previous Next

FASTA

Prepare your sequence(s) in the FASTA format that starts with a definition line, followed with a hard return and the sequence. The simplest definition line requires the “> “symbol and a sequence_ID.

Example:

>Seq1
CCTTTAT...
>Seq2
GGTAGGT...

Use only ASCII characters for your definition line and IUPAC codes for your sequences. Upload a FASTA file as a plain-text file (prepared with a text editor). The file may have one or more sequences.

Previous Next

Source

Source modifiers will be requested as part of submission and use a controlled vocabulary to describe how, when, and where you obtained your samples. You can also uniquely identify your samples from the same organism with source modifier such as isolate, clone, strain or specimen voucher.

You will be asked to provide values for certain source modifiers based on your submission type. Additional modifiers will be available to add.

Previous Next

Processing

Your submission will be automatically screened for common problems such as vector or adaptor contamination, chimeras, source problems, and various sequence errors. The types of checks performed will depend on the submission type.

An error report with explanations of the errors will be provided, if applicable. You must fix the errors before accession numbers are assigned. After your errors are corrected, your submission will be reviewed by NCBI for further processing and accession assignment.

Submit your sequence data on desktop. The desktop view allows you to easily:

Enter your information
Enter or upload metadata
Upload large source files
Review your submission

Email me a link to get started

GenBank FAQ

NCBI provides multiple submission tools for different types of sequence data. BankIt can be used to submit most types of data (such as genomic DNA; nuclear or organellar sequences; single genes or complete plasmids/organelles). Submission Portal GenBank has specialized wizards designed to submit specific data types (SARS CoV-2, Influenza, Norovirus, and Dengue; prokaryotic rRNA; organellar rRNA; eukaryotic rRNA-ITS; metazoan COX1; eukaryotic nuclear mRNA), which streamlines the submission process. Work is in progress to accommodate more data types in Submission Portal GenBank as we work towards consolidating and enhancing our submission tools. Similarly, large genomes, transcriptomes, and most unassembled reads are submitted through alternative Submission Portal wizards (Genomes-WGS, TSA, and SRA, respectively). If you have questions about which tool is appropriate for your data type, please contact info@ncbi.nlm.nih.gov.
For the latest submission options, begin by entering keywords describing the data you need to submit in the 'Search' box labeled 'What do you want to submit?' NCBI is in the process of improving the submission experience based on submitter feedback and activity. Some submission tools are being retired or combined to make it easier to submit and simplify requirements.

GenBank

GenBank is the world's largest nucleotide archive containing sequences from all branches of life. The archive is a foundation for medical and biological discovery.

Submit assembled SARS-CoV-2, Influenza, Norovirus, Dengue virus, rRNA, rRNA-ITS, metazoan COX1, Eukaryotic nuclear mRNA sequences.
Learn more Submit
Submit genomic DNA, organelle, ncRNA, plasmids, other viruses, phages, other mRNA, synthetic constructs.
Learn more Submit
Submit assembled eukaryotic and prokaryotic genomes (WGS or Complete).
Learn more Submit

Sequence Read Archive (SRA)

SRA is the largest publicly-available repository of high throughput sequencing data. The archive accepts data from all branches of life as well as metagenomic and environmental surveys.

Submit unassembled, high throughput sequencing reads

SARS-CoV-2 submission instructions

Learn more Submit

Other Tools

TSA

Submit computationally assembled, transcribed RNA sequences after submitting unassembled reads to SRA. Learn more
GEO

Submit RNA-seq, ChIP-seq, and other types of gene expression and epigenomics datasets. Learn more
BioProject & BioSample

Choose a tool above if submitting sequence data. Learn more

Medical Genetics & Variation Tools

Submit clinical data, small & large human genomics variants, and genotype & phenotype data.

Submission Portal

GenBank

What You Should Expect

Overview

Data

FASTA

Source

Processing

Submit

GenBank FAQ

GenBank

Sequence Read Archive (SRA)

Other Tools

TSA

GEO

BioProject & BioSample

Medical Genetics & Variation Tools

Other Resources

Submission Portal

GenBank

What You Should Expect

Overview

Data

FASTA

Source

Processing

Submit

GenBank FAQ

Why are there multiple tool options to submit sequence data to GenBank?

In the past, I submitted data to GenBank on another page. Am I in the right place?

GenBank

Sequence Read Archive (SRA)

Other Tools

Medical Genetics & Variation Tools

Other Resources