An official website of the United States government
The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before
sharing sensitive information, make sure you’re on a federal
government site.
The site is secure.
The https:// ensures that you are connecting to the
official website and that any information you provide is encrypted
and transmitted securely.
A BioProject is a collection of biological data for a single initiative, originating from a single organization or from a consortium. A BioSample includes descriptive information about the biological source materials relating to experimental assays.
Automatically create these during a sequence data submission or link them during submission.
What You Should Expect
You should register BioProject or BioSamples separately from your data only in the following situations:
Large and long-term projects where samples are collected over a course of year or more
If an NCBI curator instructed you to register a separate BioProject or BioSamples
If you are submitting an annotated genome before submitting the reads or the unannotated genome
A BioProject describes the research effort/study under which you will submit all your sample information and sequence data files.
Provide required information best describing your data:
Data Type
Sample scope
Organism (when scope is is monoisolate or multiisolate) or common taxonomic branch (when scope is multispecies and the species are fairly closely related). Example: Enter "primates" if you are making a comparison of primates from different continents.
BioProject release date (Note: BioProjects are automatically released when public sequence data is linked to them).
Optional fields:
Research grant(s)
BioSample and publication information, if you have them. You will be able to skip these by pressing "Continue."
BioSample has specific required attributes for different types of samples that were collected.
You will choose the one that best describes your samples:
Pathogen affecting public health
Microbe
Model organisms
Metagenome
Plant
Virus
Each BioSample must have metadata that makes it unique from all other submitted samples. The following three attributes do not make a sample unique:
Sample name
Sample title
Description
You can add custom attributes by renaming existing, non-required columns to a relevant attribute.
Submit your sequence data on desktop. The desktop view allows you to easily:
You should only do this if you are instructed by a curator, have a multi year project,
or if you are submitting an annotated genome before submitting the reads or the unannotated genome.
You should only do this if you are instructed by a curator or if you have a multi year project and samples are collected over a long period of time.
Yes, this is the preferred approach to data submission. The exception is if you are submitting an assembled genome with annotation.
GenBank
GenBank is the world's largest nucleotide archive containing sequences from all branches of life. The archive is a foundation for medical and biological discovery.
Submit genomic DNA, organelle, ncRNA, plasmids, other viruses, phages, other mRNA, synthetic constructs.
Submit assembled prokaryotic and eukaryotic genomes.
Sequence Read Archive (SRA)
SRA is the largest publicly-available repository of high throughput sequencing data. The archive accepts data from all branches of life as well as metagenomic and environmental surveys.
Submit unassembled, high throughput sequencing reads