About – SARS-CoV-2 Database

About SARS-CoV-2 databases

Data

The content of SARS-CoV-2 databases has been compiled from several publicly available sequence databases available through INSDC (International Nucleotide Sequence Database Collaboration). Besides, we have used publicly available data from Johns Hopkins University Coronavirus Resource Centre and literature databases such as Europe PMC and PubMed for curation.

Sequence data

The sequence data, including assembly and nucleotide and protein sequences, has been downloaded from ENA. The sequences can be downloaded from the “Download “pages.

Contextual database

The contextual database has been compiled from available metadata parsed from BioProject (Study), BioSample (Sample), Assembly (Analysis) and Run (Experiment) files. The parsed contextual data has been curated to ensure consistency across the database. Phylogenetic linage has been predicted using the Pangolin2.0 (https://github.com/cov-lineages/pangolin).

An API gives programmatic access to the data. For more details, see here.

BLAST database

The BLAST database has been created in-house and can be downloaded from the “Download” pages.

About this website

The SARS-CoV-2 database has been developed by the ELIXIR Norway node at UiT The Arctic University of Norway. The UiT team can be contacted at mmp@uit.no. The SARS-CoV-2 database is a part of the Norwegian COVID-19 Data Portal maintained by ELIXIR Norway.

To get more information about the EMBL-EBI COVID-19 Data Portal and SARS-CoV-2 Data hub just follow the active links.

ELIXIR members provide a range of services for studying the SARS-CoV-2 and the COVID-19 disease that can be accessed either through ELIXIR, the Norwegian COVID-19 Data Portal or the COVID-19 Data Portal.