Cyanohub BETA
Banner image credit: NOAA-GLERL

Welcome to the Cyanohub Database Home

This is a placeholder for a searchable database of algal organisms and their genes, collected by scientists from the University of Michigan, the NOAA Great Lakes Environmental Research Lab, and other research organizations and agencies.

Our database is being developed in Neo4j, a graph database. Storing organisms, genes, and other observed or sampled algae-related life forms as connected nodes gives us insight into potential relationships between entities that seem unrelated.
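
To make the graph idea concrete, here is a minimal sketch in plain Python rather than Neo4j itself. It treats strains and genes as nodes joined by "has gene" edges, then walks strain -> gene -> strain to find strains linked through shared genes. The strain names come from the table on this page; the gene identifiers, the edges, and the function names are hypothetical and do not reflect the actual Cyanohub schema.

```python
from collections import defaultdict

# Hypothetical toy graph: strain names appear on this page, but the gene
# identifiers and the strain-gene edges are invented for illustration.
HAS_GENE = [
    ("PCC_7806", "gene_A"),
    ("PCC_7806", "gene_B"),
    ("NIES-843", "gene_B"),
    ("NIES-843", "gene_C"),
    ("PCC_7005", "gene_C"),
]

def build_index(edges):
    """Index the edge list in both directions: strain -> genes and gene -> strains."""
    genes_of = defaultdict(set)
    strains_with = defaultdict(set)
    for strain, gene in edges:
        genes_of[strain].add(gene)
        strains_with[gene].add(strain)
    return genes_of, strains_with

def related_strains(strain, genes_of, strains_with):
    """Walk strain -> gene -> strain and collect the genes that link each pair."""
    links = defaultdict(set)
    for gene in genes_of[strain]:
        for other in strains_with[gene]:
            if other != strain:
                links[other].add(gene)
    return dict(links)

genes_of, strains_with = build_index(HAS_GENE)
print(related_strains("NIES-843", genes_of, strains_with))
```

In this toy data, NIES-843 is linked to PCC_7806 through gene_B and to PCC_7005 through gene_C, even though those two strains share no gene with each other directly.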

The database is intended to be a single location where scientists can get cyanobacteria- and Great Lakes-specific genomic information. This includes, but is not limited to, links between specific cyanobacteria strains, their genes, and environmental metadata; links between different strains based on gene content and expression; and the ability to get sequences from identified genes or relationships. Users are anticipated to begin a search of the database with one of two main queries:

From these points, users would be able to refine their search or produce data tables of the returned strain-gene-metadata relationships. Identified genes would also link to additional databases for obtaining sequences, such as NCBI (National Center for Biotechnology Information) and JGI-IMG (Joint Genome Institute Integrated Microbial Genomes system).
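
As a rough illustration of a search that returns a table of strain-gene-metadata relationships, the sketch below filters a small in-memory record set and emits one row per match. The entry points used here (a strain name or a gene identifier) are assumptions, since the page does not list the two main queries. Country and toxicity values are copied from the strain table on this page; the gene identifiers and the `search` function are hypothetical.

```python
# Hypothetical records: country and toxicity are copied from the strain table
# on this page; the gene identifiers are placeholders, not real annotations.
STRAINS = {
    "PCC_7806": {"country": "NETHERLANDS", "toxic": "Yes", "genes": {"gene_A", "gene_B"}},
    "NIES-843": {"country": "JAPAN", "toxic": "Yes", "genes": {"gene_B", "gene_C"}},
    "PCC_7005": {"country": "USA", "toxic": "No", "genes": {"gene_C"}},
}

def search(strain=None, gene=None):
    """Return sorted (strain, gene, country, toxic) rows matching the given filters."""
    rows = []
    for name, record in STRAINS.items():
        if strain is not None and name != strain:
            continue
        for g in record["genes"]:
            if gene is not None and g != gene:
                continue
            rows.append((name, g, record["country"], record["toxic"]))
    return sorted(rows)

# Start from a gene, then print the result as a tab-separated table.
for row in search(gene="gene_B"):
    print("\t".join(row))
```

Refining a search then amounts to passing both filters, for example `search(strain="NIES-843", gene="gene_B")`.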

This database will be markedly different from current sites like NCBI and IMG, which act more as repositories for sequence data and, while searchable, provide little to no information about cross-strain relationships or the metadata behind them. Our database is built around gene-content-specific relationships between multiple strains of cyanobacteria, and it can incorporate newly sequenced genomes (from additional environments, for example) over time. Existing data and relationships will be preserved as information is added, so the database can expand without intensive computation or a rebuild of the entire database each time new genomes become available.
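
The incremental update described above can be sketched as a merge step that only adds what is missing and leaves existing entries untouched. This is an in-memory Python stand-in, not the Cyanohub implementation: the `graph` dictionary, the `add_genome` function, and the gene identifiers are hypothetical. Neo4j's Cypher `MERGE` clause offers comparable match-or-create behavior, though the page does not say whether Cyanohub uses it.

```python
# Hypothetical in-memory stand-in for the graph: strain -> set of gene identifiers.
# Gene identifiers are placeholders; only the strain names come from this page.
graph = {
    "PCC_7806": {"gene_A", "gene_B"},
    "NIES-843": {"gene_B", "gene_C"},
}

def add_genome(graph, strain, genes):
    """Merge one genome into the graph and return only the edges that were new."""
    existing = graph.setdefault(strain, set())
    new_edges = set(genes) - existing
    existing |= new_edges  # in-place update; other strains are never touched
    return new_edges

# Adding a new strain creates its node and edges.
added = add_genome(graph, "LE3", ["gene_B", "gene_D"])
print(sorted(added))

# Re-adding the same genome is a no-op, so nothing needs to be rebuilt.
repeat = add_genome(graph, "LE3", ["gene_B", "gene_D"])
print(sorted(repeat))
```

Because the merge touches only the incoming strain, the cost of an update scales with the size of the new genome rather than with the size of the whole database.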

Displayed below is a table of known strains currently in our database, along with a location where each was sampled.

Table of cultured Microcystis strains

Strain Name   Country                    Toxic   Genome Size (Mb)   NCBI Tax. ID   No. Genes (documented)
PCC_7005      USA                        No      4.9                267870         5081
PCC_7806      NETHERLANDS                Yes     5.3                267872         5169
PCC_9432      CANADA                     No      5                  1160280        4952
PCC_7941      CANADA                     Yes     4.8                213618         4646
PCC_9443      CENTRAL AFRICAN REPUBLIC   Yes     5.1                1160281        5139
PCC_9701      FRANCE                     No      4.7                721123         4673
PCC_9717      FRANCE                     Yes     5.2                1160286        5234
PCC_9806      USA                        No      4.2                1160282        4127
PCC_9807      SOUTH AFRICA               Yes     5.1                1160283        5104
PCC_9808      AUSTRALIA                  Yes     5                  1160284        5040
PCC_9809      USA                        Yes     4.9                1160285        5043
NIES-44       JAPAN                      No      4.6                449439         4348
NIES-843      JAPAN                      Yes     5.8                449447         5605
TAIHU98       CHINA                      No      4.9                1134457        4735
DIANCHI905    CHINA                      Yes     4.9                1235808        4775
LE3           USA                        Yes     4.6                               4383
LSC13-02      USA                        Yes     4.3                               4181
LE013-01      USA                        No      4                                 3864
SPC777        BRAZIL                     Yes     5.5                482300         5458
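
A table like this lends itself to simple field-based filtering. The sketch below is one possible way to express such a filter in Python; the `filter_strains` function and its parameter names are hypothetical, and only a handful of rows are copied from the table above.

```python
# Rows copied from the strain table above: (strain, country, toxic, genome size in Mb).
ROWS = [
    ("PCC_7005", "USA", "No", 4.9),
    ("PCC_7806", "NETHERLANDS", "Yes", 5.3),
    ("NIES-843", "JAPAN", "Yes", 5.8),
    ("LE3", "USA", "Yes", 4.6),
    ("SPC777", "BRAZIL", "Yes", 5.5),
]

def filter_strains(rows, toxic=None, country=None, min_size_mb=None):
    """Return strain names whose fields match every filter that is not None."""
    matches = []
    for strain, row_country, row_toxic, size_mb in rows:
        if toxic is not None and row_toxic != toxic:
            continue
        if country is not None and row_country != country:
            continue
        if min_size_mb is not None and size_mb < min_size_mb:
            continue
        matches.append(strain)
    return matches

# Toxic strains with a genome of at least 5 Mb.
print(filter_strains(ROWS, toxic="Yes", min_size_mb=5.0))
```

Leaving a filter as `None` means "do not constrain this field", so the same function covers both broad and narrow searches.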
