Click
here to close Hello! We notice that
you are using Internet Explorer, which is not supported by Echinobase
and may cause the site to display incorrectly. We suggest using a
current version of Chrome,
FireFox,
or Safari.
Abstract
Echinobase (https://echinobase.org) is a central online platform that generates, manages and hosts genomic data relevant to echinoderm research. While the resource primarily serves the echinoderm research community, the recent release of an excellent quality genome for the frequently studied purple sea urchin (Strongylocentrotus purpuratus genome, v5.0) has provided an opportunity to adapt to the needs of a broader research community across other model systems. To this end, establishing pipelines to identify orthologous genes between echinoderms and other species has become a priority in many contexts including nomenclature, linking to data in other model organisms, and in internal functionality where data gathered in one hosted species can be associated with genes in other hosted echinoderms. This paper describes the orthology pipelines currently employed by Echinobase and how orthology data are processed to yield 1:1 ortholog mappings between a variety of echinoderms and other model taxa. We also describe functions of interest that have recently been included on the resource, including an updated developmental time course for S.purpuratus, and additional tracks for genome browsing. These data enhancements will increase the accessibility of the resource to non-echinoderm researchers and simultaneously expand the data quality and quantity available to core Echinobase users. Database URL: https://echinobase.org.
Figure 1. (A) Rationale behind taxon choice for orthology analyses at Echinobase. Black arrows represent analyses that have been performed. We have so far used Anneissia japonica for Crinoidea analysis and both Acanthaster planci and Patiria miniata for Asteroidea. Gray arrows show analyses planned for integration (e.g. between S. purpuratus and Mus musculus, Xenopus tropicalis). This demonstrates our use as S. purpuratus as our reference echinoderm, in that orthologs can be laterally inferred between other echinoderms and non-echinoderms via S. purpuratus. 1:1 ortholog counts for intra-echinoderm analyses are expressed as fractions of the total number of genes in the genome of that species. (B) A species tree, showing the phylogenetic positions of different model metazoan taxa relative to echinoderms (box highlighted in yellow). Branch lengths are not drawn to scale. Numbers beside the nodes are estimated times in millions of years which, along with the topology, are adapted from the studies by Cary and Hinman, Delsuc et al. and Dohrmann and Wörheide (34–36). Echinoderms are an extremely diverse group; different classes of echinoderm are distantly related to each other.
Figure 2. A sample gene summary page for foxa1. The updated developmental time course is shown in the top right. The ortholog of the gene in different echinoderms, as predicted by our pipeline, is seen under the ‘Echinobase Gene ID’ section. The gene can be viewed in JBrowse by clicking the links under the ‘Genomic’ section. Additional orthologs to non-echinoderm species as predicted by NCBI and are reported under the ‘Orthology’ section.
Figure 3. A screenshot of the S. purpuratus v5.0 genome browser displaying the labeled 18 hours post-fertilization ATAC-seq peaks and peak scores.
Alliance of Genome Resources Consortium,
Alliance of Genome Resources Portal: unified model organism research platform.
2020, Pubmed
Alliance of Genome Resources Consortium,
Alliance of Genome Resources Portal: unified model organism research platform.
2020,
Pubmed
Altenhoff,
The OMA orthology database in 2018: retrieving evolutionary relationships among all domains of life through richer web and programmatic interfaces.
2018,
Pubmed
Beatman,
A nomenclature for echinoderm genes.
2021,
Pubmed
,
Echinobase
Cary,
Genome-wide use of high- and low-affinity Tbrain transcription factor binding sites during echinoderm development.
2017,
Pubmed
,
Echinobase
Cary,
Analysis of sea star larval regeneration reveals conserved processes of whole-body regeneration across the metazoa.
2019,
Pubmed
,
Echinobase
Cary,
EchinoBase: Tools for Echinoderm Genome Analyses.
2018,
Pubmed
,
Echinobase
Cary,
Echinoderm development and evolution in the post-genomic era.
2017,
Pubmed
,
Echinobase
Chang,
Asymmetric distribution of hypoxia-inducible factor α regulates dorsoventral axis establishment in the early sea urchin embryo.
2017,
Pubmed
,
Echinobase
Delsuc,
A phylogenomic framework and timescale for comparative studies of tunicates.
2018,
Pubmed
Dohrmann,
Dating early animal evolution using phylogenomic data.
2017,
Pubmed
Emms,
OrthoFinder: phylogenetic orthology inference for comparative genomics.
2019,
Pubmed
Fitch,
Distinguishing homologous from analogous proteins.
1970,
Pubmed
Fortriede,
Xenbase: deep integration of GEO & SRA RNA-seq and ChIP-seq data in a model organism database.
2020,
Pubmed
,
Echinobase
Gabaldón,
Functional and evolutionary implications of gene orthology.
2013,
Pubmed
Hu,
An integrative approach to ortholog prediction for disease-focused and other functional studies.
2011,
Pubmed
Hu,
SwiftOrtho: A fast, memory-efficient, multiple genome orthology classifier.
2019,
Pubmed
Huerta-Cepas,
PhylomeDB: a database for genome-wide collections of gene phylogenies.
2008,
Pubmed
Langmead,
Fast gapped-read alignment with Bowtie 2.
2012,
Pubmed
Lechner,
Proteinortho: detection of (co-)orthologs in large-scale analysis.
2011,
Pubmed
Li,
RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome.
2011,
Pubmed
Li,
The Sequence Alignment/Map format and SAMtools.
2009,
Pubmed
Li,
TreeFam: a curated database of phylogenetic trees of animal gene families.
2006,
Pubmed
Li,
OrthoMCL: identification of ortholog groups for eukaryotic genomes.
2003,
Pubmed
Ostlund,
InParanoid 7: new algorithms and tools for eukaryotic orthology analysis.
2010,
Pubmed
Quinlan,
BEDTools: a flexible suite of utilities for comparing genomic features.
2010,
Pubmed
Ramírez,
deepTools: a flexible platform for exploring deep-sequencing data.
2014,
Pubmed
Remm,
Automatic clustering of orthologs and in-paralogs from pairwise species comparisons.
2001,
Pubmed
Sodergren,
The genome of the sea urchin Strongylocentrotus purpuratus.
2006,
Pubmed
,
Echinobase
Sun,
Signal-dependent regulation of the sea urchin skeletogenic gene regulatory network.
2014,
Pubmed
,
Echinobase
Thomas,
PANTHER: a library of protein families and subfamilies indexed by function.
2003,
Pubmed
Tu,
Quantitative developmental transcriptomes of the sea urchin Strongylocentrotus purpuratus.
2014,
Pubmed
,
Echinobase
Wagner,
Measurement of mRNA abundance using RNA-seq data: RPKM measure is inconsistent among samples.
2012,
Pubmed
Zhang,
Model-based analysis of ChIP-Seq (MACS).
2008,
Pubmed