Click
here to close Hello! We notice that
you are using Internet Explorer, which is not supported by Echinobase
and may cause the site to display incorrectly. We suggest using a
current version of Chrome,
FireFox,
or Safari.
Nucleic Acids Res
2022 Jan 07;50D1:D970-D979. doi: 10.1093/nar/gkab1005.
Show Gene links
Show Anatomy links
Echinobase: leveraging an extant model organism database to build a knowledgebase supporting research on the genomics and biology of echinoderms.
Arshinoff BI
,
Cary GA
,
Karimi K
,
Foley S
,
Agalakov S
,
Delgado F
,
Lotay VS
,
Ku CJ
,
Pells TJ
,
Beatman TR
,
Kim E
,
Cameron RA
,
Vize PD
,
Telmer CA
,
Croce JC
,
Ettensohn CA
,
Hinman VF
.
Abstract
Echinobase (www.echinobase.org) is a third generation web resource supporting genomic research on echinoderms. The new version was built by cloning the mature Xenopus model organism knowledgebase, Xenbase, refactoring data ingestion pipelines and modifying the user interface to adapt to multispecies echinoderm content. This approach leveraged over 15 years of previous database and web application development to generate a new fully featured informatics resource in a single year. In addition to the software stack, Echinobase uses the private cloud and physical hosts that support Xenbase. Echinobase currently supports six echinoderm species, focused on those used for genomics, developmental biology and gene regulatory network analyses. Over 38 000 gene pages, 18 000 publications, new improved genome assemblies, JBrowse genome browser and BLAST + services are available and supported by the development of a new echinoderm anatomical ontology, uniformly applied formal gene nomenclature, and consistent orthology predictions. A novel feature of Echinobase is integrating support for multiple, disparate species. New genomes from the diverse echinoderm phylum will be added and supported as data becomes available. The common code development design of the integrated knowledgebases ensures parallel improvements as each resource evolves. This approach is widely applicable for developing new model organism informatics resources.
Figure 1. The Echinobase landing page and portal. A central feature of the page is an animated ‘news slider’ that presents a slide show of community news and new features of the resource. A horizontal navigation menu shared between all Echinobase pages provides consistency, and large ‘tiles’ of grouped content, e.g. ‘Genes and Expression’, ‘Genome Browsers’ and ‘BLAST’ aggregate commonly used tools and content.
Figure 2. The Echinobase Gene Page. The top section of the page contains species agnostic information such as gene symbols, names and functions. It also contains synonyms, where legacy symbols and short names are stored and make genes locatable via database searches. The synonyms are also used to link Gene Pages to matched terms in the literature section. Due to the large amount of content aggregated on Gene Pages, much of the data is sorted into ‘tabs’ at the top of the page (orange arrow). Where a tab contains content, this is indicated by a numeral or icon. A transcriptional profile for the gene is also present, currently only for S. purpuratus, and will be expanded to additional species as data becomes available. The species specific content is arranged in vertical columns, each topped by a drop down menu that enables the user to define the content of that column (red arrow). In this example the third column header has been selected, displaying the three options for its content. The ‘+’ symbol highlighted by the green arrow allows additional columns to be displayed. As we will be adding both a fourth species and paralogs to the display options in the near future, the ability to customize the data viewed will become even more important. Where we store and integrate multiple genome versions, for example for S. purpuratus, the user can also select which genome version is displayed as a gene model via a drop-down menu, and which genome version is linked to the Gene Page in JBrowse. This is only the case when we have integrated multiple versions, and this is only the case in some species. As different genome builds (e.g. S. purpuratus v3.1 and v5.0) have different sets of gene models, not all genes will be available in all genome versions. When this happens the number of available genomes will change in the user display. The chromosome maps and gene model exon/intron graphics are generated dynamically from database content by JavaScript, and selecting a new genome version will dynamically display the new gene structure for the selected version. The rocket icon that appears adjacent to many sequence based hyperlinks will load the corresponding sequence into BLAST, while the magnifying glass icon will open the sequence in a display window.
Figure 3. Echinobase literature pages. This screenshot illustrates a paper that has been processed by automated systems only, so all links, identified genes, authors and anatomy terms, figures and legends etc. were added by machine-based methods. The only organism detected by these tools was ‘echinodermata’, and a curator must annotate this page to include the species in the title, Heliocidaris erythrogramma, add additional genes, experimental reagents (morpholinos etc.), curate the gene expression patterns in the figures etc. When this is done all these data will also be displayed on this page. Authors with Echinobase community pages are hyperlinked, as are terms in the abstract and figure legends matching genes/synonyms, anatomy/synonyms etc. When text matches more than one target, selecting the link will display a disambiguation page.
Alliance of Genome Resources Consortium,
The Alliance of Genome Resources: Building a Modern Data Ecosystem for Model Organism Databases.
2019, Pubmed
Alliance of Genome Resources Consortium,
The Alliance of Genome Resources: Building a Modern Data Ecosystem for Model Organism Databases.
2019,
Pubmed
Allot,
LitSuggest: a web-based system for literature recommendation and curation using machine learning.
2021,
Pubmed
Altenhoff,
The OMA orthology database in 2018: retrieving evolutionary relationships among all domains of life through richer web and programmatic interfaces.
2018,
Pubmed
Beatman,
A nomenclature for echinoderm genes.
2021,
Pubmed
,
Echinobase
Bowes,
Xenbase: gene expression and improved integration.
2010,
Pubmed
Camacho,
BLAST+: architecture and applications.
2009,
Pubmed
Cameron,
SpBase: the sea urchin genome database and web site.
2009,
Pubmed
,
Echinobase
Cary,
EchinoBase: Tools for Echinoderm Genome Analyses.
2018,
Pubmed
,
Echinobase
Davidson,
Network design principles from the sea urchin embryo.
2009,
Pubmed
,
Echinobase
Emms,
OrthoFinder: phylogenetic orthology inference for comparative genomics.
2019,
Pubmed
Foley,
Integration of 1:1 orthology maps and updated datasets into Echinobase.
2021,
Pubmed
,
Echinobase
Foltz,
Preface.
2019,
Pubmed
Fortriede,
Xenbase: deep integration of GEO & SRA RNA-seq and ChIP-seq data in a model organism database.
2020,
Pubmed
,
Echinobase
Gene Ontology Consortium,
The Gene Ontology resource: enriching a GOld mine.
2021,
Pubmed
Hu,
An integrative approach to ortholog prediction for disease-focused and other functional studies.
2011,
Pubmed
Hu,
SwiftOrtho: A fast, memory-efficient, multiple genome orthology classifier.
2019,
Pubmed
James-Zorn,
Navigating Xenbase: An Integrated Xenopus Genomics and Gene Expression Database.
2018,
Pubmed
,
Echinobase
Karimi,
Xenbase: a genomic, epigenomic and transcriptomic model organism database.
2018,
Pubmed
,
Echinobase
Karimi,
The Virtual Xenbase: transitioning an online bioinformatics resource to a private cloud.
2014,
Pubmed
,
Echinobase
Karimi,
Classifying domain-specific text documents containing ambiguous keywords.
2021,
Pubmed
,
Echinobase
Lechner,
Proteinortho: detection of (co-)orthologs in large-scale analysis.
2011,
Pubmed
Müller,
Textpresso Central: a customizable platform for searching, text mining, viewing, and curating biomedical literature.
2018,
Pubmed
Nenni,
Xenbase: Facilitating the Use of Xenopus to Model Human Disease.
2019,
Pubmed
,
Echinobase
Ostlund,
InParanoid 7: new algorithms and tools for eukaryotic orthology analysis.
2010,
Pubmed
Remm,
Automatic clustering of orthologs and in-paralogs from pairwise species comparisons.
2001,
Pubmed
Skinner,
JBrowse: a next-generation genome browser.
2009,
Pubmed
Sodergren,
The genome of the sea urchin Strongylocentrotus purpuratus.
2006,
Pubmed
,
Echinobase
Stein,
The generic genome browser: a building block for a model organism system database.
2002,
Pubmed
UniProt Consortium,
UniProt: the universal protein knowledgebase in 2021.
2021,
Pubmed
Vize,
Database and Informatic Challenges in Representing Both Diploid and Tetraploid Xenopus Species in Xenbase.
2015,
Pubmed
,
Echinobase