Click
here to close Hello! We notice that
you are using Internet Explorer, which is not supported by Echinobase
and may cause the site to display incorrectly. We suggest using a
current version of Chrome,
FireFox,
or Safari.
Abstract
Xenbase (www.xenbase.org) is a knowledge base for researchers and biomedical scientists that employ the amphibian Xenopus as a model organism in biomedical research to gain a deeper understanding of developmental and disease processes. Through expert curation and automated data provisioning from various sources Xenbase strives to integrate the body of knowledge on Xenopus genomics and biology together with the visualization of biologically significant interactions. Most current studies utilize next generation sequencing (NGS) but until now the results of different experiments were difficult to compare and not integrated with other Xenbase content. Xenbase has developed a suite of tools, interfaces and data processing pipelines that transforms NCBI Gene Expression Omnibus (GEO) NGS content into deeply integrated gene expression and chromatin data, mapping all aligned reads to the most recent genome builds. This content can be queried and visualized via multiple tools and also provides the basis for future automated ''gene expression as a phenotype'' and gene regulatory network analyses.
Figure 1. GEO and SRA data collection and processing. (A) Automated systems detect new data and load metadata for curators to use in manual annotation processes. The results are then used to generate a run file that feeds the raw sequencing data from the SRA in the appropriate manner into the GEO bioinformatics pipeline. The pipeline output then supplies files to various Xenbase resources, such as the FTP repository, the JBrowse genome browser and the Xenbase database. (B) Details of the RNA-seq and ChIP-seq data processing pipelines utilizing CSBB (see https://github.com/csbbcompbio/CSBB-v3.0).
Figure 2. The GEO and SRA simple search interface and search results. (A) The Simple Search Interface. Any common search term, such as a GEO ID, gene symbol, tissue, author or reagent can be queried. In this example the query was ‘wnt8a’, returned two GSE results. (B) Within the GSE page opened by selecting the search result (A) of interest, in addition to the metadata and other information on the GSE a table is displayed providing access to the processed data. Results of the DEG analysis between samples can be viewed in Table format by selecting the DEG link (red arrow) or by selecting multiple check boxes under ‘Compare’ (blue arrow) then selecting this button. Alternatively, bigWig tracks can be viewed by choosing those of interest and selecting the ‘Load in JBrowse’ button.
Figure 3. Visualization of GEO data. (A) An example of a heatmap displaying TPM values for a set of six experiments from within a single GSE. This view is generated using the check boxes described in Figure 2B. Users can change the color map, switch to sorting results by Log fold change, FDR and more using various tools in the interface. Cutoffs/thresholds can also be changed, as can the number of DEGs display. Mouse over will display the values for any individual tile. (B) The table view of a set of DEGs from a GSM, compared to its controls. Results can be sorted by any column (e.g. LogFC, FDR etc.) using the up or down arrowheads. (C) Results loaded into JBrowse displaying ChIP results as a bigWig, the corresponding peak call or RNA-Seq data illustrating the impact on a target gene of injecting a morpholino against foxH1. These data can be loaded using the ‘Load in JBrowse’ tool illustrated in Figure 2, or via the hierarchical menu on the left side of the JBrowse window. In this example results from multiple different studies are compared. (D) Alternatively tracks of interest can be loaded using the faceted track viewer available for both Xenopus laevis and Xenopus tropicalis genomes. In this example the ‘GEO tracks’ button in the top left of panel C was selected, then data filtered by entering MO into the ‘Contains text’ box to restricts tracks displayed to morpholino data. This method also allows users to combine results loaded via the GEO results interface to additional results from other studies (GSEs), which can also be loaded via the faceted track viewer. The selected tracks are viewed by clicking on the ‘Back to browser’ button. While different studies cannot be stringently compared, the same pipeline and thresholds were used across all dataset, so are useful for hypothesis generation and further experimentation.
Barrett,
NCBI GEO: archive for functional genomics data sets--update.
2013, Pubmed
Barrett,
NCBI GEO: archive for functional genomics data sets--update.
2013,
Pubmed
Buels,
JBrowse: a dynamic web platform for genome visualization and analysis.
2016,
Pubmed
Davis,
The Encyclopedia of DNA elements (ENCODE): data portal update.
2018,
Pubmed
James-Zorn,
Navigating Xenbase: An Integrated Xenopus Genomics and Gene Expression Database.
2018,
Pubmed
,
Echinobase
Karimi,
Xenbase: a genomic, epigenomic and transcriptomic model organism database.
2018,
Pubmed
,
Echinobase
Langmead,
Fast gapped-read alignment with Bowtie 2.
2012,
Pubmed
Li,
RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome.
2011,
Pubmed
Li,
The Sequence Alignment/Map format and SAMtools.
2009,
Pubmed
Mitros,
A chromosome-scale genome assembly and dense genetic map for Xenopus tropicalis.
2019,
Pubmed
Nenni,
Xenbase: Facilitating the Use of Xenopus to Model Human Disease.
2019,
Pubmed
,
Echinobase
Ramírez,
deepTools: a flexible platform for exploring deep-sequencing data.
2014,
Pubmed
Risso,
Normalization of RNA-seq data using factor analysis of control genes or samples.
2014,
Pubmed
Segerdell,
Enhanced XAO: the ontology of Xenopus anatomy and development underpins more accurate annotation of gene expression and queries on Xenbase.
2013,
Pubmed
,
Echinobase
Session,
Genome evolution in the allotetraploid frog Xenopus laevis.
2016,
Pubmed
,
Echinobase
Zhang,
Model-based analysis of ChIP-Seq (MACS).
2008,
Pubmed