Click
here to close Hello! We notice that
you are using Internet Explorer, which is not supported by Echinobase
and may cause the site to display incorrectly. We suggest using a
current version of Chrome,
FireFox,
or Safari.
Evolutionary analyses of genes in Echinodermata offer insights towards the origin of metazoan phyla.
Foley S
,
Vlasova A
,
Marcet-Houben M
,
Gabaldón T
,
Hinman VF
.
???displayArticle.abstract???
Despite recent studies discussing the evolutionary impacts of gene duplications and losses among metazoans, the genomic basis for the evolution of phyla remains enigmatic. Here, we employ phylogenomic approaches to search for orthologous genes without known functions among echinoderms, and subsequently use them to guide the identification of their homologs across other metazoans. Our final set of 14 genes was obtained via a suite of homology prediction tools, gene expression data, gene ontology, and generating the Strongylocentrotus purpuratus phylome. The gene set was subjected to selection pressure analyses, which indicated that they are highly conserved and under negative selection. Their presence across broad taxonomic depths suggests that genes required to form a phylum are ancestral to that phylum. Therefore, rather than de novo gene genesis, we posit that evolutionary forces such as selection on existing genomic elements over large timescales may drive divergence and contribute to the emergence of phyla.
Fig. 2. GO term enrichment analysis of the genes duplicated at the S. purpuratus level (age #1) for duplications without large expansions, summarized with the REVIGO server. Panel A covers biological processes, and panel B covers molecular function categories.
Fig. 3. GO term enrichment analysis of the S. purpuratus orphan (i.e., potentially lineage specific) genes, summarized with the REVIGO server. Panel A covers biological processes, and panel B covers molecular function categories.
Fig. 4. Gene expression in both embryonic and adult tissues represented as a heatmap. In each case, genes in the green bracket corresponding to the purple sea urchin were only recovered in echinoderms, and genes in the green bracket corresponding to the blue InterPro logo were the only ones to record InterPro domains. Panel A shows the raw TPM values for each key developmental timepoint, ranging from 0 to 72 h post-fertilization. Panel B shows the log expression of those TPM values. Expression of our genes of interest is sustained throughout development, generally peaking at 18-24 h. Panel C shows TPM values across six different adult tissues. Panel D shows the log of those values. Expression is also sustained in adult tissues, particularly in the coelomocyte and testes.
Albalat,
Evolution by gene loss.
2016,
Pubmed
Al-Shahrour,
FatiGO: a web tool for finding significant associations of Gene Ontology terms with groups of genes.
2004,
Pubmed
Altenhoff,
The OMA orthology database in 2018: retrieving evolutionary relationships among all domains of life through richer web and programmatic interfaces.
2018,
Pubmed
Altenhoff,
The Quest for Orthologs benchmark service and consensus calls in 2020.
2020,
Pubmed
Altschul,
Basic local alignment search tool.
1990,
Pubmed
Arshinoff,
Echinobase: leveraging an extant model organism database to build a knowledgebase supporting research on the genomics and biology of echinoderms.
2022,
Pubmed
,
Echinobase
Bhattacharya,
Recent Advances in Protein Homology Detection Propelled by Inter-Residue Interaction Map Threading.
2021,
Pubmed
Camacho,
BLAST+: architecture and applications.
2009,
Pubmed
Capella-Gutiérrez,
trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses.
2009,
Pubmed
Cary,
Analysis of sea star larval regeneration reveals conserved processes of whole-body regeneration across the metazoa.
2019,
Pubmed
,
Echinobase
Cary,
Genome-wide use of high- and low-affinity Tbrain transcription factor binding sites during echinoderm development.
2017,
Pubmed
,
Echinobase
Dehal,
Two rounds of whole genome duplication in the ancestral vertebrate.
2005,
Pubmed
Deline,
Evolution and Development at the Origin of a Phylum.
2020,
Pubmed
,
Echinobase
Deutekom,
Benchmarking orthology methods using phylogenetic patterns defined at the base of Eukaryotes.
2021,
Pubmed
Dolmatov,
Molecular mechanisms of fission in echinoderms: Transcriptome analysis.
2018,
Pubmed
,
Echinobase
Edgar,
MUSCLE: multiple sequence alignment with high accuracy and high throughput.
2004,
Pubmed
Emms,
OrthoFinder: phylogenetic orthology inference for comparative genomics.
2019,
Pubmed
Fernández,
Gene gain and loss across the metazoan tree of life.
2020,
Pubmed
Foley,
Integration of 1:1 orthology maps and updated datasets into Echinobase.
2021,
Pubmed
,
Echinobase
Fuentes,
PhylomeDB V5: an expanding repository for genome-wide catalogues of annotated gene phylogenies.
2022,
Pubmed
Gabaldón,
Large-scale assignment of orthology: back to phylogenetics?
2008,
Pubmed
Gabaldón,
Functional and evolutionary implications of gene orthology.
2013,
Pubmed
Glover,
Advances and Applications in the Quest for Orthologs.
2019,
Pubmed
Grabherr,
Full-length transcriptome assembly from RNA-Seq data without a reference genome.
2011,
Pubmed
Haas,
De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis.
2013,
Pubmed
Hu,
SwiftOrtho: A fast, memory-efficient, multiple genome orthology classifier.
2019,
Pubmed
Hu,
An integrative approach to ortholog prediction for disease-focused and other functional studies.
2011,
Pubmed
Huerta-Cepas,
PhylomeDB v3.0: an expanding repository of genome-wide collections of trees, alignments and phylogeny-based orthology and paralogy predictions.
2011,
Pubmed
Huerta-Cepas,
Assigning duplication events to relative temporal scales in genome-wide studies.
2011,
Pubmed
Huerta-Cepas,
ETE 3: Reconstruction, Analysis, and Visualization of Phylogenomic Data.
2016,
Pubmed
Jones,
InterProScan 5: genome-scale protein function classification.
2014,
Pubmed
Katoh,
MAFFT multiple sequence alignment software version 7: improvements in performance and usability.
2013,
Pubmed
Lassmann,
Kalign--an accurate and fast multiple sequence alignment algorithm.
2005,
Pubmed
Lechner,
Proteinortho: detection of (co-)orthologs in large-scale analysis.
2011,
Pubmed
Lessios,
Speciation genes in free-spawning marine invertebrates.
2011,
Pubmed
,
Echinobase
Li,
Protein remote homology detection based on bidirectional long short-term memory.
2017,
Pubmed
Meyer,
The arm of the starfish: The far-reaching applications of Patiria miniata as a model system in evolutionary, developmental, and regenerative biology.
2022,
Pubmed
,
Echinobase
Mistry,
Pfam: The protein families database in 2021.
2021,
Pubmed
Murrell,
FUBAR: a fast, unconstrained bayesian approximation for inferring selection.
2013,
Pubmed
Nguyen,
IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies.
2015,
Pubmed
Oliver,
Whole-genome positive selection and habitat-driven evolution in a shallow and a deep-sea urchin.
2010,
Pubmed
,
Echinobase
Ostlund,
InParanoid 7: new algorithms and tools for eukaryotic orthology analysis.
2010,
Pubmed
Persson,
Domainoid: domain-oriented orthology inference.
2019,
Pubmed
Potter,
HMMER web server: 2018 update.
2018,
Pubmed
Remm,
Automatic clustering of orthologs and in-paralogs from pairwise species comparisons.
2001,
Pubmed
Schurch,
How many biological replicates are needed in an RNA-seq experiment and which differential expression tool should you use?
2016,
Pubmed
Sharma,
A genomics approach reveals insights into the importance of gene losses for mammalian adaptations.
2018,
Pubmed
Sun,
Signal-dependent regulation of the sea urchin skeletogenic gene regulatory network.
2014,
Pubmed
,
Echinobase
Supek,
REVIGO summarizes and visualizes long lists of gene ontology terms.
2011,
Pubmed
Suyama,
PAL2NAL: robust conversion of protein sequence alignments into the corresponding codon alignments.
2006,
Pubmed
Tu,
Gene structure in the sea urchin Strongylocentrotus purpuratus based on transcriptome analysis.
2012,
Pubmed
,
Echinobase
Wallace,
M-Coffee: combining multiple sequence alignment methods with T-Coffee.
2006,
Pubmed
Weaver,
Datamonkey 2.0: A Modern Web Application for Characterizing Selective and Other Evolutionary Processes.
2018,
Pubmed
Wehe,
DupTree: a program for large-scale phylogenetic analyses using gene tree parsimony.
2008,
Pubmed
Weisman,
The Origins and Functions of De Novo Genes: Against All Odds?
2022,
Pubmed
Wilburn,
Remote homology search with hidden Potts models.
2020,
Pubmed
Zheng,
Detecting distant-homology protein structures by aligning deep neural-network based contact maps.
2019,
Pubmed