Click
here to close Hello! We notice that
you are using Internet Explorer, which is not supported by Echinobase
and may cause the site to display incorrectly. We suggest using a
current version of Chrome,
FireFox,
or Safari.
Unravelling the Complex Duplication History of Deuterostome Glycerol Transporters.
Yilmaz O
,
Chauvigné F
,
Ferré A
,
Nilsen F
,
Fjelldal PG
,
Cerdà J
,
Finn RN
.
???displayArticle.abstract???
Transmembrane glycerol transport is an ancient biophysical property that evolved in selected subfamilies of water channel (aquaporin) proteins. Here, we conducted broad level genome (>550) and transcriptome (>300) analyses to unravel the duplication history of the glycerol-transporting channels (glps) in Deuterostomia. We found that tandem duplication (TD) was the major mechanism of gene expansion in echinoderms and hemichordates, which, together with whole genome duplications (WGD) in the chordate lineage, continued to shape the genomic repertoires in craniates. Molecular phylogenies indicated that aqp3-like and aqp13-like channels were the probable stem subfamilies in craniates, with WGD generating aqp9 and aqp10 in gnathostomes but aqp7 arising through TD in Osteichthyes. We uncovered separate examples of gene translocations, gene conversion, and concerted evolution in humans, teleosts, and starfishes, with DNA transposons the likely drivers of gene rearrangements in paleotetraploid salmonids. Currently, gene copy numbers and BLAST are poor predictors of orthologous relationships due to asymmetric glp gene evolution in the different lineages. Such asymmetries can impact estimations of divergence times by millions of years. Experimental investigations of the salmonid channels demonstrated that approximately half of the 20 ancestral paralogs are functional, with neofunctionalization occurring at the transcriptional level rather than the protein transport properties. The combined findings resolve the origins and diversification of glps over >800 million years old and thus form the novel basis for proposing a pandeuterostome glp gene nomenclature.
Figure 1. Molecular phylogeny of deuterostome glycerol transporters. The tree is mid-point rooted and inferred via maximum likelihood (RAxML: generalized time-reversible (GTR) gamma model with 3000 bootstraps and search for the best-scoring maximum likelihood (ML) tree) and Bayesian analysis (15 million Markov chain Monte Carlo (MCMC) generations) of 315,034 nucleotide sites of 249 taxa partitioned by codon. Numbers within or external to each collapsed triangular branch refer to the number of taxa with Bayesian posterior probabilities annotated at each node. The scale bar indicates the expected rate of substitutions per site. Modes of gene duplication are shown on respective branches with additional duplications annotated for Teleostei and Homininae. Circled branches within the AQP10 channel cluster illustrate the origin of the polygenic aqp10 subfamily in actinopterygian fishes. (See Supplementary Figure S1 for the fully annotated tree including RAxML bootstrap support values).
Figure 2. Synteny and molecular phylogeny of non-vertebrate chordate and Ambulacrarian glycerol-transporting channels (glps). (A) Syntenies are assembled from genome sequences and arranged according to the taxonomic position of the selected organisms. Gene coding direction is indicated by the pointed end of each gene symbol with linkages represented by solid lines between paralogs. A circular arrow indicates that the chromosome (Chr) or scaffold is flipped in relation to the European starfish super 8 scaffold. Orange arrows above gene symbols indicate tandem duplication. (B) Mid-point rooted Bayesian majority rule consensus tree (5 million MCMC generations) of 97,721 nucleotide sites of 116 taxa partitioned by codon. Bayesian posterior probabilities are annotated at each node with tandem “T” or unspecified “D” modes of duplication shown on selected nodes. The scale bar indicates the expected rate of substitutions per site. (See Supplementary Figure S3 for the fully annotated tree). (C) Mid-point rooted FastTree using GTR model optimized with gamma20 likelihood showing support values at each node. (D) Mid-point rooted maximum likelihood tree (PAUP: heuristic search optimized for parsimony GTR model with NST = 6; partitioned by codon) with RAxML support values shown at each node.
Figure 3. Molecular phylogenies of actinopterygian glps. (A) Mid-point rooted Bayesian majority rule consensus tree of aqp3 channels (30 million MCMC generations) of 365,143 nucleotide sites of 382 taxa partitioned by codon. (See Supplementary Figure S5 for the fully annotated tree). (B) Bayesian majority rule consensus tree of aqp7 channels (15 million MCMC generations) of 225,618 nucleotide sites of 232 taxa partitioned by codon. The tree is rooted with reedfish (Erpetoichthys calabaricus) aqp7. (See Supplementary Figure S6 for the fully annotated tree). (C) Mid-point rooted Bayesian majority rule consensus tree of aqp9 channels (40 million MCMC generations) of 440,522 nucleotide sites of 460 taxa partitioned by codon. (See Supplementary Figure S7 for the fully annotated tree). (D) Mid-point rooted Bayesian majority rule consensus tree of aqp10 channels (50 million MCMC generations) of 457,471 nucleotide sites of 431 taxa partitioned by codon. (See Supplementary Figure S8 for the fully annotated tree). Bayesian posterior probabilities are indicated with color nodes as specified in the key. Six non-WGD lineage-dependent duplications are highlighted together with modes of gene duplication or loss in each tree.
Figure 4. Evolution and expression of glps in paleotetraploid salmonids. (A) Schematic representation of the evolution, coding direction, and diversification of glps in a non-teleostean bichir and protocanthopterygian teleosts (Esociformes and Salmoniformes). Whole genome (R3 and R4) and tandem (T) duplication events are annotated on relevant nodes. (B) glp tissue expression profiles (RT-PCR) from four male and four female Atlantic salmon using elongation initiation factor 1a (eif1a) as a reference. Negative controls: cDNA without reverse transcriptase (RT) or no cDNA (NT).
Figure 5. Functional characterization of Atlantic salmon Glps. Osmotic water permeability (Pf) and glycerol uptake (Pgly) of X. laevis oocytes injected with water (control) or cRNAs encoding the Atlantic salmon Glps. Data are box and whisker plots with the number of biologically independent oocytes indicated beside each plot. *** p < 0.001, statistically different (unpaired Student’s t-test), with respect to control oocytes. NS: not statistically significant with respect to control oocytes.
Figure 6. A timeline for the evolution of vertebrate glps. (A) A schematic representation of historical events that occurred during the evolution of vertebrate glps. Data are arranged in accordance with the divergence of each lineage through geological time [61,77]. Ma: millions of years ago. The presence of single copy aqp3, -7, -9, -10, or -13 orthologs in Sarcopterygii is indicated with a “+” with extra copies or loss within each subfamily superscripted “+” (blue) or “-” (red). Four rounds (R1–R4) of whole genome duplication, together with three interchromosomal duplications/translocations and >20 tandem duplication and gene loss events, are shown. Numbers in square brackets refer to the number of taxa sourced for coding sequence (CDS) assembly. (B) Proposed aquaporin (aqp) nomenclature in non-sarcopterygian vertebrates, with the genes colored according to their origins. Numbers in parentheses refer to gene copy numbers. (C) Bayesian divergence time estimation of aqp10aa1 and -10aa2 duplication in Cyprinodontoidei. (D) Bayesian divergence time estimation of aqp3a1 and -3a2 duplication in African cichlids.
Figure 7. The origin of deuterostome Glp subfamilies. A proposed scheme for the origin of glp gene subfamilies in Deuterostomia as a function of geological time. Modes of gene expansion are indicated as T: tandem duplication; or R1 and R2: whole genome duplication. Mz: Mesozoic; Cz: Cenozoic; and Ma: millions of years ago.
Abascal,
Diversity and evolution of membrane intrinsic proteins.
2014, Pubmed
Abascal,
Diversity and evolution of membrane intrinsic proteins.
2014,
Pubmed
Agre,
Aquaporins: a family of water channel proteins.
1993,
Pubmed
Amores,
Genome evolution and meiotic maps by massively parallel DNA sequencing: spotted gar, an outgroup for the teleost genome duplication.
2011,
Pubmed
Berthelot,
The rainbow trout genome provides novel insights into evolution after whole-genome duplication in vertebrates.
2014,
Pubmed
Betancur-R,
Phylogenetic classification of bony fishes.
2017,
Pubmed
Bezerra-Neto,
Plant Aquaporins: Diversity, Evolution and Biotechnological Applications.
2019,
Pubmed
Cerdà,
The Physiological Role and Regulation of Aquaporins in Teleost Germ Cells.
2017,
Pubmed
Cerdà,
Piscine aquaporins: an overview of recent advances.
2010,
Pubmed
Chauvigné,
Subcellular localization of selectively permeable aquaporins in the male germ line of a marine teleost reveals spatial redistribution in activated spermatozoa.
2013,
Pubmed
Chauvigné,
Mitochondrial aquaporin-8-mediated hydrogen peroxide transport is essential for teleost spermatozoon motility.
2015,
Pubmed
Chauvigné,
The vertebrate Aqp14 water channel is a neuropeptide-regulated polytransporter.
2019,
Pubmed
Chauvigné,
The pH sensitivity of Aqp0 channels in tetraploid and diploid teleosts.
2015,
Pubmed
David,
Recent duplication of the common carp (Cyprinus carpio L.) genome as revealed by analyses of microsatellite loci.
2003,
Pubmed
de Boer,
Bursts and horizontal evolution of DNA transposons in the speciation of pseudotetraploid salmonids.
2007,
Pubmed
de Maré,
Structural Basis for Glycerol Efflux and Selectivity of Human Aquaporin 7.
2020,
Pubmed
Dong,
Genome Wide Identification, Phylogeny, and Expression of Aquaporin Genes in Common Carp (Cyprinus carpio).
2016,
Pubmed
Driedzic,
Rainbow smelt: the unusual case of cryoprotection by sustained glycerol production in an aquatic animal.
2015,
Pubmed
Engelund,
Differential expression and novel permeability properties of three aquaporin 8 paralogs from seawater-challenged Atlantic salmon smolts.
2013,
Pubmed
Finn,
The lineage-specific evolution of aquaporin gene clusters facilitated tetrapod terrestrial adaptation.
2014,
Pubmed
Finn,
Evolution and functional diversity of aquaporins.
2015,
Pubmed
Finn,
Insect glycerol transporters evolved by functional co-option and gene replacement.
2015,
Pubmed
Finn,
Aquaporin evolution in fishes.
2011,
Pubmed
Han,
phyloXML: XML for evolutionary biology and comparative genomics.
2009,
Pubmed
Holland,
A new look at an old question: when did the second whole genome duplication occur in vertebrate evolution?
2018,
Pubmed
Holm,
Aquaporin 6 is permeable to glycerol and urea.
2004,
Pubmed
Hughes,
Comprehensive phylogeny of ray-finned fishes (Actinopterygii) based on transcriptomic and genomic data.
2018,
Pubmed
Irisarri,
Phylogenomics uncovers early hybridization and adaptive loci shaping the radiation of Lake Tanganyika cichlid fishes.
2018,
Pubmed
Ishibashi,
Cloning and identification of a new member of water channel (AQP10) as an aquaglyceroporin.
2002,
Pubmed
Ishibashi,
Molecular cloning and expression of a member of the aquaporin family with permeability to glycerol and urea in addition to water expressed at the basolateral membrane of kidney collecting duct cells.
1994,
Pubmed
Ishibashi,
Cloning and functional expression of a new water channel abundantly expressed in the testis permeable to water, glycerol, and urea.
1997,
Pubmed
Ishibashi,
Cloning and functional expression of a new aquaporin (AQP9) abundantly expressed in the peripheral leukocytes permeable to water and urea, but not to glycerol.
1998,
Pubmed
Katoh,
Recent developments in the MAFFT multiple sequence alignment program.
2008,
Pubmed
Kozono,
Functional expression and characterization of an archaeal aquaporin. AqpM from methanothermobacter marburgensis.
2003,
Pubmed
Kumar,
TimeTree: A Resource for Timelines, Timetrees, and Divergence Times.
2017,
Pubmed
Kuraku,
Time scale for cyclostome evolution inferred with a phylogenetic diagnosis of hagfish and lamprey cDNA sequences.
2006,
Pubmed
Kuraku,
Insights into cyclostome phylogenomics: pre-2R or post-2R.
2008,
Pubmed
Lee,
Structural basis for conductance by the archaeal aquaporin AqpM at 1.68 A.
2005,
Pubmed
Lien,
The Atlantic salmon genome provides insights into rediploidization.
2016,
Pubmed
Lynch,
The evolutionary fate and consequences of duplicate genes.
2000,
Pubmed
Macqueen,
A well-constrained estimate for the timing of the salmonid whole genome duplication reveals major decoupling from species diversification.
2014,
Pubmed
Madeira,
Human aquaporin-11 is a water and glycerol channel and localizes in the vicinity of lipid droplets in human adipocytes.
2014,
Pubmed
Maurel,
Functional characterization of the Escherichia coli glycerol facilitator, GlpF, in Xenopus oocytes.
1994,
Pubmed
Meyer,
Gene and genome duplications in vertebrates: the one-to-four (-to-eight in fish) rule and the evolution of novel gene functions.
1999,
Pubmed
Morinaga,
Mouse aquaporin 10 gene (AQP10) is a pseudogene.
2002,
Pubmed
Nakatani,
Reconstruction of the vertebrate ancestral genome reveals dynamic genome reorganization in early vertebrates.
2007,
Pubmed
,
Echinobase
Nguyen,
Genomicus 2018: karyotype evolutionary trees and on-the-fly synteny computing.
2018,
Pubmed
Pao,
Evolution of the MIP family of integral membrane transport proteins.
1991,
Pubmed
Park,
Phylogenetic characterization of the MIP family of transmembrane channel proteins.
1996,
Pubmed
Pasquier,
Gene evolution and gene expression after whole genome duplication in fish: the PhyloFish database.
2016,
Pubmed
Pommerrenig,
Functional evolution of nodulin 26-like intrinsic proteins: from bacterial arsenic detoxification to plant nutrient transport.
2020,
Pubmed
Price,
FastTree 2--approximately maximum-likelihood trees for large alignments.
2010,
Pubmed
Rambaut,
Posterior Summarization in Bayesian Phylogenetics Using Tracer 1.7.
2018,
Pubmed
Ravi,
Elephant shark (Callorhinchus milii) provides insights into the evolution of Hox gene clusters in gnathostomes.
2009,
Pubmed
Reed,
Osmotic significance of glycerol accumulation in exponentially growing yeasts.
1987,
Pubmed
Reis,
Using Phylogenomic Data to Explore the Effects of Relaxed Clocks and Calibration Strategies on Divergence Time Estimation: Primates as a Test Case.
2018,
Pubmed
Robergs,
Glycerol. Biochemistry, pharmacokinetics and clinical and practical applications.
1998,
Pubmed
Ronquist,
MrBayes 3: Bayesian phylogenetic inference under mixed models.
2003,
Pubmed
Sacerdot,
Chromosome evolution at the origin of the ancestral vertebrate genome.
2018,
Pubmed
Setiamarga,
Interrelationships of Atherinomorpha (medakas, flyingfishes, killifishes, silversides, and their relatives): The first evidence based on whole mitogenome sequences.
2008,
Pubmed
Simakov,
Deeply conserved synteny resolves early events in vertebrate evolution.
2020,
Pubmed
Sodergren,
The genome of the sea urchin Strongylocentrotus purpuratus.
2006,
Pubmed
,
Echinobase
Sonah,
Analysis of aquaporins in Brassicaceae species reveals high-level of conservation and dynamic role against biotic and abiotic stress in canola.
2017,
Pubmed
Stavang,
Phylogenomic and functional analyses of salmon lice aquaporins uncover the molecular diversity of the superfamily in Arthropoda.
2015,
Pubmed
Storz,
Gene duplication, genome duplication, and the functional diversification of vertebrate globins.
2013,
Pubmed
Sun,
Comparative genomic analysis of dha regulon and related genes for anaerobic glycerol metabolism in bacteria.
2003,
Pubmed
Suyama,
PAL2NAL: robust conversion of protein sequence alignments into the corresponding codon alignments.
2006,
Pubmed
Sweet,
Glycerol facilitator of Escherichia coli: cloning of glpF and identification of the glpF product.
1990,
Pubmed
Tamás,
Fps1p controls the accumulation and release of the compatible solute glycerol in yeast osmoregulation.
1999,
Pubmed
Tanaka,
Aquaporin10 is a pseudogene in cattle and their relatives.
2015,
Pubmed
Tine,
European sea bass genome and its variation provide insights into adaptation to euryhalinity and speciation.
2014,
Pubmed
Tingaud-Sequeira,
The zebrafish genome encodes the largest vertebrate repertoire of functional aquaporins with dual paralogy and substrate specificities similar to mammals.
2010,
Pubmed
Tipsmark,
Aquaporin expression dynamics in osmoregulatory tissues of Atlantic salmon during smoltification and seawater acclimation.
2010,
Pubmed
Toxopeus,
Mechanisms underlying insect freeze tolerance.
2018,
Pubmed
Van de Peer,
The evolutionary significance of ancient genome duplications.
2009,
Pubmed
Van Ekert,
Molecular and functional characterization of Bemisia tabaci aquaporins reveals the water channel diversity of hemipteran insects.
2016,
Pubmed
Verkman,
Water permeability and fluidity of renal basolateral membranes.
1986,
Pubmed
Virkki,
Cloning and functional expression of an MIP (AQP0) homolog from killifish (Fundulus heteroclitus) lens.
2001,
Pubmed
Wegmann,
Effect of temperature on glycerol retention in the halotolerant algae dunaliella and asteromonas.
1980,
Pubmed
Wolf,
Genome reduction as the dominant mode of evolution.
2013,
Pubmed
Xu,
The allotetraploid origin and asymmetrical genome evolution of the common carp Cyprinus carpio.
2019,
Pubmed
Xu,
Genome sequence and genetic diversity of the common carp, Cyprinus carpio.
2014,
Pubmed
Zardoya,
Phylogeny and evolution of the major intrinsic protein family.
2005,
Pubmed
Zardoya,
Origin of plant glycerol transporters by horizontal gene transfer and functional recruitment.
2002,
Pubmed
Zardoya,
A phylogenetic framework for the aquaporin family in eukaryotes.
2001,
Pubmed
Zimmer,
Dating the early evolution of plants: detection and molecular clock analyses of orthologs.
2007,
Pubmed