Click
here to close Hello! We notice that
you are using Internet Explorer, which is not supported by Echinobase
and may cause the site to display incorrectly. We suggest using a
current version of Chrome,
FireFox,
or Safari.
Nat Plants
2020 Mar 01;63:215-222. doi: 10.1038/s41477-020-0594-6.
Show Gene links
Show Anatomy links
Prickly waterlily and rigid hornwort genomes shed light on early angiosperm evolution.
Yang Y
,
Sun P
,
Lv L
,
Wang D
,
Ru D
,
Li Y
,
Ma T
,
Zhang L
,
Shen X
,
Meng F
,
Jiao B
,
Shan L
,
Liu M
,
Wang Q
,
Qin Z
,
Xi Z
,
Wang X
,
Davis CC
,
Liu J
.
???displayArticle.abstract???
Angiosperms represent one of the most spectacular terrestrial radiations on the planet1, but their early diversification and phylogenetic relationships remain uncertain2-5. A key reason for this impasse is the paucity of complete genomes representing early-diverging angiosperms. Here, we present high-quality, chromosomal-level genome assemblies of two aquatic species-prickly waterlily (Euryale ferox; Nymphaeales) and the rigid hornwort (Ceratophyllum demersum; Ceratophyllales)-and expand the genomic representation for key sectors of the angiosperm tree of life. We identify multiple independent polyploidization events in each of the five major clades (that is, Nymphaeales, magnoliids, monocots, Ceratophyllales and eudicots). Furthermore, our phylogenomic analyses, which spanned multiple datasets and diverse methods, confirm that Amborella and Nymphaeales are successively sister to all other angiosperms. Furthermore, these genomes help to elucidate relationships among the major subclades within Mesangiospermae, which contain about 350,000 species. In particular, the species-poor lineage Ceratophyllales is supported as sister to eudicots, and monocots and magnoliids are placed as successively sister to Ceratophyllales and eudicots. Finally, our analyses indicate that incomplete lineage sorting may account for the incongruent phylogenetic placement of magnoliids between nuclear and plastid genomes.
Fig. 1. Comparative genomics analyses.a, The genomic features of E. ferox (pseudomolecules size: 721.2 Mb) and C. demersum (pseudomolecules size: 703.8 Mb). From inside to outside: GC content in 500-bp sliding windows (i; minimum–maximum, 0.2–0.8); repeat density in 10-kb sliding windows (ii; minimum–maximum: 0–1.0, coloured from white to dark green); gene density in 100-kb sliding windows (iii; minimum–maximum, 0–30, coloured from white to dark orange); and SNV density in 100-kb sliding windows (iv; minimum–maximum: 0–0.025). The links in the centre connect syntenic gene blocks that were detected using MCscan. Chr, chromosome. b, Distribution of average synonymous substitution levels (Ks) between syntenic blocks after evolutionary rate correction. c, Syntenic blocks (involving ≥10 colinear genes) between genomes. The corresponding median Ks value is shown for each block, and polyploidization events are represented by different colours.
Fig. 2. Phylogenomic analyses of early-diverging angiosperms.a, Chronogram of early-diverging angiosperms on the basis of the dataset SSCG-CDS inferred using MCMCTree. Bootstrap support percentages and posterior probabilities are indicated for each internal branch (from left to right, SSCG-CDS concatenation analysis using maximum likelihood (CA-ML), SSCG-CDS ASTRAL, LCG-CDS STAG, SSCG-Codon12 CA-ML, SSCG-Codon12 ASTRAL, LCG-Codon12 STAG), and an asterisk indicates 100 bootstrap support percentages and 1.0 posterior probabilities in all analyses. i, ii and iii indicate each internal branch. C, Carboniferous; J, Jurassic; K, Cretaceous; N, Neogene; P, Permian; , Triassic; Q, Quaternary; , Palaeogene. b, Species tree analysis using DiscoVista. Rows correspond to focal splits, and the spectrum indicates the support value for splits that are compatible with a species tree. Teal indicates themonophyly of a clade, and the different shades of teal indicate the level of its bootstrap support percentage (0 to 100%). Orange indicates rejection of a clade, and a 95% cut-off (instead of astandard 75%) was selected for strong rejection due to higher support values with genome-scale data. c, Superimposed ultrametric gene trees in a consensus DensiTree plot. The datasets SSCG-CDS and SSCG-Codon12 are shown in red and orange, respectively. d, The frequency of three topologies (q1–q3) around focal internal branches of ASTRAL species trees in the datasets SSCG-CDS and SSCG-Codon12. Each internal branch (labelled i, ii and iii) with four neighbouring branches can lead to three possible topologies (for example, q1, q2 and q3). e, Gene tree compatibility. The portion of gene trees for which focal splits are highly (or weakly) supported (or rejected). Weakly rejected splits are those that are not in the tree but are compatible if low support branches (below 75%) are contracted.
Amborella Genome Project,
The Amborella genome and the evolution of flowering plants.
2013, Pubmed
Amborella Genome Project,
The Amborella genome and the evolution of flowering plants.
2013,
Pubmed
Bouckaert,
DensiTree: making sense of sets of phylogenetic trees.
2010,
Pubmed
Buggs,
The deepening of Darwin's abominable mystery.
2017,
Pubmed
Burton,
Chromosome-scale scaffolding of de novo genome assemblies based on chromatin interactions.
2013,
Pubmed
Cai,
Widespread ancient whole-genome duplications in Malpighiales coincide with Eocene global climatic upheaval.
2019,
Pubmed
Chaw,
Stout camphor tree genome fills gaps in understanding of flowering plant genome evolution.
2019,
Pubmed
Chen,
Liriodendron genome sheds light on angiosperm phylogeny and species-pair differentiation.
2019,
Pubmed
Cosentino,
SonicParanoid: fast, accurate and easy orthology inference.
2019,
Pubmed
De Smet,
Convergent gene loss following gene and genome duplications creates single-copy families in flowering plants.
2013,
Pubmed
Dilcher,
An Early Cretaceous fruit with affinities to Ceratophyllaceae.
2009,
Pubmed
Endress,
Reconstructing the ancestral angiosperm flower and its initial specializations.
2009,
Pubmed
Enright,
An efficient algorithm for large-scale detection of protein families.
2002,
Pubmed
Estep,
Allopolyploidy, diversification, and the Miocene grassland expansion.
2014,
Pubmed
Friedman,
The meaning of Darwin's 'abominable mystery'.
2009,
Pubmed
Friis,
Diversity in obscurity: fossil flowers and the early history of angiosperms.
2010,
Pubmed
Grabherr,
Full-length transcriptome assembly from RNA-Seq data without a reference genome.
2011,
Pubmed
Haas,
Improving the Arabidopsis genome annotation using maximal transcript alignment assemblies.
2003,
Pubmed
Hillis,
Is sparse taxon sampling a problem for phylogenetic inference?
2003,
Pubmed
Hoang,
UFBoot2: Improving the Ultrafast Bootstrap Approximation.
2018,
Pubmed
Jarvis,
Whole-genome analyses resolve early branches in the tree of life of modern birds.
2014,
Pubmed
Jiao,
Ancestral polyploidy in seed plants and angiosperms.
2011,
Pubmed
Jones,
InterProScan 5: genome-scale protein function classification.
2014,
Pubmed
Kalyaanamoorthy,
ModelFinder: fast model selection for accurate phylogenetic estimates.
2017,
Pubmed
Katoh,
MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform.
2002,
Pubmed
Koren,
Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation.
2017,
Pubmed
Li,
Origin of angiosperms and the puzzle of the Jurassic gap.
2019,
Pubmed
Li,
OrthoMCL: identification of ortholog groups for eukaryotic genomes.
2003,
Pubmed
Li,
Inference of human population history from individual whole-genome sequences.
2011,
Pubmed
Liu,
Phybase: an R package for species tree analysis.
2010,
Pubmed
Luo,
SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler.
2012,
Pubmed
Moore,
Using plastid genome-scale data to resolve enigmatic relationships among basal angiosperms.
2007,
Pubmed
Nei,
Simple methods for estimating the numbers of synonymous and nonsynonymous nucleotide substitutions.
1986,
Pubmed
Nguyen,
IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies.
2015,
Pubmed
One Thousand Plant Transcriptomes Initiative,
One thousand plant transcriptomes and the phylogenomics of green plants.
2019,
Pubmed
Qiu,
The earliest angiosperms: evidence from mitochondrial, plastid and nuclear genomes.
1999,
Pubmed
Rannala,
Bayes estimation of species divergence times and ancestral population sizes using DNA sequences from multiple loci.
2003,
Pubmed
Rendón-Anaya,
The avocado genome informs deep angiosperm phylogeny, highlights introgressive hybridization, and reveals pathogen-influenced gene space adaptation.
2019,
Pubmed
Rice,
The Chromosome Counts Database (CCDB) - a community resource of plant chromosome numbers.
2015,
Pubmed
Sanderson,
r8s: inferring absolute rates of molecular evolution and divergence times in the absence of a molecular clock.
2003,
Pubmed
Sayyari,
DiscoVista: Interpretable visualizations of gene tree discordance.
2018,
Pubmed
Servant,
HiC-Pro: an optimized and flexible pipeline for Hi-C data processing.
2015,
Pubmed
Shi,
Chromosome conformation capture resolved near complete genome assembly of broomcorn millet.
2019,
Pubmed
Soltis,
Nuclear genomes of two magnoliids.
2019,
Pubmed
Solís-Lemus,
PhyloNetworks: A Package for Phylogenetic Networks.
2017,
Pubmed
Stamatakis,
RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies.
2014,
Pubmed
Stanke,
Using native and syntenically mapped cDNA alignments to improve de novo gene finding.
2008,
Pubmed
Steemans,
Origin and radiation of the earliest vascular land plants.
2009,
Pubmed
Steinegger,
MMseqs2 enables sensitive protein sequence searching for the analysis of massive data sets.
2017,
Pubmed
Sun,
Deep phylogenetic incongruence in the angiosperm clade Rosidae.
2015,
Pubmed
Suyama,
PAL2NAL: robust conversion of protein sequence alignments into the corresponding codon alignments.
2006,
Pubmed
Tang,
Unraveling ancient hexaploidy through multiply-aligned angiosperm gene maps.
2008,
Pubmed
Tarailo-Graovac,
Using RepeatMasker to identify repetitive elements in genomic sequences.
2009,
Pubmed
Thomas,
Full-genome evolutionary histories of selfing, splitting, and selection in Caenorhabditis.
2015,
Pubmed
Vekemans,
Gamma paleohexaploidy in the stem lineage of core eudicots: significance for MADS-box gene and species diversification.
2012,
Pubmed
Walker,
Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement.
2014,
Pubmed
Wang,
Genome Alignment Spanning Major Poaceae Lineages Reveals Heterogeneous Evolutionary Rates and Alters Inferred Dates for Key Evolutionary Events.
2015,
Pubmed
Wang,
Incomplete lineage sorting rather than hybridization explains the inconsistent phylogeny of the wisent.
2018,
Pubmed
Wang,
Statistical inference of chromosomal homology based on gene colinearity and applications to Arabidopsis and rice.
2006,
Pubmed
Wickett,
Phylotranscriptomic analysis of the origin and early diversification of land plants.
2014,
Pubmed
Yang,
PAML 4: phylogenetic analysis by maximum likelihood.
2007,
Pubmed
Yang,
Orthology inference in nonmodel organisms using transcriptomes and low-coverage genomes: improving accuracy and matrix occupancy for phylogenomics.
2014,
Pubmed
Zeng,
Resolution of deep angiosperm phylogeny using conserved nuclear genes and estimates of early divergence times.
2014,
Pubmed
Zhang,
Highly conserved low-copy nuclear genes as effective markers for phylogenetic analyses in angiosperms.
2012,
Pubmed
Zhang,
ASTRAL-III: polynomial time species tree reconstruction from partially resolved gene trees.
2018,
Pubmed
Zhou,
Evolutionary genomics of grape (Vitis vinifera ssp. vinifera) domestication.
2017,
Pubmed