Click
here to close Hello! We notice that
you are using Internet Explorer, which is not supported by Echinobase
and may cause the site to display incorrectly. We suggest using a
current version of Chrome,
FireFox,
or Safari.
BMC Bioinformatics
2012 Jan 01;13 Suppl 17:S26. doi: 10.1186/1471-2105-13-S17-S26.
Show Gene links
Show Anatomy links
Phylogenetic reconstruction in the order Nymphaeales: ITS2 secondary structure analysis and in silico testing of maturase k (matK) as a potential marker for DNA bar coding.
Biswal DK
,
Debnath M
,
Kumar S
,
Tandon P
.
???displayArticle.abstract???
BACKGROUND: The Nymphaeales (waterlilly and relatives) lineage has diverged as the second branch of basal angiosperms and comprises of two families: Cabombaceae and Nymphaceae. The classification of Nymphaeales and phylogeny within the flowering plants are quite intriguing as several systems (Thorne system, Dahlgren system, Cronquist system, Takhtajan system and APG III system (Angiosperm Phylogeny Group III system) have attempted to redefine the Nymphaeales taxonomy. There have been also fossil records consisting especially of seeds, pollen, stems, leaves and flowers as early as the lower Cretaceous. Here we present an in silico study of the order Nymphaeales taking maturaseK (matK) and internal transcribed spacer (ITS2) as biomarkers for phylogeny reconstruction (using character-based methods and Bayesian approach) and identification of motifs for DNA barcoding.
RESULTS: The Maximum Likelihood (ML) and Bayesian approach yielded congruent fully resolved and well-supported trees using a concatenated (ITS2+ matK) supermatrix aligned dataset. The taxon sampling corroborates the monophyly of Cabombaceae. Nuphar emerges as a monophyletic clade in the family Nymphaeaceae while there are slight discrepancies in the monophyletic nature of the genera Nymphaea owing to Victoria-Euryale and Ondinea grouping in the same node of Nymphaeaceae. ITS2 secondary structures alignment corroborate the primary sequence analysis. Hydatellaceae emerged as a sister clade to Nymphaeaceae and had a basal lineage amongst the water lilly clades. Species from Cycas and Ginkgo were taken as outgroups and were rooted in the overall tree topology from various methods.
CONCLUSIONS: MatK genes are fast evolving highly variant regions of plant chloroplast DNA that can serve as potential biomarkers for DNA barcoding and also in generating primers for angiosperms with identification of unique motif regions. We have reported unique genus specific motif regions in the Order Nymphaeles from matK dataset which can be further validated for barcoding and designing of PCR primers. Our analysis using a novel approach of sequence-structure alignment and phylogenetic reconstruction using molecular morphometrics congrue with the current placement of Hydatellaceae within the early-divergent angiosperm order Nymphaeales. The results underscore the fact that more diverse genera, if not fully resolved to be monophyletic, should be represented by all major lineages.
Figure 1. ML topology of Nymphaeales from the aligned concatenated super matrix dataset using PhyML 3.0. Phylogeny reconstruction of the Order Nymphaeales based on concatenated dataset of two different loci (ITS2+ matK) using a taxon set of 51 taxa (including three outgroup taxa, Cycas revoluta, Cycas siamensis and Ginkgo biloba) with aLRT values (best ML tree, majority rule, aLRT values similar to 100 bootstrap replicates)
Figure 2. Bayesian Phylogram (majority rule consensus tree) inferred from the aligned supermatrix dataset (ITS2 + matk). Nymphaeale phylogeny reconstruction using sequence evolution model using GTR with 10 million generations, sample frequency, 1000, burn-in: 10% discarded in MrBayes 3.2. The third family Hydatellaceae represented by the genus Trithuria sps formed a sister basal lineage to Cabombaceae and Nymphaeaceae and clustered with the outgroup (Cycas and Ginkgo).
Figure 3. Median-joining network graphs with uncorrected p distances inferred with Splitstree version 4.10 from the supermatrix (ITS2+ matK).
Figure 5. ITS2 Consensus secondary structures of Nymphaeales with color legend using RNAz and LocaRNA. Validation of conserved ITS2 secondary structures across the three Nymphaeale families (Cabombaceae, Nymphaeaceae and Hydatellaceae). The three families are represented by the genera A. Brasenia, B. Cabomba, C. Euryale, D. Nuphur, E. Nymphaea F. Victoria G. Trithuria. Standard nucleotide ambiguity codes are used.
Figure 6. Profile Neigbour Joining (PNJ) tree from primary sequence- secondary structure alignment of Nymphaeale ITS2 data using 4SALE and ProfDistS. Simple correction Jukes and Cantor formula (Jukes and Cantor, 1969) operated on sequence-structure alignments. Based on the GTR RNA sequence-structure specific substitution model evolutionary distances between sequence-structure pairs are estimated by maximum likelihood and are also extended on the profile level. The group Hydatellaceae clustered with Cabombaceae and emerged as a sister clade to Nymphaeaceae. Consensus bootstrap values with 100 replicates are shown next to branches and ProfDistS output tree file viewed in NJplot. Tree viewing Profiles are marked by "Pi" (profile generated by identity threshold), "Pb" (profiles generated by bootstrap threshold) and "Po" (old profile generated in a previous iteration).
Altschul,
Basic local alignment search tool.
1990, Pubmed
Altschul,
Basic local alignment search tool.
1990,
Pubmed
Bailey,
MEME SUITE: tools for motif discovery and searching.
2009,
Pubmed
Bailey,
Combining evidence using p-values: application to sequence homology searches.
1998,
Pubmed
Bailey,
MEME: discovering and analyzing DNA and protein sequence motifs.
2006,
Pubmed
Benson,
GenBank.
2011,
Pubmed
Chaveerach,
Molecular identification and barcodes for the genus Nymphaea.
2011,
Pubmed
Chen,
Validation of the ITS2 region as a novel DNA barcode for identifying medicinal plant species.
2010,
Pubmed
Coleman,
ITS2 is a double-edged tool for eukaryote evolutionary comparisons.
2003,
Pubmed
Gatesy,
Alignment-ambiguous nucleotide sites and the exclusion of systematic data.
1993,
Pubmed
Gilbert,
Sequence file format conversion with command-line readseq.
2003,
Pubmed
Gruber,
The RNAz web server: prediction of thermodynamically stable and evolutionarily conserved RNA structures.
2007,
Pubmed
Guindon,
New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0.
2010,
Pubmed
Hollingsworth,
Choosing and using a plant DNA barcode.
2011,
Pubmed
Huson,
Application of phylogenetic networks in evolutionary studies.
2006,
Pubmed
Jonassen,
Efficient discovery of conserved patterns using a pattern graph.
1997,
Pubmed
Kück,
FASconCAT: Convenient handling of data matrices.
2010,
Pubmed
Lahaye,
DNA barcoding the floras of biodiversity hotspots.
2008,
Pubmed
Les,
Molecular evolutionary history of ancient aquatic angiosperms.
1991,
Pubmed
Levinson,
Slipped-strand mispairing: a major mechanism for DNA sequence evolution.
1987,
Pubmed
Li,
rbcL and matK earn two thumbs up as the core DNA barcode for ferns.
2011,
Pubmed
Little,
DNA barcode sequence identification incorporating taxonomic hierarchy and within taxon variability.
2011,
Pubmed
Lorenz,
ViennaRNA Package 2.0.
2011,
Pubmed
Perrière,
WWW-query: an on-line retrieval system for biological sequence banks.
1996,
Pubmed
Posada,
Model selection and model averaging in phylogenetics: advantages of akaike information criterion and bayesian approaches over likelihood ratio tests.
2004,
Pubmed
Qiu,
The earliest angiosperms: evidence from mitochondrial, plastid and nuclear genomes.
1999,
Pubmed
Ronquist,
MrBayes 3: Bayesian phylogenetic inference under mixed models.
2003,
Pubmed
Saarela,
Hydatellaceae identified as a new branch near the base of the angiosperm phylogenetic tree.
2007,
Pubmed
Savolainen,
Phylogenetics of flowering plants based on combined analysis of plastid atpB and rbcL gene sequences.
2000,
Pubmed
Seibel,
Synchronous visual analysis and editing of RNA sequence and secondary structure alignments using 4SALE.
2008,
Pubmed
Smith,
Freiburg RNA Tools: a web server integrating INTARNA, EXPARNA and LOCARNA.
2010,
Pubmed
Takezaki,
Phylogenetic test of the molecular clock and linearized trees.
1995,
Pubmed
Tamura,
MEGA4: Molecular Evolutionary Genetics Analysis (MEGA) software version 4.0.
2007,
Pubmed
Thompson,
CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice.
1994,
Pubmed
Waddell,
General time-reversible distances with unequal rates across sites: mixing gamma and inverse Gaussian distributions with invariant sites.
1997,
Pubmed
Wolf,
ProfDistS: (profile-) distance based phylogeny on sequence--structure alignments.
2008,
Pubmed