Click
here to close Hello! We notice that
you are using Internet Explorer, which is not supported by Echinobase
and may cause the site to display incorrectly. We suggest using a
current version of Chrome,
FireFox,
or Safari.
Genome-wide patterns of codon bias are shaped by natural selection in the purple sea urchin, Strongylocentrotus purpuratus.
Kober KM
,
Pogson GH
.
???displayArticle.abstract???
Codon usage bias has been documented in a wide diversity of species, but the relative contributions of mutational bias and various forms of natural selection remain unclear. Here, we describe for the first time genome-wide patterns of codon bias at 4623 genes in the purple sea urchin, Strongylocentrotus purpuratus. Preferred codons were identified at 18 amino acids that exclusively used G or C at third positions, which contrasted with the strong AT bias of the genome (overall GC content is 36.9%). The GC content of third positions and coding regions exhibited significant correlations with the magnitude of codon bias. In contrast, the GC content of introns and flanking regions was indistinguishable from the genome-wide background, which suggested a limited contribution of mutational bias to synonymous codon usage. Five distinct clusters of genes were identified that had significantly different synonymous codon usage patterns. A significant correlation was observed between codon bias and mRNA expression supporting translational selection, but this relationship was driven by only one highly biased cluster that represented only 8.6% of all genes. In all five clusters preferred codons were evolutionarily conserved to a similar degree despite differences in their synonymous codon usage distributions and magnitude of codon bias. The third positions of preferred codons in two codon usage groups also paired significantly more often in stems than in loops of mRNA secondary structure predictions, which suggested that codon bias might also affect mRNA stability. Our results suggest that mutational bias has played a minor role in determining codon bias in S. purpuratus and that preferred codon usage may be heterogeneous across different genes and subject to different forms of natural selection.
Figure 1. Distribution of codon bias scores of (A) Nc and (B) Ncâ² in S. purpuratus.
Figure 2. Heatmap of codon usage frequencies in the five codon usage groups (CUG) in (A) Strongylocentrotus purpuratus and (B) Drosophila melanogaster. Codon usage frequencies were centered on the equal usage expectation (e.g., for fourfold degenerate codons, frequencies were centered on 1/4). Centered frequencies clustering was performed with hierarchical cluster analysis using pairwise complete-linkage by Euclidean distance for both columns (CUG) and rows (synonymous codons) using Cluster 3.0 (Eisen et al. 1998). Heatmap plots were generated with Java TreeView (Saldanha 2004). Synonymous codon labels are colored by the base composition of the third position (N3) with N3 of âAâ and âTâ nucleotides, colored pink, and âGâ and âC,â colored green. The asterisk (*) denotes genome-wide preferred codons in S. purpuratus (see Materials and Methods).
Figure 3. Plot of offsets β1 and β2 for a SCUMBLE model with four trends for S. purpuratus genes, colored by groups of genes clustered by codon usage distributions. Offset β1 is correlated most strongly to GC3 in all groups.
Figure 4. Codon bias and GC composition in protein coding gene regions for different codon usage groups. Codon bias is represented as Wrightâs Nc (Wright 1990). Regional GC composition is calculated from (A) GC3 content of exons, (B) GC content of exons, (C) GC content of introns, and (D) GC content of flanking regions. âRpâ denotes annotated ribosomal proteins. The genome-wide mean GC content is 36.9%.
Figure 5. Synonymous codon usage bias and mRNA expression levels for genes in different codon usage groups in (A) S. purpuratus and (B) Drosophila melanogaster. Codon usage bias is Novembreâs Ncâ². Red circles surround annotated ribosomal proteins (âRpâ). The black arrow labeled âUâ points to âubiquitin-like/S30 ribosomal fusion proteinâ (SPU_005280), an example of a highly expressed gene in S. purpuratus. The black arrow labeled âEâ points to Sp-Ets1/2 (SPU_002874), an example of a highly expressed, highly biased gene that does not belong to codon usage group 0.
Addison,
Multiple gene genealogies reveal asymmetrical hybridization and introgression among strongylocentrotid sea urchins.
2009, Pubmed,
Echinobase
Addison,
Multiple gene genealogies reveal asymmetrical hybridization and introgression among strongylocentrotid sea urchins.
2009,
Pubmed
,
Echinobase
Akashi,
Synonymous codon usage in Drosophila melanogaster: natural selection and translational accuracy.
1994,
Pubmed
Akashi,
Translational selection and molecular evolution.
1998,
Pubmed
Akashi,
Translational selection and yeast proteome evolution.
2003,
Pubmed
Andersson,
Codon preferences in free-living microorganisms.
1990,
Pubmed
Andrew Cameron,
A basal deuterostome genome viewed as a natural experiment.
2007,
Pubmed
,
Echinobase
Bailly-Bechet,
Codon usage domains over bacterial chromosomes.
2006,
Pubmed
Britten,
The single-copy DNA sequence polymorphism of the sea urchin Strongylocentrotus purpuratus.
1978,
Pubmed
,
Echinobase
Bulmer,
Coevolution of codon usage and transfer RNA abundance.
1987,
Pubmed
Bulmer,
The selection-mutation-drift theory of synonymous codon usage.
1991,
Pubmed
Cameron,
SpBase: the sea urchin genome database and web site.
2009,
Pubmed
,
Echinobase
Carlini,
The relationship between third-codon position nucleotide content, codon bias, mRNA secondary structure and gene expression in the drosophilid alcohol dehydrogenase genes Adh and Adhr.
2001,
Pubmed
Castillo-Davis,
Genome evolution and developmental constraint in Caenorhabditis elegans.
2002,
Pubmed
Chamary,
Hearing silence: non-neutral evolution at synonymous sites in mammals.
2006,
Pubmed
Chamary,
Evidence for selection on synonymous mutations affecting stability of mRNA secondary structure in mammals.
2005,
Pubmed
Chen,
Codon usage between genomes is constrained by genome-wide mutational processes.
2004,
Pubmed
Clarke,
Darwinian evolution of proteins.
1970,
Pubmed
Cock,
Biopython: freely available Python tools for computational molecular biology and bioinformatics.
2009,
Pubmed
Comeron,
Population, evolutionary and genomic consequences of interference selection.
2002,
Pubmed
Comeron,
Natural selection on synonymous sites is correlated with gene length and recombination in Drosophila.
1999,
Pubmed
Conesa,
Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research.
2005,
Pubmed
Drummond,
Mistranslation-induced protein misfolding as a dominant constraint on coding-sequence evolution.
2008,
Pubmed
DuMont,
DNA variability and divergence at the notch locus in Drosophila melanogaster and D. simulans: a case of accelerated synonymous site divergence.
2004,
Pubmed
Duret,
Expression pattern and, surprisingly, gene length shape codon usage in Caenorhabditis, Drosophila, and Arabidopsis.
1999,
Pubmed
Duret,
Evolution of synonymous codon usage in metazoans.
2002,
Pubmed
Duret,
tRNA gene number and codon usage in the C. elegans genome are co-adapted for optimal translation of highly expressed genes.
2000,
Pubmed
Eisen,
Cluster analysis and display of genome-wide expression patterns.
1998,
Pubmed
Eyre-Walker,
Synonymous codon bias is related to gene length in Escherichia coli: selection for translational accuracy?
1996,
Pubmed
Fitch,
The large extent of putative secondary nucleic acid structure in random nucleotide sequences or amino acid derived messenger-RNA.
1974,
Pubmed
Grantham,
Codon catalog usage and the genome hypothesis.
1980,
Pubmed
Graze,
Regulatory divergence in Drosophila melanogaster and D. simulans, a genomewide analysis of allele-specific expression.
2009,
Pubmed
Harris,
The Gene Ontology (GO) database and informatics resource.
2004,
Pubmed
Hasegawa,
Secondary structure of MS2 phage RNA and bias in code word usage.
1979,
Pubmed
Hershberg,
Selection on codon bias.
2008,
Pubmed
Hofacker,
RNA secondary structure analysis using the Vienna RNA package.
2004,
Pubmed
Ikemura,
Codon usage and tRNA content in unicellular and multicellular organisms.
1985,
Pubmed
Ingvarsson,
Gene expression and protein length influence codon usage and rates of sequence evolution in Populus tremula.
2007,
Pubmed
Karolchik,
The UCSC Genome Browser Database: 2008 update.
2008,
Pubmed
King,
Non-Darwinian evolution.
1969,
Pubmed
Klämbt,
A model for messenger RNA sequences maximizing secondary structure due to code degeneracy.
1975,
Pubmed
Kloster,
SCUMBLE: a method for systematic and accurate detection of codon usage bias by maximum likelihood estimation.
2008,
Pubmed
Kober,
Phylogenomics of strongylocentrotid sea urchins.
2013,
Pubmed
,
Echinobase
Kudla,
Coding-sequence determinants of gene expression in Escherichia coli.
2009,
Pubmed
Lowe,
tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence.
1997,
Pubmed
MANTEL,
Statistical aspects of the analysis of data from retrospective studies of disease.
1959,
Pubmed
Meyer,
Statistical evidence for conserved, local secondary structure in the coding regions of eukaryotic mRNAs and pre-mRNAs.
2005,
Pubmed
Mita,
Specific codon usage pattern and its implications on the secondary structure of silk fibroin mRNA.
1988,
Pubmed
Moriyama,
Gene length and codon usage bias in Drosophila melanogaster, Saccharomyces cerevisiae and Escherichia coli.
1998,
Pubmed
Moriyama,
Codon usage bias and tRNA abundance in Drosophila.
1997,
Pubmed
Musto,
Translational selection on codon usage in Xenopus laevis.
2001,
Pubmed
Navon,
The role of codon selection in regulation of translation efficiency deduced from synthetic libraries.
2011,
Pubmed
Novembre,
Accounting for background nucleotide composition when measuring codon usage bias.
2002,
Pubmed
Oliver,
Whole-genome positive selection and habitat-driven evolution in a shallow and a deep-sea urchin.
2010,
Pubmed
,
Echinobase
Palumbi,
MITOCHONDRIAL DNA DIVERSITY IN THE SEA URCHINS STRONGYLOCENTROTUS PURPURATUS AND S. DROEBACHIENSIS.
1990,
Pubmed
,
Echinobase
Pederson,
The sea urchin's siren.
2006,
Pubmed
,
Echinobase
Plotkin,
Synonymous but not the same: the causes and consequences of codon bias.
2011,
Pubmed
Powell,
Rates of DNA evolution in Drosophila depend on function and developmental stage of expression.
1993,
Pubmed
Powell,
Evolution of codon usage bias in Drosophila.
1997,
Pubmed
Qin,
Intragenic spatial patterns of codon usage bias in prokaryotic and eukaryotic genomes.
2004,
Pubmed
Qiu,
Patterns of codon usage bias in Silene latifolia.
2011,
Pubmed
Rao,
Mutation bias is the driving force of codon usage in the Gallus gallus genome.
2011,
Pubmed
Saldanha,
Java Treeview--extensible visualization of microarray data.
2004,
Pubmed
Shabalina,
A periodic pattern of mRNA secondary structure created by the genetic code.
2006,
Pubmed
Sharp,
An evolutionary perspective on synonymous codon usage in unicellular organisms.
1986,
Pubmed
Sharp,
Codon usage in yeast: cluster analysis clearly differentiates highly and lowly expressed genes.
1986,
Pubmed
Sharp,
DNA sequence evolution: the sounds of silence.
1995,
Pubmed
Shields,
Synonymous codon usage in Bacillus subtilis reflects both translational selection and mutational biases.
1987,
Pubmed
Shields,
"Silent" sites in Drosophila genes are not neutral: evidence of selection among synonymous codons.
1988,
Pubmed
Singh,
Patterns of mutation and selection at synonymous sites in Drosophila.
2007,
Pubmed
Sodergren,
The genome of the sea urchin Strongylocentrotus purpuratus.
2006,
Pubmed
,
Echinobase
Stajich,
The Bioperl toolkit: Perl modules for the life sciences.
2002,
Pubmed
Stenico,
Codon usage in Caenorhabditis elegans: delineation of translational selection and mutational biases.
1994,
Pubmed
Stoletzki,
Synonymous codon usage in Escherichia coli: selection for translational accuracy.
2007,
Pubmed
Stoletzki,
The surprising negative correlation of gene length and optimal codon use--disentangling translational selection from GC-biased gene conversion in yeast.
2011,
Pubmed
Strathmann,
THE EVOLUTION AND LOSS OF FEEDING LARVAL STAGES OF MARINE INVERTEBRATES.
1978,
Pubmed
Tuller,
An evolutionarily conserved mechanism for controlling the efficiency of protein translation.
2010,
Pubmed
Vicario,
Developmental stage and level of codon usage bias in Drosophila.
2008,
Pubmed
Wei,
A database of mRNA expression patterns for the sea urchin embryo.
2006,
Pubmed
,
Echinobase
WOOLF,
On estimating the relation between blood group and disease.
1955,
Pubmed
Wright,
The 'effective number of codons' used in a gene.
1990,
Pubmed
Zama,
Codon usage pattern in alpha 2(I) chain domain of chicken type I collagen and its implications for the secondary structure of the mRNA and the synthesis pauses of the collagen.
1990,
Pubmed
Zuckerkandl,
Molecules as documents of evolutionary history.
1965,
Pubmed