Click
here to close Hello! We notice that
you are using Internet Explorer, which is not supported by Echinobase
and may cause the site to display incorrectly. We suggest using a
current version of Chrome,
FireFox,
or Safari.
BMC Genomics
2021 Oct 19;221:751. doi: 10.1186/s12864-021-07936-0.
Show Gene links
Show Anatomy links
Identification and prediction of developmental enhancers in sea urchin embryos.
Arenas-Mena C
,
Miljovska S
,
Rice EJ
,
Gurges J
,
Shashikant T
,
Wang Z
,
Ercan S
,
Danko CG
.
???displayArticle.abstract???
BACKGROUND: The transcription of developmental regulatory genes is often controlled by multiple cis-regulatory elements. The identification and functional characterization of distal regulatory elements remains challenging, even in tractable model organisms like sea urchins.
RESULTS: We evaluate the use of chromatin accessibility, transcription and RNA Polymerase II for their ability to predict enhancer activity of genomic regions in sea urchin embryos. ATAC-seq, PRO-seq, and Pol II ChIP-seq from early and late blastula embryos are manually contrasted with experimental cis-regulatory analyses available in sea urchin embryos, with particular attention to common developmental regulatory elements known to have enhancer and silencer functions differentially deployed among embryonic territories. Using the three functional genomic data types, machine learning models are trained and tested to classify and quantitatively predict the enhancer activity of several hundred genomic regions previously validated with reporter constructs in vivo.
CONCLUSIONS: Overall, chromatin accessibility and transcription have substantial power for predicting enhancer activity. For promoter-overlapping cis-regulatory elements in particular, the distribution of Pol II is the best predictor of enhancer activity in blastula embryos. Furthermore, ATAC- and PRO-seq predictive value is stage dependent for the promoter-overlapping subset. This suggests that the sequence of regulatory mechanisms leading to transcriptional activation have distinct relevance at different levels of the developmental gene regulatory hierarchy deployed during embryogenesis.
Fig. 1. ATAC-seq, PRO-seq and Pol II ChIP-seq are used for the identification of TREs. A Experimental outlines of the 3 genomic profiles used. B IGV browser snapshot of replicate genomic profiles at the H2A.Z locus, a highly expressed gene [36], left, which also includes a gene expressed at lower levels, right side. Number of 3â² end reads per million of PRO-seq run-on transcripts are shown for the plus and minus strands. PRO-seq peaks mark transcriptional pause sites. MACS peak and dREG TRE predictions for the combined data sets are shown underscoring each genomic profile. The CRM panel underscores a genomic region with enhancer activity tested by deletion in large reporter constructs [25]. PRO-seq and ATAC-seq profiles are set to the same scale between 12 and 20âh stages, with the range indicated between brackets at the beginning of each track
Fig. 2. Genome-wide PRO-, ATAC- and ChIP-seq analysis. A Distribution of signal intensity and reproducibility estimation between distinct biological replicates for the different data sets in 12âh embryos. Overlap of points indicated by the color gradient. B Histograms of the number of reads per peak call for the different data sets in A. C Distribution of signal and reproducibility in 20âh embryos. D Histograms of the number of reads per peak call for the different data sets in C. E Venn diagrams of the overlap between ATAC and Pol II ChIP peak calls, and dREG predicted TREs in 20âh embryos. F Venn diagrams of the overlap of ATAC and dREG peak calls between stages
Fig. 3. ATAC-, Pol II ChIP- and PRO-seq sea urchin embryos at the SpHox11/13b locus. For ATAC- and PRO-seq, the scale in reads per million at the start of each track is maintained at the same range between states and equal between plus and minus strands. The whole region was scanned for enhancer activity by overlapping 3â5 Kb reporter constructs [24], only active CRMs are indicated, in green those active in both stages, and in gray those inactive or with unknown activity in these stages as indicated in the text
Fig. 4. Modeling of CRM reporter activity from of ATAC-, Pol II ChIP- and PRO-seq. A Violin/box-plot of the ATAC, Pol II ChIP peak call and dREG TRE prediction sizes, and the 389 CRMs. The inset plots the size distributions of active and inactive CRMs, which is not significatively different. B and C, ranked CRM expression plot in 12 and 24âh embryos, respectively. The blue line at 1 marks the CRM expression level when it equals that of the basal-promoter reporter. The red line by the curve âelbowâ marks the 2 fold above control chosen as the expression threshold. D Violin/box-plots of PRO-, ATAC-, and Pol II ChIP-seq significatively different signals between active and inactive CRMs in 12 and 20âh embryos. E, top, 12âh embryo Receiver Operating Characteristics (ROC) and, bottom, Precision-Recall Curves (PRC) of the logistic regression models trained and tested by 5 fold cross-validation repeated 200 times. Area Under the ROC (AUROC) and AUPRC as indicated for each model. Dotted lines mark random guess prediction performance, a mid-diagonal for ROC and a horizontal line at the fraction of active CRMs for PRC. The absolute AUPRC indicated in bold and the difference with random guess in parenthesis. F ROCs and PRCs in 20âh embryos. G, top, PRCs evaluating the enhancer activity predictions for the CRM promoter-overlapping data set of models trained with the entire 20âh CRM data set. Bottom, model predictions for the complementary, non-promoter overlapping data set. H Violin/box-plot of the AUPRC after cross-validation with different predictors, as indicated; All, includes the sum and max of the 3 genomic profiles allowing second order interactions among predictors; dREG-max, signifies the sum of the maximum values at dREG peaks
Fig. 5. Quantitative prediction of enhancer activity from PRO-seq data. Plot of the hold-out predicted against the actual reporter expression of linear regression models using ATAC and PRO-seq signal at dREG predictions tested by five-fold cross-validation. Violin/Box-plot of R2 values, with the average indicated underneath
Andersson,
A unified architecture of transcriptional regulatory elements.
2015, Pubmed
Andersson,
A unified architecture of transcriptional regulatory elements.
2015,
Pubmed
Andersson,
Determinants of enhancer and promoter activities of regulatory elements.
2020,
Pubmed
Arenas-Mena,
The origins of developmental gene regulation.
2017,
Pubmed
Arenas-Mena,
Hindgut specification and cell-adhesion functions of Sphox11/13b in the endoderm of the sea urchin embryo.
2006,
Pubmed
,
Echinobase
Barolo,
Three habits of highly effective signaling pathways: principles of transcriptional control by developmental cell signaling.
2002,
Pubmed
Barsi,
cis-Regulatory control of the initial neurogenic pattern of onecut gene expression in the sea urchin embryo.
2016,
Pubmed
,
Echinobase
Boettiger,
Synchronous and stochastic patterns of gene activation in the Drosophila embryo.
2009,
Pubmed
Bozek,
Developmental Transcriptional Enhancers: A Subtle Interplay between Accessibility and Activity: Considering Quantitative Accessibility Changes between Different Regulatory States of an Enhancer Deconvolutes the Complex Relationship between Accessibility and Activity.
2020,
Pubmed
Britten,
Gene regulation for higher cells: a theory.
1969,
Pubmed
Buenrostro,
Transposition of native chromatin for fast and sensitive epigenomic profiling of open chromatin, DNA-binding proteins and nucleosome position.
2013,
Pubmed
Chambers,
Mechanisms of Groucho-mediated repression revealed by genome-wide analysis of Groucho binding and activity.
2017,
Pubmed
Chu,
Chromatin run-on and sequencing maps the transcriptional regulatory landscape of glioblastoma multiforme.
2018,
Pubmed
Core,
Analysis of nascent RNA identifies a unified architecture of initiation regions at mammalian promoters and enhancers.
2014,
Pubmed
Cui,
Sequential Response to Multiple Developmental Network Circuits Encoded in an Intronic cis-Regulatory Module of Sea Urchin hox11/13b.
2017,
Pubmed
,
Echinobase
Davidson,
A provisional regulatory gene network for specification of endomesoderm in the sea urchin embryo.
2002,
Pubmed
,
Echinobase
Davidson,
Properties of developmental gene regulatory networks.
2008,
Pubmed
,
Echinobase
De Santa,
A large fraction of extragenic RNA pol II transcription sites overlap enhancers.
2010,
Pubmed
Engreitz,
Local regulation of gene expression by lncRNA promoters, transcription and splicing.
2016,
Pubmed
Ernst,
Mapping and analysis of chromatin state dynamics in nine human cell types.
2011,
Pubmed
Flytzanis,
Persistence and integration of cloned DNA in postembryonic sea urchins.
1985,
Pubmed
,
Echinobase
Gasperini,
Towards a comprehensive catalogue of validated and target-linked human enhancers.
2020,
Pubmed
Haberle,
Eukaryotic core promoters and the functional basis of transcription initiation.
2018,
Pubmed
Haberle,
Promoter architectures and developmental gene regulation.
2016,
Pubmed
Hajdu,
Transcriptional and post-transcriptional regulation of histone variant H2A.Z during sea urchin development.
2016,
Pubmed
,
Echinobase
Harmston,
The mystery of extreme non-coding conservation.
2013,
Pubmed
Jonkers,
Getting up to speed with transcription elongation by RNA polymerase II.
2015,
Pubmed
Klein,
A systematic evaluation of the design and context dependencies of massively parallel reporter assays.
2020,
Pubmed
Kwak,
Precise maps of RNA polymerase reveal how promoters direct initiation and pausing.
2013,
Pubmed
Langmead,
Fast gapped-read alignment with Bowtie 2.
2012,
Pubmed
Lee,
Exclusive developmental functions of gatae cis-regulatory modules in the Strongylocentrorus purpuratus embryo.
2007,
Pubmed
,
Echinobase
Li,
Encoding regulatory state boundaries in the pregastrular oral ectoderm of the sea urchin embryo.
2014,
Pubmed
,
Echinobase
Malik,
Parallel embryonic transcriptional programs evolve under distinct constraints and may enable morphological conservation amidst adaptation.
2017,
Pubmed
,
Echinobase
McMahon,
Introduction of cloned DNA into sea urchin egg cytoplasm: replication and persistence during embryogenesis.
1985,
Pubmed
,
Echinobase
Mikhaylichenko,
The degree of enhancer or promoter activity is reflected by the levels and directionality of eRNA transcription.
2018,
Pubmed
Monti,
Limb-Enhancer Genie: An accessible resource of accurate enhancer predictions in the developing limb.
2017,
Pubmed
Nam,
Functional cis-regulatory genomics for systems biology.
2010,
Pubmed
,
Echinobase
Nam,
Barcoded DNA-tag reporters for multiplex cis-regulatory analysis.
2012,
Pubmed
,
Echinobase
Oliveri,
Global regulatory logic for specification of an embryonic cell lineage.
2008,
Pubmed
,
Echinobase
Peter,
A gene regulatory network controlling the embryonic specification of endoderm.
2011,
Pubmed
,
Echinobase
Poglitsch,
Rapid isolation of nuclei from living immune cells by a single centrifugation through a multifunctional lysis gradient.
2011,
Pubmed
Quinlan,
BEDTools: a flexible suite of utilities for comparing genomic features.
2010,
Pubmed
Quintero-Cadena,
Enhancer Sharing Promotes Neighborhoods of Transcriptional Regulation Across Eukaryotes.
2016,
Pubmed
Ransick,
cis-regulatory processing of Notch signaling input to the sea urchin glial cells missing gene during mesoderm specification.
2006,
Pubmed
,
Echinobase
Revilla-i-Domingo,
A missing link in the sea urchin embryo gene regulatory network: hesC and the double-negative specification of micromeres.
2007,
Pubmed
,
Echinobase
Revilla-i-Domingo,
R11: a cis-regulatory node of the sea urchin embryo gene network that controls early expression of SpDelta in micromeres.
2004,
Pubmed
,
Echinobase
Saito,
Precrec: fast and accurate precision-recall and ROC curve calculations in R.
2017,
Pubmed
Saito,
The precision-recall plot is more informative than the ROC plot when evaluating binary classifiers on imbalanced datasets.
2015,
Pubmed
Sethi,
Supervised enhancer prediction with epigenetic pattern recognition and targeted validation.
2020,
Pubmed
Shashikant,
Global analysis of primary mesenchyme cell cis-regulatory modules by chromatin accessibility profiling.
2018,
Pubmed
,
Echinobase
Tippens,
Transcription imparts architecture, function and logic to enhancer units.
2020,
Pubmed
Tu,
Quantitative developmental transcriptomes of the sea urchin Strongylocentrotus purpuratus.
2014,
Pubmed
,
Echinobase
Villar,
Enhancer evolution across 20 mammalian species.
2015,
Pubmed
Wang,
Identification of regulatory elements from nascent transcription using dREG.
2019,
Pubmed
Zabidi,
Enhancer-core-promoter specificity separates developmental and housekeeping gene regulation.
2015,
Pubmed
Zabidi,
Regulatory Enhancer-Core-Promoter Communication via Transcription Factors and Cofactors.
2016,
Pubmed
Zhang,
Model-based analysis of ChIP-Seq (MACS).
2008,
Pubmed
Zhu,
ChIPpeakAnno: a Bioconductor package to annotate ChIP-seq and ChIP-chip data.
2010,
Pubmed