Long Noncoding RNA as Modular Scaffold of Histone Modification Complexes

See allHide authors and affiliations

Science  06 Aug 2010:
Vol. 329, Issue 5992, pp. 689-693
DOI: 10.1126/science.1192002


Long intergenic noncoding RNAs (lincRNAs) regulate chromatin states and epigenetic inheritance. Here, we show that the lincRNA HOTAIR serves as a scaffold for at least two distinct histone modification complexes. A 5′ domain of HOTAIR binds polycomb repressive complex 2 (PRC2), whereas a 3′ domain of HOTAIR binds the LSD1/CoREST/REST complex. The ability to tether two distinct complexes enables RNA-mediated assembly of PRC2 and LSD1 and coordinates targeting of PRC2 and LSD1 to chromatin for coupled histone H3 lysine 27 methylation and lysine 4 demethylation. Our results suggest that lincRNAs may serve as scaffolds by providing binding surfaces to assemble select histone modification enzymes, thereby specifying the pattern of histone modifications on target genes.

Long intergenic noncoding RNAs (lincRNAs) regulate dosage compensation, imprinting, and developmental gene expression by establishing chromatin domains in an allele- and cell-type specific manner (1, 2). LincRNAs are intimately associated with chromatin-remodeling complexes (37), but molecular mechanisms of their functions are still lacking. Posttranslational modifications of histones recruit DNA-binding proteins and chromatin-remodeling machinery and are often coupled for combinatorial control (8). For instance, in embryonic stem cells many genes, such as the HOX, that encode developmental regulators are transcriptionally silent but possess bivalent histone H3 lysine 4 (H3K4) and lysine 27 (H3K27) methylation, which are resolved into univalent H3K4 or H3K27 methylation domains upon differentiation (9, 10). Here, we show that a lincRNA can coordinate histone modifications by binding to multiple histone modification enzymes.

The lincRNA HOTAIR is transcribed from the HOXC locus and targets polycomb repressive complex 2 (PRC2, which comprises H3K27 methylase EZH2, SUZ12, and EED) to silence HOXD and select genes on other chromosomes (7, 11). The genomic regions flanking HOXD are also bound by CoREST/REST repressor complexes (12), which contain LSD1 (KDM1/BHC110), a demethylase that mediates enzymatic demethylation of H3K4me2 (13) and that is required for proper repression of Hox genes in Drosophila (14). We therefore hypothesized that HOTAIR may coordinately interact with both PRC2 and LSD1. Immunoprecipitation (IP) of either endogenous LSD1 or FLAG-tagged LSD1 from primary foreskin fibroblasts or HeLa cells specifically retrieved endogenous HOTAIR RNA with enrichment comparable with that of EZH2 IP, the positive control (Fig. 1A and fig. S1A) (15). IP of three other chromatin proteins did not retrieve HOTAIR (fig. S1A), and neither LSD1, EZH2, nor FLAG-LSD1 IP retrieved U1 RNA, a nuclear ncRNA that served as a negative control. Purified biotinylated HOTAIR RNA, but not green fluorescent protein (GFP) RNA or an antisense HOTAIR fragment, specifically retrieved EZH2, SUZ12, and LSD1 from HeLa cell nuclear extract (Fig. 1B and fig. S1B). LSD1 forms a complex with CoREST (16), which can bridge LSD1 to the neuronal gene silencer REST (17). REST is believed to mediate silencing through two distinct effector arms: one via LSD1-CoREST, and separately via the adaptor protein CDYL and the H3K9 KMT G9a (18). HOTAIR specifically bound to CoREST and REST but not CDYL or G9a, nor to the putative PRC1 subunit YY1 (Fig. 1B). Further, biotinylated HOTAIR bound to purified PRC2 and LSD1 complexes in vitro (Fig. 1C and fig. S1C). These results suggest that HOTAIR directly interacts with PRC2 and LSD1 complexes.

Fig. 1

5′ domain of HOTAIR binds PRC2, and 3′ domain of HOTAIR binds LSD1. (A) LSD1 IP specifically retrieves HOTAIR RNA. Data (mean ± SD, n = 3 replicates) is relative to mock-IP (immunoglobulin G or FLAG). ND, not detectable. (B) In vitro transcribed (IVT) biotinylated HOTAIR retrieves EZH2, LSD1, CoREST, and REST but not G9a, CDYL, or YY1. (C) IVT biotinylated HOTAIR binds to purified PRC2 and LSD1 complexes. PRC2-3m, recombinant purified core PRC2 complex with three members (EZH2, SUZ12, EED); PRC2-5m, recombinant purified PRC2 complex with five members (+RbAP48, AEBP2);, tandem affinity purified protein complex associated with FLAG-HA-LSD1 from HeLa cells. Composition of protein complexes are shown in fig. S1C. (D) The first 300 bp (lined boxes) of HOTAIR is necessary and sufficient to bind PRC2; the last 646 bp (meshed boxes) is necessary and sufficient to bind LSD1 complex. The profiles were established by means of RNA pull-down of HeLa extract; retrieved proteins were detected through immunoblotting.

Using a series of HOTAIR deletion mutants, the PRC2-binding activity mapped to nucleotides 1 to 300 of HOTAIR, whereas the LSD1 complex–binding activity mapped to nucleotides 1500 to 2146 (Fig. 1D). Deletion mutants that retained nucleotides 1 to 300 bound EZH2 or SUZ12 with equal efficiency as full-length HOTAIR, and deletion mutants that retained nucleotides 1500 to 2146 retained LSD1-binding activity. Thus, HOTAIR is a modular bifunctional RNA that has distinct binding domains for PRC2 and LSD1 complexes. Computational analysis and RNA footprinting showed that the PRC2- and LSD1-binding domains of HOTAIR are likely to possess extensive but distinct secondary structures (fig. S2).

The presence of independent binding sites for PRC2 and LSD1 on HOTAIR suggests that HOTAIR may bridge PRC2 and LSD1 complexes. EZH2 IP retrieved LSD1, and conversely LSD1 IP retrieved EZH2 from foreskin fibroblasts (Fig. 2A). We estimate that less than 5% of the two complexes physically interact with each other, which is consistent with prior purification results that isolated PRC2 and CoREST-LSD1 as separate complexes (19, 20). RNA interference (RNAi) of HOTAIR or ribonuclease treatment of the IP abrogated the interaction between EZH2 and LSD1, suggesting that HOTAIR is required to bridge this interaction (Fig. 2A and fig. S3). Wild-type HeLa cells or HeLa cells stably expressing FLAG-LSD1 (FL-HeLa) expressed ~10-fold less HOTAIR than foreskin fibroblasts and showed undetectable endogenous interaction between PRC2 and LSD1. Enforced expression of HOTAIR in FL-HeLa cells to a level comparable with foreskin fibroblasts allowed robust interaction between PRC2 and LSD1 (Fig. 2B). Gel filtration chromatography confirmed that HOTAIR expression shifts PRC2 subunits into a higher molecular weight complex coincident with the LSD1 complex, suggesting the formation of a higher ordered complex composed of HOTAIR, PRC2, and LSD1 complexes in HOTAIR-overexpressing cells (fig. S4). Moreover, expression of each HOTAIR mutant that lacked the ability to bind either PRC2 or LSD1 in vitro failed to induce PRC2-LSD1 interaction in cells (Fig. 2C and fig. S3C).

Fig. 2

HOTAIR is necessary and sufficient for interaction between EZH2 and LSD1. (A) In foreskin fibroblasts, EZH2 interacts with LSD1 (lanes 1 and 4). Knockdown of HOTAIR (lanes 3 and 6), but not GFP (lanes 2 and 5), abolishes this interaction. HOTAIR levels (mean ± SD) are shown on the right. (B) HOTAIR expression in FLAG-LSD1 HeLa cells induces EZH2 and LSD1 interaction (lanes 3 and 6). (C) Full-length HOTAIR induces EZH2 and LSD1 interaction (lanes 3 and 10) but not HOTAIR mutants lacking either 5′ or 3′ domain (lanes 4 to 7 and 11 to 14). Presence of indicated RNA domains was confirmed by means of RT-PCR (bottom).

HOTAIR-mediated bridging of PRC2 and LSD1 complexes also enables their coordinate binding to target genes on chromatin. HOTAIR is required for H3K27 methylation and transcriptional silencing across the HOXD locus (7). Therefore, we mapped PRC2 (as indicated by SUZ12) and LSD1 occupancy across the HOX loci and on promoters genome-wide by means of chromatin IP followed by microarray analysis (ChIP-chip) in primary foreskin fibroblasts after control RNAi or HOTAIR knockdown (Fig. 3 and figs. S5 to S7). HOTAIR knockdown decreased SUZ12 and LSD1 occupancy in a similar pattern across HOXD [Pearson’s correlation coefficient (R) = 0.59, P < 10−9, Student’s t test] (Fig. 3, A and B, and fig. S6). Coordinate loss of SUZ12 and LSD1 occupancy caused by HOTAIR knockdown were concentrated in proximal promoters of HOXD genes (Fig. 3B). These regions correspondingly lost H3K27me3 and gained H3K4me2, the respective histone methylation products of PRC2 and LSD1 complexes (R = 0.40, P < 10−9, Student’s t test) (Fig. 3, A and C, and figs. S6 and S7). The loss of H3K27me3 occurred across broad domains encompassing multiple HOXD genes and intergenic regions, whereas the gain of H3K4me2 was concentrated near the transcriptional start sites of HOXD genes (8). Multiple independent small interfering RNAs targeting HOTAIR gave the same results.

Fig. 3

HOTAIR coordinates localization of PRC2 and LSD1 genome-wide. (A) Changes in mRNA and occupancy of H3K4me2, H3K27me3, LSD1, and SUZ12 across HOXD locus after RNAi of HOTAIR in foreskin fibroblasts. Yellow boxes indicate regions of notable correlation between gain of H3K4me2 and concordant loss of LSD1, H3K27me3, and SUZ12. (B) The patterns of change in LSD1 (x axis) and SUZ12 occupancy (y axis) upon HOTAIR knockdown across the HOXD locus are significantly correlated (Pearson correlation, R = 0.59, P < 10−9, Student’s t test). This correlation is concentrated in proximal promoters of HOXD genes (R = 0.86). (C) Positive correlation of changes in SUZ12 (x axis) and H3K27me3 occupancy (y axis) and negative correlation of LSD1 (x axis) and H3K4me2 occupancy (y axis). (D) Venn diagram shows the genes occupied by SUZ12 (4740 genes), LSD1 (2116 genes), or both (721 genes). (E) Heat map of SUZ12 and LSD1 co-occupied genes (721 genes). Each column is an experiment; each row is a gene. HOTAIR knockdown led to concordant loss of SUZ12 and LSD1 occupancy. Chromatin occupancy is indicated in blue per the scale bar. (F) HOTAIR knockdown leads to transcription derepression of target genes. Mean ± SD of quantitative RT-PCR data are shown.

Examining human promoters genome-wide, ChIP-chip analysis showed that PRC2 and LSD1 occupied 4740 and 2116 gene promoters, respectively (Fig. 3D). Nearly one third of LSD1-occupied promoters, comprising 721 genes, were also occupied by SUZ12, revealing a significant overlap (257 overlap expected by chance alone; P = 3.4 × 10−164, hypergeometric distribution). Among these 721 genes co-occupied by SUZ12 and LSD1, the distances between the binding sites of SUZ12 and LSD1 were predominantly less than 500 base pairs (bp), which is the fragmentation size of chromatin in our ChIP assay and the limit of resolution (fig. S8A).

HOTAIR knockdown led to concordant loss of SUZ12 and LSD1 occupancy in 289 of the 721 genes normally co-occupied by SUZ12 and LSD1 (almost 40%) (Fig. 3E and table S1). Additional genes showed more exclusive loss of LSD1 occupancy (33%) or SUZ12 occupancy (16%), suggesting that HOTAIR may be involved in other LSD1- or SUZ12-dependent pathways. ChIP followed by quantitative polymerase chain reaction (PCR) confirmed the requirement of HOTAIR for PRC2 and LSD1 localization for all six genes tested (fig. S8C). HOTAIR knockdown did not change the chromatin occupancy by PRC2 and LSD1 at hundreds of other genes, nor did it affect the protein or mRNA level of the subunits of PRC2 or LSD1 complexes (Fig. 2A and fig. S9, A to C). The functional consequence of coordinate targeting of PRC2 and LSD1 by HOTAIR is gene repression: Genes co-occupied by SUZ12-LSD1 in a HOTAIR-dependent manner are also significantly induced upon HOTAIR knockdown as measured with microarray or quantitative reverse transcription PCR (RT-PCR) [P < 0.05, Gene Set Enrichment Analysis (21)] (Fig. 3F and fig. S8D). These results suggest that a single lincRNA—HOTAIR—may be required to target both PRC2 and LSD1 to hundreds of genes across the genome in order to coordinate histone modifications for gene silencing.

Both PRC2 and LSD1 can bind multiple proteins that are thought to provide DNA target specificity (17, 22). A possible consequence of the HOTAIR-mediated bridging is that PRC2 may be recruited to LSD1-CoREST-REST–binding sites, and conversely LSD1 may be recruited to PRC2-binding sites. Previous genome-scale mapping studies of PRC2 already identified the REST motif as one of the most enriched DNA sequence motifs within PRC2-binding sites but with no mechanistic explanation (23). We searched for enriched sequence motifs in SUZ12-binding sites lost upon HOTAIR knockdown (“HOT-S sites”) and identified several enriched motifs (24), including a motif that corresponds to the right half of the canonical REST motif (P = 1.05 × 10−12) (Fig. 4A and fig. S10). REST is able to bind only one half-site of the canonical REST motif (25), and genes containing HOT-S sites are enriched for experimentally measured REST occupancy (P < 1.27 × 10−16, hypergeometric distribution) (fig. S9D and table S2) (25). The most significantly enriched motif in LSD1-binding sites that are lost upon HOTAIR knockdown (“HOT-L sites”) is a CG-rich motif (P = 3.66 × 10−10) (Fig. 4B and fig. S10), which is important for PRC2 binding (23, 26, 27). Thus, the enrichment of the CG-rich motif may reflect the HOTAIR-dependent recruitment of LSD1 complexes to PRC2-bound sites, which are often in CpG islands. We examined the gain of SUZ12 and LSD1 occupancy on chromatin when HOTAIR is overexpressed in primary lung fibroblasts, which do not express endogenous HOTAIR. HOTAIR overexpression caused ectopic occupancy of LSD1 and SUZ12 that significantly overlapped (P < 7.31 × 10−95). Further, motif analysis of the ectopically gained binding sites recovered an almost identical CG-rich motif (P = 7.9 × 10−37) (Fig. 4C), suggesting that this motif is involved in HOTAIR target selection. Nonetheless, the REST half-site and the CG-rich motif are currently not sufficient for de novo prediction of all HOTAIR-dependent genes, suggesting that additional motifs, binding partners, and/or motif arrangements may be important.

Fig. 4

HOTAIR-dependent SUZ12- and LSD1-binding motifs. (A) SUZ12 occupancy sites lost upon HOTAIR knockdown (HOT-S sites) are enriched for a DNA motif very similar to the right half of canonical REST motif. (B) LSD1 occupancy sites lost upon HOTAIR knockdown (HOT-L sites) are enriched for a CG-rich motif. (C) A nearly identical CG-rich motif is enriched in LSD1/SUZ12–binding sites gained upon HOTAIR overexpression, suggesting that this motif is involved in HOTAIR target selection.

In this report, we demonstrate that the lincRNA HOTAIR can link a histone methylase and a demethylase by acting as a modular scaffold (fig. S11). Other lincRNAs may also contain multiple binding sites for distinct protein complexes that direct specific combinations of histone modifications on target gene chromatin. Some lincRNAs may be “tethers” that recruit several chromatin modifications to their sites of synthesis (2) while other lincRNAs can act on distantly located genes as “guides” to affect their chromatin states (2). On the basis of their dynamic patterns of expression (28), specific lincRNAs can potentially direct complex patterns of chromatin states at specific genes in a spatially and temporally organized manner during development and disease states.

Supporting Online Material

Materials and Methods

Figs. S1 to S11

Tables S1 and S2

References and Notes

References and Notes

  1. Materials and methods are available as supporting material on Science Online.
  2. Microarray data are deposited in Gene Expression Omnibus ( under accession number GSE22345. We thank members of the D. Herschlag lab for assistance with RNA footprinting and X. Tan, P. Khavari, and J. Wysocka for critical reading of the manuscript. This work was supported by the California Institute for Regenerative Medicine (RN1-00529-1 to H.Y.C.), NIH (R01-HG004361 to H.Y.C. and E.S and R01-CA118487 to Y.S), the Susan G. Komen Foundation (M.-C.T.), the Azrieli Foundation (O.M.), NSF (J.K.W.), and the Agency for Science, Technology, and Research (Y.W.). E.S. is the incumbent of the Soretta and Henry Shapiro career development chair. Y.S. is co-founder and on the scientific advisory board of Constellation Pharmaceuticals. H.Y.C. is an Early Career Scientist of the Howard Hughes Medical Institute.
View Abstract

Stay Connected to Science

Navigate This Article