Pan-Cancer Analyses of the Nuclear Receptor Superfamily

Nuclear receptors (NR) act as an integrated conduit for environmental and hormonal signals to govern genomic responses, which relate to cell fate decisions. We review how their integrated actions with each other, shared co-factors and other transcription factors are disrupted in cancer. Steroid hormone nuclear receptors are oncogenic drivers in breast and prostate cancer and blockade of signaling is a major therapeutic goal. By contrast to blockade of receptors, in other cancers enhanced receptor function is attractive, as illustrated initially with targeting of retinoic acid receptors in leukemia. In the post-genomic era large consortia, such as The Cancer Genome Atlas, have developed a remarkable volume of genomic data with which to examine multiple aspects of nuclear receptor status in a pan-cancer manner. Therefore to extend the review of NR function we have also undertaken bioinformatics analyses of NR expression in over 3000 tumors, spread across six different tumor types (bladder, breast, colon, head and neck, liver and prostate). Specifically, to ask how the NR expression was distorted (altered expression, mutation and CNV) we have applied bootstrapping approaches to simulate data for comparison, and also compared these NR findings to 12 other transcription factor families. Nuclear receptors were uniquely and uniformly downregulated across all six tumor types, more than predicted by chance. These approaches also revealed that each tumor type had a specific NR expression profile but these were most similar between breast and prostate cancer. Some NRs were down-regulated in at least five tumor types (e.g., NR3C2/MR and NR5A2/LRH-1)) whereas others were uniquely down-regulated in one tumor (e.g., NR1B3/RARG). The downregulation was not driven by copy number variation or mutation and epigenetic mechanisms maybe responsible for the altered nuclear receptor expression.


Nuclear Receptor Gene Regulatory Actions are Integrated at Multiple Levels
The 48 human NRs form a major network to sense lipophilic molecules from diet, metabolism and hormone production, and regulate genes involved in development, metabolism, circadian rhythm, immune function, proliferation and differentiation [1][2][3][4][5][6].Reflecting this central role, they represent the target for approximately 15% of all pharmacologic drugs [7].
The NR superfamily can be classified in several ways.The superfamily can either be sub-divided by phylogenetics [8], or grouped according to the cellular location and ligand genomic response [9].In this latter classification, Type I receptors are typically cytoplasmic, associated with heatshock proteins, and ligand binding induces nuclear translocation.These receptors include the high affinity steroid receptors.Type II receptors, by contrast, are retained in the nucleus in the absence of ligand and continuously participate in chromatin modification events that are reversed upon ligand binding.These receptors are typified by NRs that bind micronutrient ligands, for example, NR1B1/RARA and NR1I1/VDR.This group can be further sub-divided to include the Type III receptors that bind as homodimers in the absence of ligand at direct repeats and include NR2A1/HNF4A.Finally, Type IV receptors bind to DNA through only a single binding domain (as a receptor monomer or dimer); this group includes the orphan receptors NR1D1/EAR1 and NR1F1/ROR1.Amongst NRs there are well-established examples of cooperative or antagonistic behavior at the gene regulatory level, for instance resulting in the antagonizing transcriptional effects of ERa and RARs in breast cancer cells [10].Conversely, RXR heterodimerization potentiates the actions of several NRs as illustrated in studies combining 9-cis retinoic acid (RXR ligand) with a range of other ligands which has combinatorial effects on cellular phenotypes [11][12][13][14] which are mediated through underlying regulation of the global transcriptome [15][16][17][18].
The interactions of NRs with coactivators and corepressors has revealed further levels of integration and suggest that gene regulation is dispersed across NRs by virtue of co-factor sharing.Coactivators, such as NCOA3/AIB1, are vital for transactivation by being a platform for the proteins that govern chromatin remodeling and looping, and the sequestration of the basal transcriptional machinery.Similarly, but in an opposite manner, corepressors act to silence or suppress transcription [19][20][21].
Outside of NR interactions with one another and with corepressors and coactivators, it is also clear that their signaling actions are guided by the actions of pioneer factors such as Forkhead box (FOX) family members [22][23][24] and integrated with other transcription factor signaling pathways [25], including WNT [26], p53 [27][28][29][30][31], SMADs [32][33][34] and KLFs [35,36].One elegant approach to capture such interactions was undertaken by Novershtern et al. [37] who measured the transcriptome profiles of a large number of hematopoietic stem cells, multiple progenitor states and terminally differentiated cell types.They found distinct regulatory circuits in both stem cells and differentiated cells, and identified 80 distinct modules of tightly co-expressed genes in the hematopoietic system.For example, one module was expressed in granulocytes and monocytes and included genes encoding enzymes and cytokine receptors that are essential for inflammatory responses.Major players in this module were VDR together with the factors CEBPa and SPI1/PU.1.This suggests that the VDR works together with this small set of transcription factors, in order to regulate granulocyte and monocyte differentiation.It is reasonable to anticipate that such modules exist in multiple cell types but are guided by the tissue specific expression of NRs and other factors.As genomic-based approaches are increasingly applied to individual receptors, and groups of NRs, it is becoming ever more clear how their combined actions are central to co-ordinate complex gene regulatory programs that govern cell fate decisions (reviewed in [38]).

Distorted Nuclear Receptor Function in Cancer
The work of Dr. George Beatson [39] in breast cancer and Dr. Charles Huggins [40] in prostate cancer provided very clear evidence of steroid hormone signaling acting as cancer drivers [41,42].Aside from the well-established therapeutic targeting of AR and ERs there are established roles to target GR as a pro-apoptotic therapeutic approach in lymphoma [43][44][45].However ligand-activation of the GR has been less effective in other cancers and suggests the biology of the GR is more nuanced with regards to cancer biology.More recently in advanced therapy resistant settings in prostate cancer a role has been revealed for the GR to promote progression by essentially phenocopying the actions of the AR and suggests there are overlapping genomic functions of these receptors [46].Subsequently, other therapeutic NR roles emerged in cancer and leukemias, not to antagonize but rather to enhance their function.In this case, potentially as pivotal as the discovery of steroid hormone actions as cancer drivers, was the analyses of RAR functions in leukemia that revealed its actions were disrupted, but could be pharmacologically targeted [47].
This was critical for several reasons.Firstly, all-trans retinoic acid (ATRA)-based leukemia therapy represents one of the earliest and most successful examples of targeted therapies and provided a paradigm for other therapies [48][49][50][51]; although of course clinical trial success with ATRA preceded cloning of the RARs [52,53].Secondly, pioneering work by the groups of Dr. Pier Pelicci [54] and Dr. Ron Evans [55] revealed mechanisms that corrupted RAR signaling in leukemia, specifically in acute promyelocytic leukemia.This leukemia is characterized by translocations of the RARa, including PML-RAR, which generate chimeric receptors that inappropriately retained association with corepressors.This discovery established the premise that altered NR interactions with coactivators and corepressors might have an epigenetic consequence that in turn could be targeted by co-treatment with epigenetic therapies.More widely, the therapeutic success of ATRA in leukemias, in socalled differentiation therapies, was a major catalyst for the explorations of the anticancer actions of other, principally type II, NRs in a wide range of leukemias and solid tumors.In this manner RARs, VDR, PPARs and more recently LXRs and FXR have been considered as potentially druggable targets across cancers [56][57][58][59][60][61][62].With the increasing number of genomic studies it has also emerged that NRs are disrupted in various tumor types; for example DNA CpG methylation of RARB [63][64][65] and copy number variation of NR1D1 [66] and RARA [67].
More recently, many of these earlier findings have been revisited.For example, reflecting the work of Huggins, the role of ER and ER signaling in the prostate have been reinvestigated and distortions to expression of these receptors appears important [68][69][70].Similarly, an appreciation has emerged of the importance of AR signaling in tissues other than the prostate, notably in breast cancer [71,72].

Nuclear Receptor Network Approaches in Cancer Cells
Given their interactive nature, various workers have examined NR networks in cancer.For example, profiling approaches using high throughput Q-PCR in breast cancer [73] and in silico analyses of prostate cancer data bases [74] both revealed a large complement of NR expressed in tumor and that expression profiles relate to tumor stage.Beyond expression profiling, other investigators have aimed to undertake cistromic analyses of multiple NRs and interacting transcription factors to construct a network level understanding of gene expression programs in breast cancer [10,75].These approaches identified high complexity enhancer sites that integrated the actions of multiple NRs and other transcription factors in both direct (cis) and indirect (trans) and often absent of canonical motifs but associated with significant levels of clustering [76][77][78].Cistromic analyses in breast cancer revealed the interactions and cross-talk of multiple NRs and revealed that RAR was amongst the most commonly found NR binding site with approximately 12000 RAR binding sites in MCF-7 breast cancer cells.These sites were significantly enriched with other NRs (e.g., RAR, PPAR, VDR, HNF4, ER), pioneer-type factors (FOXA1, SP1, STAT3) and co-regulators (CTCF).Key aspects of these associations, focused around the RAR and RAR, were related to clinical outcome in breast cancer patients and supported the role of larger networks of NRs to control cell fates.Specifically, the data support the concept of cross-talk between RARs and VDR, which exert mitotic restraint, and other NRs, such as ER, that drive proliferation and survival [10,75].Other workers also identified a comparable number of RAR binding sites, many contained in a so-called Mega-Trans complex containing ER and RAR at important enhancers in breast cancer [79], and specifically identified a significant role for trans RAR genome binding.The importance of RAR to regulate ER, has been supported further by RNAi screens in breast cancer cells aimed at dissecting tamoxifen resistance [80].
It is tempting to speculate that there are perhaps more general rules for these interactions, with specificities of coactivators or corepressors for certain types of receptors.However, there are few ChIP-Seq studies for these coactivators and corepressors and largely they have not been analyzed in an unbiased manner.To address this issue we recently undertook an integrative genomics analyses of the NCOR1 cistrome by exploiting ENCODE data [106].Surprisingly, we found that within the NCOR1 cistrome, NR motifs of any type were not the most commonly enriched, compared to other transcription factors.Of those NR that were enriched, there were both Type 1 (ER) and Type II (PPAR) motifs.This suggests that NCOR1 and NCOR2/SMRT involvement with NR function is either not a major aspect of their function or direct DNA interaction by NR in a cis relationship is limited, and that recruitment may be facilitated by pioneer and other integrating transcription factors.Further integrative and unbiased approaches will be critical in resolving the extent and specificities of coactivator and corepressor interactions in an unbiased manner with NRs and other TFs.

Integrative Analyses of the Nuclear Receptor Network Expression in Cancer
Given the availability of high quality genomic data for multiple tumors it is possible to investigate NR expression and function individually and in gene networks across different tumor and tissue controls [107][108][109][110][111][112][113][114][115].Key to such analyses is the work of The Cancer Genome Atlas (TCGA) [116,117] which provides genome-wide insight in large numbers of multiple tumor types; at the time of writing this 21441 tumors across 91 cancer studies.Previously, we exploited the Taylor et al. cohort of prostate cancer tumors [118] available through TCGA and mined the NR network [74].These analyses revealed that the NR superfamily expression was significantly lost in primary PCa, more than predicted by chance [74].These findings suggested a global distortion to the NR superfamily expression in PCa, and given the diversity of NRs detected, that prostate tissue maintenance relies on its ability to sense and respond to a range of hormonal and dietary lipophilic compounds.For instance, members of the RARs, RXRs, LXRs, and PPARs were deregulated in a substantial proportion of these patients.
In parallel, to identify potential mechanisms driving these disruptions we also mined microRNAs that are predicted to target specific NRs, and revealed a significant and reciprocal gain of expression of NR targeting microRNA that was more than predicted by chance.For example, miR-106b was amongst the most upregulated miRNA in our analysis and targets several down regulated NRs, including PPARA, suggesting that regulation by miRNAs add yet another layer of complexity within the NR network.Together, these observations of reciprocal NR and miRNA co-expression suggest that epigenetic distortion is important to distort the NR superfamily in the prostate [74,[119][120][121].

Pan-Cancer Post-Genomic Assessment of the Nuclear Receptor Superfamily
These findings in the literature, and our own work to date, support the concept that the NR superfamily acts in an integrated manner and is disrupted by multiple mechanisms across cancers.To complement this review we have also collated and analyzed NR expression from multiple TCGA cohorts.We have undertaken a comprehensive analysis of the NR superfamily across multiple cancer types in TCGA with the goal of establishing the tumor-specific and pancancer extent to which NRs are distorted.Specifically, from over 3000 tumors spread across six different cancer types we have examined NR expression, copy number variation and mutation status (Figure 1).Data sets.All analyses, unless otherwise indicated, were undertaken using the R platform for statistical computing (version 3.1.0)[67], and a range of library packages were implemented in Bioconductor [122].All transcription factor (TF) family annotations and their inclusive gene identifiers, including NRs were obtained through the HUGO Gene Nomenclature Committee [123].In the first instance normalized RPKM RNA-seq data from the cohorts were downloaded through the UCSC Cancer Genomics Browser [124] (Figure 1, Table 1, Table 2).Only primary, not metastatic, tumor data was considered, and only NRs that were detectible in at least 80% of normal and tumor samples were included in the expression analyses.
Statistical Analyses.To establish if expression levels of NRs and other transcription factors were different in tumors compared to normal samples, we first established the mean and distribution of expression of all genes in pools of normal samples and then calculated the relative expression of genes in tumor samples using Z scores.In this manner, the tumor expression of all genes, including NRs and other TFs, were converted into normal tissue relative Z-scores and significantly altered tumor expression of detectable genes was determined by considering only values that were elevated (Z ≥ 2) or suppressed (Z ≤ -2).For copy-number analysis, previously determined copy number variation (CNV) estimates (via the GISTIC 2 method) were directly downloaded and utilized.
For mutation analysis, only transcripts containing proteincoding sequences (CDS) were considered.CDS lengths from all exons associated with a given gene were compiled from Ensembl using BioMart [125].To correct for mutation frequencies being proportional to the coding length, all regions exons for a given gene were added (including all alternative exons) to yield the total CDS length for each gene.Mutation frequencies (mutations/protein coding base pair) were then calculated utilizing the number of mutations detected across tumors, the number of tumors, and the CDS lengths for all protein coding genes, including NRs and other indicated TF families.
To test if NR superfamily tumor tissue relative expression Z-scores were significantly different than expected by chance we utilized bootstrapping permutation approaches.For instance in BRCA, we determined that the average NR is upregulated (Z ≥ 2) in 6.72% of tumors, while downregulated (Z ≤ -2) in 26.05% of tumors (Figure 2, Table 3).Specifically, to test whether these observations were more or less than would be predicted by chance, we applied bootstrap approaches.This approach applies a random sampling method to simulate the distribution of expression changes across the transcriptome for comparison to observed findings [126].We sampled 100,000 replicates of random gene sets equal to the size of the number of NRs detectable (all NRs detected in >80% tumors and normal samples) in each respective cancer (e.g., 42 in BRCA), within the detectable transcriptome gene set (e.g.,  = 16, 622 in BRCA), and thus determined the distribution of significant relative expression changes across all genes within the patient cohort allowing us to directly compare the observed alterations of NRs.
Similarly, CNV frequency was determined for all genes across the genome (e.g., percentage of tumors with detectable CNV).For instance in BRCA, we observed that the average NR was amplified to some extent in 24.06% of tumors, while     deleted in 20.90% of tumors (Table 5).Likewise, random sampling ( = 100, 000) was performed to identify the background genomic distribution of copy number alterations in equivalent sized gene sets, to which the NR superfamily observations were directly compared.Lastly, mutation frequency was determined for all protein coding genes across the genome as described above (mutations/protein coding base pair (bp)).Again, using BRCA as an example, the average NR has a mutation frequency of 1.11E-06 mutations/bp, with the most commonly mutated member being ESR1 (mutation frequency = 1.72E-06) (Figure 4, Table 4).Random sampling approaches were similarly applied to query this observation against the background protein coding genome.
In all cases, empirical  -values were calculated based on the position of determined NR observations relative to the sampling distribution of the genome, to simply determine how probable the observations would be considered likely to occur by random chance.In the same manner stated above, observations and statistical testing were determined for 12 other transcription factor families (

Nuclear receptor superfamily expression is distorted
in both common and unique manners across tumors.The majority of the members of the NR superfamily were expressed across the six tumor types.That is, considering NRs that were expressed at a detectable level (RPKM > 0) in >80% of normal and tumor samples revealed that 37 NRs were detected across all six tumor types, and the smallest number expressed was in BLCA which expressed 40 members, whereas the remainder of tissues expressed 42 members.Few tissues shared a common set of NRs, the exception being prostate and breast.Some NRs were undetectable across all tissues, including NR5A1/SF1 and NR0B1/DAX1, while others had distinct patterns of expression including FXR, which was detectible only in colon and liver.
Staying with BRCA as an example, Figure 2 shows the relative expression of the 42 NRs across 1095 tumors compared to a pool of 113 matched normal samples.The bootstrap analyses reveal that collectively, NRs are significantly less overexpressed than predicted by chance, and more underexpressed than predicted by chance.This was not a unique event to the BRCA cohort, but rather was seen across all tumor types examined (Table 3).This collective, pancancer downregulation was unique to NRs and not observed to a significant extent across all cancer types for any of the other 12 TF families examined.The closest were the KLF and SMAD families.For example, KLF members were collectively and significantly underexpressed in breast, colon and liver cancer types.By contrast, the E2Fs displayed the opposite pattern and were collectively overexpressed in at least three of the tumor types.Other TF families, including the FOX and GATA family members, showed little consensus pattern or were not collectively altered in any cancer.These findings would support the concept that NRs, and interacting TFs that control differentiation such as KLFs and SMADs, are significantly downregulated in a wide spectrum of tumors [32][33][34][35][36].
Next, we sought to compare the patterns of NR expression across tumor types (Figure 2C).To achieve this, a relative expression score was calculated by summing the total Z scores across all tumors of a given cancer type and normalizing each by the square root of the number of tumors available for that respective cancer.Reflecting the fact that as a whole the NR superfamily is significantly downregulated, very few individual NRs had a relative expression score that was positive, and none were positive across all tumors.NR2F6/EAR2 was over-expressed in BLCA, BRCA, and PRCA which was also reported previously in bladder cancer cell lines [127].Its expression was also elevated in BRCA and there are several reports of the elevated expression and function of this orphan NR in breast cancer [73] (reviewed in [128]).Also in breast cancer RARA and NR1I3 were elevated.RARA is coamplified with HER2/ERBB2 [67] and has been proposed to stratify a novel subtype of BRCA [129].NR1I3/CAR was also overexpressed and has not previously been described in breast cancer.RARA elevation was also identified in LIHC and there is some evidence to support an oncogenic function for this receptor uniquely in the liver [130].Another NR to be elevated in 3/6 tumors was the orphan receptor NR2C1/TR2, but this appears relatively underexplored in the cancer field (reviewed in [128]).Other examples of NRs showing increased expression in cancers (NR6A1/GCN1, Table 3: Summary of gene expression bootstrapping results comparing the average tumor/normal relative expression (% of tumors displaying overexpression (Z-score ≤ 2) and % of tumors displaying underexpression (Z-score ≤ -2)) of detectable members of 13 TF families, including NRs, across six different tumor types, relative to the background transcriptome.Shown is the observed value for each TF family (TF AVG), including NRs, as well as the mean value of the transcriptome background (TCGA AVG) for each respective cancer, as well as the bootstrapping results comparing the two.Significantly distorted expression patterns are highlighted in yellow (P < 0.05).Note that NRs are less commonly upregulated, and more commonly downregulated, than would be predicted by chance across all six tumor types.RARG, NR1D1/Rev-Erb-alpha, RXRB) are also relatively unexplored.

OVEREXPRESSED
Conversely, there were many examples of individual NR expression being lost in cancer.In several instances, NR expression was lost in only some tumor types, or focally in single cancers, suggesting tissue specific importance.One example is RARG loss in PRAD, which we identified previously [118] and reflects the prostate metaplasia observed in RAR null mice [131].Other examples of specific NR loss that have been previously characterized include VDR loss in COAD, Rev-Erb-alpha loss in BRCA, and NR1H2/LXRB loss in COAD [132][133][134].
However, other NRs were commonly downregulated in all or most cancers examined, suggesting broader tissue importance.These include examples from all types of NRs, from high affinity classical steroid hormone receptors such as PR, MR, GR; Type II NRs including RARB, PPARA; to Type III NRs including NR2F1/COUP-TF1 and NR2C2/TR2; and Type IV receptors such as NR1D2/Rev-erb-beta.In particular there are a group of eight NRs, which are down-regulated in at least five of the six tumor types (GR, MR, PGR, AR, NR1A2/THRB, NR5A2/LRH-1, RARB, NR1F1/ROR-alpha).
Some of these are relatively well described and characterized in cancer, such as RARB as well as RORA as a negative regulator of proliferation and an emerging therapeutic target [135,136].However what is perhaps less obvious is why GR and MR should be so uniformly and strongly reduced in expression.Given the roles for these receptors to contribute to the control of local inflammation, and the importance of inflammation as an early trigger for cancer, it is tempting to Table 4: Summary of copy number variation bootstrapping results comparing the average copy number alterations (% of tumors with amplification, % of tumors with deletion) observed for 13 TF families, including NRs, across six different tumor types, relative to the background genome.Shown is the observed value for each TF family (TF AVG), including NRs, as well as the mean value of the genome background (TCGA AVG) for each respective cancer, as well as the bootstrapping results comparing the two.Significantly distorted CNV patterns are highlighted in yellow (P < 0.05).Note that NRs not commonly amplified or deleted relative to the background genome.speculate that the common loss of these receptors reflects this central role.This surprising finding was also identified in breast cancer by high throughput Q-PCR approaches which revealed that MR, along with THRB and PPARG were altered and significantly predicted metastasis-free survival in patients [73].

Neither copy number variation nor mutation fully explain the changes in NR expression
. Subsequently, we examined how genomic alterations including CNV and mutations associated with the expression changes observed within the NR superfamily.Collectively, CNV changes in the NR superfamily were not significantly different from the genome background in each of the six tumor types considered (Table 4).This was generally true of the other TF families considered with the exception of the SMADs, which were significantly less amplified and more deleted than expected by chance across all the tumor types.This suggests a common mechanism driving downregulation of some transcription factors including SMADs may be result of genomic instability, while others including NRs may be results of additional mechanisms.
However, comparing the results of CNV alterations with gene expression changes within an individual tumor reveals some NR specific associations for individual receptors, as can be seen comparing relative expression scores to relative CNV scores (calculated in a similar fashion to relative expression scores, as described above) in BRCA (Figure 3).From this comparison it appears that whilst there is Table 5: Summary of somatic mutation bootstrapping results comparing the protein coding sequence mutation frequencies (mutations/coding base pair) of members of 13 TF families, including NRs, across six different tumor types relative to the protein coding genome.Shown is the observed mutation frequency for each TF family (TF AVG), including NRs, as well as the mean value of the protein coding genome background (TCGA AVG) for each respective cancer, as well as the bootstrapping results comparing the two.Significantly distorted mutation frequencies are highlighted in yellow (P < 0.05).Note that coding sequences of NRs not commonly mutated relative to the background protein coding genome.Also, the most commonly mutated member of each TF family is listed for each respective cancer, along with its respective mutation frequency and how it relates to the background mutation frequency (mutation ratio).Finally, examination of the mutational status of the NR superfamily did not reveal a significant relationship when considering the number of mutations, after normalizing for the length of the total coding sequence, as compared to the background genome (Figure 4, Table 5).There were not many consistent patterns across cancers concerning TF family mutation frequencies, with the exception of Homeobox TFs which are more commonly mutated than predicted by chance in 3 cancers.Of the focal examples of commonly mutated families in specific cancers such as FOXs in PRAD and GATAs in BRCA, they are largely driven by one or two commonly mutated members, of which there is some precedent in the literature; FOXA1 is commonly mutated in prostate cancer [137], GATA3 is commonly mutated in breast cancer [138], although for others such as ELF3 (ETS family) in BLCA there doesn't appear to be any literature to date.

Conclusions and Future Perspectives
The present review has described how the NR superfamily is integrated through shared genomic binding, shared pathways of genomic signaling, shared cofactor interactions and crosstalk with other TF pathways.Signaling by NRs is central to many cell fate decisions and as a consequence these actions are corrupted in many cancer cell types.Indeed the history of cancer research is intimately interwoven with elucidation of NR function from the earliest studies of steroidal signaling in breast and prostate cancer, to the identification of targeted therapies, to discovery of epigenetic mechanisms that distort gene regulation function and now, in the post-genomic era, to development of integrative genomic workflows that combine genomic, epigenomic and transcriptomic data to develop dynamic maps of NR signaling.We sought to build on the review of NR function by developing a pan-cancer view of NR expression by exploiting the remarkable volume of genomic data developed by TCGA.Transcriptomic and genomic alterations of the NR superfamily across six tumor types were examined and by exploiting bootstrapping approaches we were able to generate robust statistical statements concerning the expression, CNV and mutation status of the NR superfamily, alongside 12 other TF families as comparison.
A clear finding from these approaches was that the detected members of the NR superfamily are more downregulated than predicted by chance, an observation which was uniform across the cancers examined.No other TF family displayed this phenomenon, although KLFs and SMADs mirrored it to a more restricted and limited statistical extent and reflects studies that identified cross-talk between NRs/SMADs/KLFs.Within each tumor however the precise up and down regulated NRs varied, and comparing across cancers revealed the common downregulation of GR, MR, PGR, and THRB, whereas other changes in expression appeared be unique to a specific tumor type; RARG loss in prostate and gain in colon cancer, gain of NR6A1 and RXRB in liver cancer, loss of VDR in colon cancer; the gain in colon and loss in breast cancer of Rev-erb-alpha and the loss in colon of LXRB.
Interestingly, whilst NRs were strongly downregulated, this was only to a small extent, if at all, explained by genomic causes such as CNV or mutation, as opposed to other TF families including SMADs whose expression changes reflected CNV alterations.Therefore, an interesting implication of this observation is the idea that epigenomic, rather than genomic, mechanisms may be the drivers for this phenomenon, and possibly that while NRs are downregulated in cancer, they may remain functional.There are well-established roles for DNA methylation to down-regulate NRs, most notably RARB [63][64][65], and the current findings suggest that targeted DNA methylation may be responsible for suppression of other NRs.Previously, we have considered roles for microRNA to explain NR expression levels and established that certain cohorts of NR targeting miRNA were more up-regulated than predicted by chance [74].Whilst undertaking these studies is statistically more challenging, given the many to many relationships between miRNA and mRNA, it seems reasonable to suggest that networks of miRNA may play a significant role to distort NR network expression and therefore function across cancers.
Interestingly, all tissues examined expressed a broad array of NRs, with BLCA expressing the fewest at 40.This observation coincides with recent undertakings profiling NRs in tissues.For instance, the 42 NRs detected in BRCA correlate well with findings from a recent study examining NR expression in an independent cohort of normal breast tissue and breast tumors of varying stage [73].In this study, 41 NRs were detectable via TaqMan low-density array across breast tissues, with 6 of the 7 undetected NRs also not found to be expressed by our criteria in TCGA samples (NR0B1/DAX1, NR0B2/SHP, HNF4A, NR2E1/TLX, NR1H4/FXR and NR5A1/SF1).The only discrepancy between detectible NRs in breast tissue between the former study and our TCGA analysis was the detectible expression of NR1I3/CAR in TCGA samples, which had amongst the lowest expression of NRs in breast tumors in our analysis.Also in validation of our analysis, this study found a general pan-repression of NRs in breast tumors relative to normal breast tissues, with almost half of detected NRs having significantly lower expression.
The RoadMap Epigenome [113] genomics consortia have underscored the significance of NR enhancer interactions.Specifically, the Roadmap Epigenome investigators developed an algorithm entitled ChromHMM based on Hidden Markov Models.This was applied to the ∼3000 genomic and epigenomic data sets generated from over 100 cell types to identify 15 different chromatin states [113].Within these states, enhancer regions [139] represented ∼3% of the genome, and were typified by genomic location, DNAse sensitivity, and histone modifications (e.g., H3K4me1, H3K27me3 and H3K36me3) [140][141][142][143]. Within the enhancer modules over 1500 transcription factor motifs were examined for enrichment and revealed 84 significant transcription factor-enhancer modules.Ten of these modules were centered on NRs.Therefore NR are over-represented, and the current study has revealed roles for five of these NRs (GR, GCN1, LRH-1, THRB, RARG) as being more altered than predicted by chance across thousands of tumor samples.These different NR-motif relationships were not identified in the same chromatin states and therefore they perhaps represent high priority receptors for ChIP-Seq based studies to define how loss of expression (but not deletion or mutation) alters the distribution within chromatin states and modulates enhancer associations and/or responses.
Furthermore, in parallel, large scale genome-wide association studies (GWAS) of genetic variation has revealed that the vast majority of SNPs are contained in areas of the genome that are outside of gene exons, and therefore do not have the potential to make a direct contribution to protein structure and function [144].It is emerging that many phenotype-and disease-associated SNPs at distal regions impact transcription factor activity that in turn is associated with disease [144,145], and therefore integration of frequently altered NRs, in the most parsimonious cancer phenotypes, may prove to be a powerful approach to reveal how genetic variation impacts NR function.

Figure 1 :
Figure1: Workflow diagram summarizing TCGA analyses.Data were downloaded directly from the UCSC Cancer Genomics Browser (https://genome-cancer.ucsc.edu/),filtered, and global transcriptomic and genomic alterations determined.Observed alterations of NRs and other TF families were directly compared to their genomic background equivalents using a bootstrapping approach.

Figure 2 :Figure 3 :
Figure 2: Nuclear Receptors are downregulated in cancer.(A) Heatmap depicting the relative expression of all 42 NRs (rows) detectable in 1095 BRCA samples (columns) relative to the expression of NRs observed in a pool of 113 matched normal samples.Tumors with relative expression (Z-score) ≥ +/-2 are considered significantly upregulated (red) or downregulated (green), respectively.(B) Bootstrapping results comparing the observed mean % of tumors with significantly disrupted expression for NRs relative to the background transcriptome in BRCA.Note that NRs are significantly upregulated less than is predicted by chance ( = 0.00065) and significantly downregulated more than is predicted by chance ( = 0.00179).(C) Pan-Cancer summary of NR expression patterns.Relative expression scores were determined by summing the Z-scores for a given gene in a given cancer and dividing by the square root of the number of tumors available for that tumor type.

Figure 4 :
Figure 4: Nuclear Receptors are not common targets of somatic mutation in cancer.(A) Heatmap depicting non-synonymous mutations found in protein coding regions for all 48 NRs (rows) in 977 primary BRCA samples (columns).Observed mutations are depicted in blue.(B) Bootstrapping results comparing the observed mutation frequency (mutations/protein coding base pair) for NRs relative to the background protein coding genome in BRCA.Note that NRs are not significantly mutated more or less than is predicted by chance.

Table 2 :
Summary of transcription factor families, including NRs, utilized in TCGA analyses.Gene families and their members were downloaded from the HUGO Gene Nomenclature Committee (http://www.genenames.org).The average size of TF families examined = 48.15,approximately the size of the NR superfamily.

Table 2 )
, to serve as comparison.Results of all comparisons are summarized in
namely NR1I3.Meanwhile for the other six NRs associated with amplified regions there is no significant gain, and in fact there is still significant loss of expression in several cases, suggesting other regulatory events control expression from the amplicon.