Research Article

High burden and pervasive positive selection of somatic mutations in normal human skin

See allHide authors and affiliations

Science  22 May 2015:
Vol. 348, Issue 6237, pp. 880-886
DOI: 10.1126/science.aaa6806

Normal skin's curiously abnormal genome

Within every tumor, a battle is being waged. As individual tumor cells acquire new mutations that promote their survival and growth, they clonally expand at the expense of tumor cells that are “less fit.” Martincorena et al. sequenced 234 biopsies of sun-exposed but physiologically normal skin from four individuals (see the Perspective by Brash). They found a surprisingly high burden of mutations, higher than that of many tumors. Many of the mutations known to drive the growth of cutaneous squamous cell carcinomas were already under strong positive selection. More than a quarter of normal skin cells carried a driver mutation, and every square centimeter of skin contained hundreds of competing mutant clones.

Science, this issue p. 880; see also p. 867

Abstract

How somatic mutations accumulate in normal cells is central to understanding cancer development but is poorly understood. We performed ultradeep sequencing of 74 cancer genes in small (0.8 to 4.7 square millimeters) biopsies of normal skin. Across 234 biopsies of sun-exposed eyelid epidermis from four individuals, the burden of somatic mutations averaged two to six mutations per megabase per cell, similar to that seen in many cancers, and exhibited characteristic signatures of exposure to ultraviolet light. Remarkably, multiple cancer genes are under strong positive selection even in physiologically normal skin, including most of the key drivers of cutaneous squamous cell carcinomas. Positively selected mutations were found in 18 to 32% of normal skin cells at a density of ~140 driver mutations per square centimeter. We observed variability in the driver landscape among individuals and variability in the sizes of clonal expansions across genes. Thus, aged sun-exposed skin is a patchwork of thousands of evolving clones with over a quarter of cells carrying cancer-causing mutations while maintaining the physiological functions of epidermis.

The standard narrative of tumor evolution depicts the accumulation of driver mutations in cancer genes, causing waves of expansion of progressively more disordered clones (1, 2). Central to this model is the presumption that randomly distributed somatic mutations must accumulate in normal cells before transformation (3), but directly observing them has proved challenging due to the polyclonal composition of normal tissue. Retrospective reconstructions of clonal evolution from sequencing of tumors give only partial insights, leaving us with fundamental gaps in our understanding of the earliest stages of cancer development. Critical but largely unanswered questions include the burden of somatic mutations in normal cells, which mutational processes are operative in normal tissues, the extent of positive selection among competing clones within a organ, and the patterns of clonal expansion induced by the very first driver mutations (4, 5). These questions have been partially addressed in blood cells, where somatic mutations, including some driver mutations, have been found to accumulate at a low rate with increasing age (610).

To study the burden, mutational processes and clonal architecture of somatic mutations in normal nonhematological tissue, we focused on sun-exposed skin. Previous studies have reported the existence of clonal patches of skin cells carrying TP53 mutations (1115). Motivated by this, we designed a sequencing strategy capable of detecting such clones by performing ultradeep sequencing of small biopsies and adapting algorithms to detect mutations in a small fraction of cells. We used eyelid epidermis because of its relatively high levels of sun exposure and because it is one of the few body sites to have normal skin excised (blepharoplasty). This procedure is performed for age-related loss of elasticity of the underlying dermis, which can cause eyelid drooping sometimes severe enough to occlude vision, although the epidermis remains physiologically and histologically normal. From four individuals undergoing bilateral blepharoplasty, we obtained the resected eyelids, all of which had normal epidermis free of macroscopic lesions. The donors, three female and one male, ranged from 55 to 73 years of age and had variable histories of sun exposure (table S1). Three were of Western European origin and one was of South Asian origin. We separated the underlying dermis and took multiple biopsies of the epidermis from each eyelid (Fig. 1, A and B). In total, 234 biopsies of 0.79 to 4.71 mm2 in area were analyzed. We sequenced the coding exons of 74 genes implicated in skin and other cancers to an average effective coverage of 500× (supplementary methods S1.2 and fig. S7). We also performed whole-genome sequencing to ~147× depth on one biopsy in which a predominant clone was found by the targeted gene screen.

Fig. 1 Burden and spectrum of mutations in normal human skin.

(A) Excised human eyelid viewed from the dermal surface. The inset shows a sample region of epidermis after the dermis has been removed and biopsies taken. (B) Locational map of harvested areas from an eyelid showing locations of 0.79 mm2, 1.57 mm2, and 3.14 mm2 biopsies. (C) Distribution of the variant allele fraction (i.e. the fraction of sequencing reads reporting the mutation of all reads across the locus) for the 3760 mutations found across the 234 samples from four individuals, colored by mutation type. (D and E) Total counts in the coding (untranscribed) versus the noncoding (transcribed) strand for single base substitutions (D) and dinucleotides (E). The counts of C>T (G>A) mutations in a dipyrimidine context are shown in dark purple. P values reflect the transcription strand asymmetry (exact Poisson test). (F) Heat map of the relative rates of each mutation type, depending on the nucleotides upstream and downstream of the mutated base. Rates are normalized for sequence composition of the targeted genes.

Mutational signature of ultraviolet light exposure in normal skin

To identify somatic mutations in the skin biopsies, we adapted an algorithm designed to detect subclonal variants in cancers (16) (supplementary methods S1.3 and figs. S3 and S6), based on building a per-base model of background sequencing errors and identifying loci that have a statistical excess of mismatched base calls. This allowed us to detect mutations present in as few as 1% of the cells of a biopsy, detecting mutant clones ranging from 0.01 mm2 to several square millimeters in size. Overall, we identified 3760 somatic mutations across the 234 biopsies (Fig. 1C and data set S1). Several lines of evidence confirm that the overwhelming majority of these variant calls are genuine somatic mutations (supplementary methods S1.3.2 and figs. S1 and S2).

The pattern of mutations we identified closely matched that expected for ultraviolet (UV) light exposure and that seen in skin cancers (Fig. 1, D to F, and fig. S8). There was a predominance of C>T mutations, especially when the mutated cytosine was preceded by another pyrimidine (namely, a TpC or CpC context), and there were high rates of CC>TT dinucleotide substitutions. This signature is consistent with the known chemistry of sunlight-induced damage to DNA, in which UV rays catalyze the formation of cyclobutane dimers from adjacent pyrimidines (1720). C>T and CC>TT mutations were significantly more frequent on the nontranscribed strand of genes (Fig. 1, D and E), which is consistent with transcription-coupled repair (21).

We also observed enrichment of C>A (G>T) mutations, with no obvious sequence context but a strong bias toward higher rates of C>A mutations on the transcribed strand (Fig. 1D). Assuming that the strand bias results from transcription-coupled repair, this indicates that the damaged base is the guanine in the C:G pairing. This signature is also seen in cutaneous squamous cell carcinoma (cSCC) cancers, particularly in those with a relatively low mutation burden (fig. S8), but less frequently in basal cell carcinomas (BCCs) and melanomas. A significant fraction of mutations seen after in vitro exposure of cells to UV rays are not the canonical transitions at dipyrimidine sites, with C>A transversions being prominent (20). One hypothesis for the pathogenesis of this signature is the oxidation of guanine residues (typically 8-oxoguanine) by reactive oxygen species generated by sunlight (22). 8-oxoguanine is subject to transcription-coupled repair (23), consistent with the strand bias we see.

Pervasive positive selection of somatic mutations in normal skin

In the Darwinian model of cancer evolution, clones with driver mutations in cancer genes have a selective advantage over those without. In genomic data across multiple tumors, this manifests as an enrichment of protein-altering mutations in cancer genes as compared to that expected for the background mutation rate. To explore whether clonal selection is operative in normal skin cells, we adapted a dN/dS model that accounts for the context-dependent mutation spectrum and that estimates the background mutation rate of each gene separately using synonymous mutations (24) (Fig. 2, supplementary methods S1.4, and fig. S6). One major advantage of this approach is that the mutation rate is estimated locally, thus inherently correcting for the variation in mutation rate across the genome, differences in read depth across the genes surveyed, and the mutational spectrum observed in each individual. Genes under positive selection can be identified, and the number of driver mutations can be quantified from the excess of nonsynonymous mutations (24).

Fig. 2 Pervasive positive selection of oncogenic mutations in normal skin.

(A) to (E) Patterns of selection in six genes recurrently mutated in normal skin and in six other genes frequently implicated in skin cancers. (A) Number of mutations per gene classified by their functional impact. (B) dN/dS ratios for genes under significant positive selection (only statistically significant ratios are shown). (C) Estimated number of driver mutations per square centimeter of normal skin. (D) Enrichment of indels and dinucleotides in driver genes (bars show significant observed-to-expected ratios only). (E) Estimated percentage of cells in normal skin carrying mutations in each gene. Lower-bound estimates were obtained assuming the possibility of up to two driver mutations per cell, whereas higher-bound estimates are obtained by allowing only one driver mutation per gene per cell. (F to H) Percentage of cSCC, BCC, and melanoma tumors that carry a nonsynonymous point mutation in each gene. Genes found to be significantly recurrently mutated in each cancer type are shown in black (supplementary results S2.2). (I) Distribution of mutations across five driver genes in normal skin (above the gene diagrams) and in SCCs (below), including 67 cutaneous SCCs and 319 TCGA head and neck cancer exomes. The gene diagrams show the location of encoded protein domains. (J) Differential selection in NOTCH2 across individuals (supplementary methods S1.5).

Remarkably, six genes had a significant excess of protein-altering base substitutions after correcting for multiple-hypotheses testing (Fig. 2), with five of these also showing excess rates of indels and/or dinucleotide subs (Fig. 2D and supplementary methods S1.4). NOTCH1 was the most frequently mutated gene in the cohort and showed the highest observed-to-expected ratios of missense, nonsense, and essential splice site mutations. NOTCH2 and NOTCH3 also carried a significant excess of protein-altering mutations. NOTCH receptors are key regulators of stem cell biology in a number of organs (25) and are a frequent target of inactivating mutations in epithelial cancers (2629) and activating mutations in lymphoid malignancies (30, 31). The distribution of somatic mutations within the NOTCH1 and NOTCH2 genes was not random, with heavy clustering of amino acid substitutions in the extracellular epidermal growth factor–like domains and large numbers of protein-truncating mutations distributed throughout the genes, matching that observed in cutaneous and head and neck SCCs (Fig. 2I). The density of positively selected driver mutations was surprisingly high. From the excess of protein-altering mutations, we estimated the density of cell clones carrying driver mutations to be 57.1 clones/cm2 [confidence interval (CI95%): 51 to 61/cm2] for NOTCH1, 24.6 clones/cm2 (CI95%: 19 to 28/cm2) for NOTCH2, and 1.3 clones/cm2 (CI95%: 0.6 to 1.6/cm2) for NOTCH3. (Fig. 2C and supplementary methods S1.4.2). Thus, on average we found 83 clones carrying positively selected driver mutations in NOTCH genes for every square centimeter of aged, sun-exposed skin.

In SCCs of the skin and other organs, both copies of NOTCH1 are frequently inactivated (28, 29), typically through a point mutation combined with a copy number alteration. We developed an algorithm to identify small populations of cells with copy number alterations across the genes targeted for sequencing by phasing heterozygous single-nucleotide polymorphisms (SNPs) (32) (supplementary methods S1.6 and fig. S4). NOTCH1 was the gene most frequently subject to copy number changes (Fig. 3), with 27 out of 234 biopsies having detectable alterations (Fig. 3B). Only occasional copy number alterations were detected in other genes, although our power to detect these was variable because of differences in the numbers of heterozygous SNPs sequenced. When we estimate the percentage of cells carrying a NOTCH1 copy number change in a biopsy, we find that there is often a NOTCH1 point mutation apparently occurring in the same fraction of cells in the biopsy (Fig. 3C). This overlap, which occurs much more frequently than expected by chance (P < 10−5, supplementary methods S1.6.1), demonstrates that biallelic inactivation of NOTCH1 is already frequent in normal skin cells and not restricted to SCCs.

Fig. 3 Frequent copy number aberrations and biallelic loss of NOTCH1 in normal skin.

(A) Example of four skin samples with subclonal copy number aberrations in NOTCH1 and RB1. Every point represents a heterozygous SNP within the affected gene, and aberrations manifest as allelic imbalances, with a higher fraction of reads (biallelic fraction) supporting one of the alleles of the gene (in red). The extent of the deviation from 0.5 depends on the number of gene copies gained or lost and on the proportion of the biopsy occupied by the subclone (supplementary methods S1.6). (B) Number of copy number aberrations detected per gene. (C) In NOTCH1, a substitution is often found in the same fraction of cells as a deletion of the other allele (dot colocalizing with a horizontal band), showing that the loss of both copies of NOTCH1 is frequent in normal skin cells. Horizontal lines represent the expected variant allele fraction for a mutation inactivating the only remaining allele of a gene in the same clone, with colored shadows representing 95% confidence intervals. Orange and purple dots represent the allele fraction of missense and nonsense mutations in the biopsy, with 95% CIs (supplementary methods S1.6.1 and fig. S5).

FAT1 showed a statistically significant excess of inactivating mutations across all classes, including nonsense and essential splice site substitutions and short indels [false discovery rate (FDR)–adjusted P value (q) = 8 × 10−11, 9 × 10−6, and 2 × 10−4, respectively; Fig. 2, B to D, and supplementary methods S1.4.5]. FAT1 is a cadherin-like protein that suppresses tumor growth by blocking β-catenin signaling and is recurrently mutated in a range of cancers (33), including cutaneous (table S2) and head and neck SCCs (34, 35). Consistent with previous analyses of mutant clones in normal skin (11), we found an estimated 9.5 clones/cm2 carrying a driver mutation in TP53 (4.6 to 11.8/cm2; q = 4 × 10−6). In addition, we saw canonical hotspot mutations in several oncogenes, including KRAS, NRAS, and HRAS.

We found evidence of positive selection in other genes that have not previously been implicated in skin cancer. RBM10, which encodes an RNA-binding protein, is subject to recurrent inactivating mutations in lung adenocarcinoma (36), and we also see an excess of protein-truncating mutations in normal skin (q = 0.009; Fig. 2B). RBM10 is not a known skin cancer gene, although it may conceivably emerge as a rare driver in cSCCs with further sequencing. Additionally, in an analysis for excess mutations at hotspots, FGFR3 showed significant recurrence at two canonical residues (supplementary methods S1.4.3). The same hotspot mutations have been found in ~40% of seborrheic keratoses (37). These skin growths have an incidence that is 15 times higher than that of skin cancers (38), but they never become invasive or malignant. This observation suggests that there may be a class of genes in which somatic mutations give a clonal selective advantage in normal tissue, but do not cause, or could even inhibit, hallmarks of the cancer phenotype such as invasion or dissemination.

We compared the catalog of significantly mutated genes in normal skin to published exome sequencing studies from the three commonest classes of skin cancer, namely cSCCs (28, 39, 40), BCC (41) and melanoma (42). When analyzed using the same statistical methodology, there was little overlap in positively selected genes in normal skin compared to either BCC or melanoma (Fig. 2, G and H; supplementary results S2.2; and tables S2 to S4). In contrast, we found that the pattern in normal skin closely matched that of cSCC, with NOTCH1, NOTCH2, FAT1, and TP53 all being significantly mutated in the latter (Fig. 2F). Point mutations in CDKN2A were not found to be under positive selection in normal skin, despite this gene being a frequent driver in cSCC, inactivated by point mutations or homozygous deletions. Although our design does not allow us to reliably detect homozygous deletions, we found only three CDKN2A point mutations (two missense and one synonymous) across all 234 samples of normal skin, whereas ~31% (CI95%: 14 to 52%) of cSCCs carry nonsynonymous point mutations in the gene. These data suggest that the selective forces acting on physiologically normal skin resemble those in squamous cell carcinomas, with remarkable similarities between the driver mutations in each. However, CDKN2A inactivation appears to be specific to cancer clones, suggesting that its loss confers no selective advantage until more advanced stages of cancer evolution. The absence of mutations characteristic of melanomas is consistent with the fact that around 95% of the cells in the epidermis are keratinocytes (43), whereas melanomas originate from melanocytes. The absence of the PTCH1 mutations seen in BCC is notable, especially given that BCC has a threefold higher incidence than cSCC in populations of European ancestry (44). This may be consistent with BCC originating from cells infrequent or absent in the eyelid epidermis, such as from hair follicles (45), although our data cannot rule out other explanations.

Surprisingly, one of the four individuals in our series contributed a disproportionate number of mutations in NOTCH2 (39% of all mutations in NOTCH2 compared to 24% in other genes). A formal test of heterogeneity confirmed that NOTCH2 showed a variable rate of driver mutations among individuals (q = 0.0005; Fig. 2J; supplementary methods S1.5; and figs. S9 and S10). Because the dN/dS method used inherently accounts for gene-specific coverage and patient-specific mutation spectrum, this finding is likely to reflect a true biological difference among the four individuals rather than a bias arising from some aspect of the experimental design. One conceivable explanation is that some difference in the local eyelid environment provides a stronger pressure for NOTCH2 mutations; another, more likely explanation is that the genetic background of each individual could lead to differences in the strength of selective advantage across genes. The patient with different selection strength for NOTCH2 was of South Asian ancestry, whereas the other three were Western Europeans, although this needs considerably larger sample sizes to address formally. Nonetheless, these data illustrate the exciting potential of such study designs to detect inter-individual differences in the driver landscape that cannot be extracted from sequencing a single established cancer per patient.

Mutant clonal expansions

Together with the mutation rate, the size of the clonal expansions induced by driver mutations in normal tissue is critical to understanding the evolution of cancer, since both factors together determine the size of the pool of cells that can acquire sequential hits (4, 5). With our experimental design, the observed fraction of sequencing reads reporting a mutation correlates accurately with the fraction of cells in a biopsy that carry the mutation, once we correct for the local copy number at that locus, enabling us to estimate clone sizes (32) (supplementary sections S1.7 and S2.6.1). For the majority of mutations identified here, the variant allele fraction was <5% (Fig. 1B), indicating that most mutations were seen in only a small proportion of cells in the biopsy, typically <10%, with many mutations seen in only 1 to 2% of cells. There were exceptions, however, and some biopsies carried somatic mutations found in most of the cells. We find that the distribution of mutant clone sizes in aged, sun-exposed skin has a heavy right tail (Fig. 4A and fig. S11A), with some clones as large as several square millimeters in surface area.

Fig. 4 Mutant clone sizes and clonal dynamics in normal skin.

(A) Distribution of clone sizes of all mutations. (B) Mutation burden per megabase in the normal skin of four individuals and in a range of human cancers (supplementary methods S1.8). (C) Clone sizes of likely driver and passenger mutations in normal skin. Driver mutations are defined as those mutation types found to be under significant positive selection in each gene (Fig. 2). CIs and FDR-adjusted P values (q) values were obtained using 10,000 random permutations of the gene labels assigned to each mutation. (D) Global dN/dS estimates across all 74 genes analyzed in the study in normal skin and cSCC. This allows us to estimate the number of driver mutations per normal cell or per tumor as the number of mutations fixed by positive selection (supplementary methods S1.4.2). (E) Identification of mutations co-occurring in the same subclone, using the pigeonhole principle (32). (F) Subclonal structure of a large clone found to overlap with six biopsies (shown in purple in the eyelid locational map). (G) Schematic representation of the mutant clones in an average 1 cm2 of normal eyelid skin. To generate the figure, a number of biopsies were randomly selected to amount to 1 cm2 of sequenced skin, and all clones observed in these biopsies were represented as circles randomly distributed in space. The density, size, and simulated nesting of clones are all based on the sequencing data obtained in this study.

To estimate the average burden of somatic substitutions per skin cell, we can integrate the estimated fraction of mutant cells across the biopsies from each of the four donors (supplementary methods S1.8). This reveals that the mutation burden estimated from coding sequence is at least two to six somatic mutations/Mb/cell (Fig. 4B). This estimate is at the lower end of the burden of mutations in cSCCs (1 to 380/Mb) and melanomas (0.5 to 200/Mb), and higher than the average mutation burdens seen in many adult solid tumors (Fig. 4B). Using the variant allele fraction, we estimate that 14 to 21% of skin cells carry NOTCH1 mutations, with 5 to 7% having NOTCH2 and 2 to 3% NOTCH3 mutations (Fig. 2E). TP53 mutations and FAT1 mutations are present in 3 to 5% of skin cells, remarkably similar to the estimate of 4% from immunohistochemical studies of TP53 clones in human skin (11). Thus, about a quarter of all skin cells in these biopsies carried NOTCH mutations, the vast majority of which are driver mutations.

In current models of cancer development, driver mutations cause clonal expansions, widening the pool of cells that is susceptible to further driver mutations until enough accumulate to drive transformation and invasion. We compared the clone size of mutations in driver genes against that of synonymous mutations in non-driver genes, which are likely selectively neutral (Fig. 4C). We find that whereas the average clone size for neutral mutations was 0.15 mm2 (CI95%: 0.13 to 0.17), it was significantly larger for driver mutations in NOTCH1 (average 0.23 mm2; q = 0.002), TP53 (0.33 mm2; q = 0.009), and FGFR3 (0.69 mm2; q = 0.0007; permutation test). Clone sizes for FAT1, NOTCH2, and NOTCH3 mutations were not significantly increased. Although some putatively neutral mutations in this data set may be hitchhiking in clones with driver mutations, the difference in clone sizes between driver and neutral mutations is unexpectedly small. The large excess of truncating mutations in these genes demonstrates that clones carrying these mutations must have had a strong selective advantage at some stage. Indeed, lineage tracing in mice has revealed that clones carrying TP53 mutations grow nearly exponentially in UV-exposed epidermis (13). Yet, exponential growth must slow relatively early in the expansion of the clones to explain both the limited range of clone sizes observed here (Fig. 4C) and their similarity across individuals of different ages (Fig. 1C). Such constraints on clonal growth are likely to represent a critical protection against progressive accumulation of driver mutations and cancer. The physiological mechanisms underpinning this are unknown, but “imprisonment” of TP53-mutant clones has been observed in murine epidermis (46), possibly driven by interactions between the clone and surrounding cells and density-dependent growth constraints.

In contrast to the relatively small clone sizes of canonical cSCC driver genes, clones with activating FGFR3 mutations were among the largest observed. It is striking that the driver mutations inducing the largest clonal expansions in normal skin were those associated with benign tumors, namely seborrheic keratosis. This shows that the size of clonal expansion induced by a somatic mutation need not correlate with its potential to induce malignant transformation.

Our data reveal notable similarities between normal and cancer cells, with normal cells carrying thousands of mutations, including oncogenic driver mutations subject to strong positive selection. A major difference between the normal cells sequenced here and cancer cells seems to be the number of driver mutations per cell (Fig. 4D). Using dN/dS, we estimate that normal cells in the skin of these four subjects carry an average of 0.27 (CI95%: 0.19 to 0.35) driver point mutations per cell. Using the same method for cSCCs, we estimate an average of 2.7 (CI95%: 0.91 to 4.65) driver point mutations per tumor in the genes sequenced in this study.

At an average of 0.27 driver mutations per cell, there may be many normal cells with several drivers coexisting. When clones represent a large enough fraction of the biopsy, we can apply deductive reasoning to demonstrate co-occurrence of mutations in the same clone of cells (32). In our data, there were six large clones for which this was possible (Fig. 4E), with three showing two or more likely driver mutations in the same subclone. In one massive clone that spanned six adjacent biopsies, we found all cells carrying a canonical activating mutation in FGFR3 together with a known driver mutation in TP53, and two separate subclonal expansions (Fig. 4F).

To obtain a more comprehensive picture of the mutational landscape of normal cells, we performed whole genome sequencing to 147x depth on a biopsy containing this clone. This identified 73,904 base substitutions and 2248 small indels, with a mutation signature largely dominated by UV light exposure (fig. S12, B and C). About 14,000 of these were clonal (~4.6/Mb), presumably hitchhiking with the FGFR3 and TP53 mutations, but the rest were subclonal, often in <20% cells (fig. S12A). Integrating the allele frequencies, we estimate an average of 21,102 mutations per genome per cell (~7/Mb) in this sample. The mutation rate was found to vary along the genome, with higher rates in lowly expressed genes and in repressed chromatin (fig. S13), as observed in cancer (47) and human evolution (48).

Discussion

We found the frequency of driver mutations in physiologically normal skin cells surprisingly high. For example, there were more NOTCH1 mutations in just 5 cm2 of aged, sun-exposed skin analyzed here than have been identified in more than 5000 cancers sequenced by TCGA (The Cancer Genome Atlas). About 20% of normal skin cells carry driver mutations in NOTCH1, with some but not overwhelming enrichment in the matching cancer (60% of cSCCs have NOTCH1 mutations). Several other cancer genes were under positive selection in normal skin, and we found clones carrying two to three driver mutations that had not acquired malignant potential, raising the question of what combinations of events are sufficient for transformation. These observations may not be entirely unexpected: For cancers to occur with the frequency they do in the general population, there may be a vast underlying reservoir of competing clones part or much of the way to malignant transformation. A rather sobering corollary is that if we had a systemic targeted therapeutic that killed all cells with inactivated NOTCH1, we might successfully treat 60% of cSCCs but with considerable collateral damage to physiologically normal skin.

Studying tumor evolution by sequencing established cancers is akin to inferring the rules of a musical talent quest by identifying similarities across the show’s annual winners. Successful aspirants undoubtedly have common properties that identify necessary criteria for victory, but there is no substitute for directly observing the competition in its raw, early, local heats. Here, we have found hundreds of evolving clones per square centimeter of skin (Fig. 4G); thousands of mutations per skin cell; variability among individuals in the profile of driver mutations; and variability among cancer genes in clonal dynamics. Scaled up across the range of organ systems, cell types, and mutational exposures—and encompassing aging, predisposing diseases, and genetic backgrounds—such studies promise to reveal fundamental insights into the earliest stages of cancer development.

Supplementary Materials

www.sciencemag.org/content/348/6237/880/suppl/DC1

Methods

Supplementary Text

Figs. S1 to S13

Tables S1 to S4

References (4968)

Author Contributions

Data Set S1

References and Notes

  1. Acknowledgments: We thank M. S. Kolodney, R. J. Cho, and M. Dimon for kindly providing their published exome sequencing data on BCC and cSCC. This work was supported by the Wellcome Trust (grant 077012/Z/05/Z) and by a Medical Research Council (MRC) Centennial award (P.H.J.). P.J.C. has a Wellcome Trust Senior Clinical Research Fellowship (WT088340MA). P.H.J. is supported by an MRC grant in aid and Cancer Research UK (CRUK) (program grant C609/A17257). I.M. is supported by fellowships from EMBO (grant 1287-2012) and Queens’ College, Cambridge. A.R. acknowledges support from the CRUK-Cambridge Cancer Centre Clinical Research Fellowship. J.M.C.T. is supported by Marie Curie Fellowship FP7-PEOPLE-2012-IEF (project number 328264), and P.J.C. holds equity in and is a paid consultant for 14M Genomics Ltd. The sequencing data have been deposited in the European Genome-Phenome Archive (accession nos. EGAS00001000860, EGAS00001000515, and EGAS00001000603). The R code used for variant calling (ShearwaterML) is available in Bioconductor (deepSNV package).
View Abstract

Navigate This Article