Extreme Diversity, Conservation, and Convergence of Spider Silk Fibroin Sequences

See allHide authors and affiliations

Science  30 Mar 2001:
Vol. 291, Issue 5513, pp. 2603-2605
DOI: 10.1126/science.1057561


Spiders (Araneae) spin high-performance silks from liquid fibroin proteins. Fibroin sequences from basal spider lineages reveal mosaics of amino acid motifs that differ radically from previously described spider silk sequences. The silk fibers of Araneae are constructed from many protein designs. Yet, the repetitive sequences of fibroins from orb-weaving spiders have been maintained, presumably by stabilizing selection, over 125 million years of evolutionary history. The retention of these conserved motifs since the Mesozoic and their convergent evolution in other structural superproteins imply that these sequences are central to understanding the exceptional mechanical properties of orb weaver silks.

There are over 34,000 described species of Araneae (1). Each species uses silk, and some ecribellate orb weavers (Araneoidea) have a varied tool kit of task-specific silks with divergent mechanical properties (2). Araneoid major ampullate silk, the primary dragline, is extremely tough. Minor ampullate silk, used in web construction, has high tensile strength. An orb web's capture spiral, in part composed of flagelliform silk, is stretchy and can triple in length before breaking (3). Each of these fibers is composed of one or more proteins encoded by the spider silk fibroin gene family (4). Previously sequenced araneoid fibroins are dominated by iterations of four simple amino acid motifs: polyalanine (An) (n, iterations of amino acid or motif), alternating glycine and alanine (GA), GGX (where X represents a small subset of amino acids), and GPG(X)n (P, proline) (5). However, orb weavers represent only a fraction of spider diversity (1). The silks of non-araneoid spiders have not been characterized at the DNA sequence level (Fig. 1).

Figure 1

Hypothesized phylogenetic relationships of Araneae based on morphological evidence (1, 28). Previously published spider fibroin sequences are from the two genera marked by white circles. Including data presented here, fibroin sequences have been characterized for the taxa in red. Circles at internal nodes mark fossil calibration points. Extinct taxa that calibrate these nodes,Macryphantes (black circle) (16) andRosamygale (gray circle) (13), are indicated, and higher level taxa are to the right of the brackets (Ma, millions of years ago). Dolomedes, Plectreurys, andEuagrus are from the families Pisauridae, Plectreuridae, and Dipluridae, respectively.

Spiders draw fibers from dissolved fibroin proteins that are stored in specialized sets of abdominal glands. Recently, it has been suggested that this spinning mechanism may impact the generation of high-performance silk fibers more than the sequences of the fibroins themselves (2, 6). If this was so, amino acid sequences of different silk proteins should not have been conserved over long evolutionary intervals. To this point, comparative analyses of spider fibroins have been limited, so the evolutionary stability of these molecules is unclear.

DNA sequences have been described for only two genera of orb weavers (Fig. 1). To expand this database, we made seven cDNA libraries of silk glands from five spider genera. cDNA data were supplemented by information from two genomic libraries and polymerase chain reaction (PCR)–amplified sequences (7). We obtained partial cDNA or gene sequences for 23 fibroins from six families of Araneae. These data greatly extend the phylogenetic diversity of characterized fibroins (Fig. 1).

Like previously published fibroin sequences from spiders (4, 5, 8, 9) and lepidopterans (10, 11), our sequences encode repetitive alanine- and glycine-rich proteins. In each molecule, iterated amino acid motifs are organized into higher order ensemble repeats. Ensemble repeats within each fibroin were aligned, and a consensus ensemble repeat was generated for each molecule (12).

In part, silk DNA sequences from non-araneoid spiders (Fig. 2) reiterate the importance of amino acid motifs that compose orb weaver fibroins. GA, GGX, and An form the consensus ensemble repeat units of silk fibroins from the pisaurid fishing spider, Dolomedes. The association of these three motifs in Dolomedes silk proteins mirrors the pattern seen in major and minor ampullate fibroins of orb weavers (Fig. 3). GA, GGX, and An motifs also are distributed, sometimes sparsely, among ensemble repeat units from successively more basal lineages of spiders (Haplogynae and Mygalomorphae). An is represented in each of the fibroins from these taxa and from all lineages of Araneae studied thus far (Figs. 2and 3). Mygalomorphae, tarantulas and close relatives, diverged from Araneomorphae, “true” spiders, at least 240 million years ago in the Middle Triassic (13) (Fig. 1); thus, An motifs may have been maintained in different spider silks since that time.

Figure 2

Consensus ensemble repeat units for non-araneoid spider fibroins. Single-letter symbols for amino acids are used (29), and GGX, GA, and An motifs are indicated in green, yellow, and red, respectively. Plectreurys cDNA1 andPlectreurys cDNA2 were derived from the larger ampule-shaped glands of Plectreurys, andPlectreurys cDNA3 and Plectreurys cDNA4 were from the smaller ampullate glands of this spider.

Figure 3

Consensus ensemble repeat units for four araneoid fibroin ortholog groups. Single-letter symbols for amino acids are used (29), and GGX, GA, An, and GPG(X)n motifs are indicated in green, yellow, red, and blue, respectively. Dashes indicate gaps inserted into the alignments of ensemble repeat units for MaSp1, MaSp2, and MiSp. The “[spacer]” region of the MiSp fibroins is a serine-rich sequence that is 137 amino acids long in Nephila clavipes (GenBank accession number AF027735). Abbreviations are as follows: Nep.c., Nephila clavipes;Nep.m., N. madagascariensis; Nep.s.,N. senegalensis; Tet.k., Tetragnatha kauaiensis; Tet.v., T. versicolor;Lat.g., Latrodectus geometricus;Arg.a., Argiope aurantia; Arg.t.,Argiope trifasciata; Ara.d., Araneus diadematus; Gas.m., Gasteracantha mammosa; and Ara.b., Araneus bicentenarius. Symbols indicate the following: *, previously published sequence; †, PCR/genomic clone; §m, cDNA from major ampullate glands; and §f, cDNA from flagelliform glands. The previous designations for Araneus diadematus fibroins (4) are shown in parentheses (ADF1 through ADF4).

Although the fibroins of Plectreurys (Haplogynae) andEuagrus (Mygalomorphae) are internally repetitive, the ensemble repeats from these basal taxa (Fig. 2) are unlike analogous units from previously described silks (Fig. 3). Each of the fibroins from these primitive groups contains runs of serine.Plectreurys cDNA1 is highly internally repetitive with iterations of An, Sn, (GX)n, and (AQ)n (S, serine; Q, glutamine). Plectreurys cDNA3 has a unique molecular architecture, with the 5′ end encoding a tandem array of long repeat units and the 3′ end encoding 15 repeats of a much shorter ensemble unit. The ∼344–amino acid Euagrus repeat unit is a complex mixture of serine and alanine–rich sequence that includes a string of threonine, an amino acid that is rare in araneoid fibroins (Fig. 2).

Aside from an overall modular structure, scattered GA, GGX, and An motifs, and amino acid matches in the nonrepetitive COOH-terminus, there is only limited sequence similarity between araneoid fibroins and those from Plectreurys andEuagrus (Figs. 2 and 3). Perhaps these differences are not surprising given the contrasting ecologies of these species (1,14), but our data clearly indicate that spiders use a diversity of proteins to spin silk threads. The fibroin repeats of basal Araneae suggest that spider silk design may not be especially dependent on specific sequences, but comparisons of fibroins among orb weavers contradict this notion (Fig. 3).

In combination with published data (4, 5, 8, 9), our sequences allow comparisons between the two basal-most clades of ecribellate orb weavers, Araneidae and “derived araneoids” (Fig. 1), for four groups of fibroins (15): major ampullate spidroin 1–like (MaSp1), major ampullate spidroin 2–like (MaSp2), minor ampullate spidroins (MiSp), and flagelliform silk protein (Flag). Differences among fibroins within each of these four groups are primarily variations in the arrangement and frequency of An, GA, GGX, and GPG(X)nmotifs (Fig. 3). An, GA, and GGX are present in consensus repeats for both araneid and derived araneoid MiSp orthologs. Major ampullate fibroins are similarly conserved among araneoids. Stable repeats for MaSp1 are An, GA, and GGX, and stable repeats for MaSp2 are GPG(X)n and An. These motifs are retained even in major ampullate fibroins of the widow Latrodectus, a cobweb-weaving araneoid that does not spin a conventional orb web. The long Flag repeats are divergent within Araneoidea, but both araneid and derived araneoid repeat units are composed primarily of clustered GPG(X)n and GGX motifs (Fig. 3).

Fossil evidence suggests that the divergence of Araneidae from derived araneoids occurred no later than the Early Cretaceous (Fig. 1). Therefore, the motifs conserved within MaSp1, MaSp2, MiSp, and Flag have been maintained, presumably by stabilizing selection, for over 125 million years (16). Motifs that have been retained over such long evolutionary periods are likely to be critical to the divergent mechanical properties of the specialized orb weaver silks.

All of the motifs that are conserved in araneoid fibroins are found in other structural proteins that also have exceptional mechanical properties. There is strong evidence that An and GA regions create β sheets in silk fibers, and these crystalline structures are hypothesized to be critical for the high tensile strength of spider silk (17). An and GA are found in several proteins that also are characterized by high strength: lepidopteran fibroins from Bombyx, Antheraea, andGalleria (10, 11); oyster shell matrix protein (18); and mussel byssus proteins (19). Furthermore, GPG(X)n has been implicated as the elastic module of MaSp2 and Flag (5), and analogous PG motifs confer stretchiness to wheat gluten and elastin (20). GPG(X)n is duplicated in proteins of the elastic byssal threads of mussels (19). The structure of GGX motifs in spider silk currently is not clear (21). Regardless, GGX motifs are maintained in MaSp1, MiSp, and Flag and are distributed widely among Antheraea and Galleria fibroins, mussel byssus proteins, oyster shell matrix protein, and abductin (a component of the hinge ligament that connects molluskan shells) (22). The convergent use of An, GA, GGX, and GPG(X)n in other structural superproteins is further evidence for the importance of these motifs to the mechanical properties of different araneoid silks.

Many arthropods produce silk, but silk use has a widely scattered phylogenetic distribution (14). The profoundly different modes of silk production in lepidopteran insects (labial glands) versus spiders (abdominal glands) suggest that silks from these taxa evolved convergently (23). It also would be difficult to argue that the structural proteins used by bivalve mollusks are closely related to spider fibroins. These “aquatic silks” likely evolved common motifs in parallel with terrestrial arthropod fibroins. Such evolutionary convergences are documented regularly at the gross anatomical level, but comparable molecular examples are rare [e.g., (24)].

Recent attempts at artificially spinning orb weaver fibroins have not been successful in replicating the strength and toughness of native silk fibers (25). Furthermore, the mechanical properties of spider silks have been shown to vary with the spinning rate (26). However, to suggest that the drawing process, rather than the primary sequences of fibroins, exerts the dominant influence on fiber formation and properties (2, 6) is an overstatement. Evidently, both factors are important. Four simple amino acid motifs account for most repetitive sequences in characterized araneoid fibroins. The retention of these motifs in spider silks since the Mesozoic implies that fibroin sequences are critical to any detailed understanding of the elasticity, toughness, and strength of different araneoid silk fibers. The convergent evolution of these motifs in other structural superproteins reiterates the importance of these sequences. However, clusters of the amino acid motifs conserved in orb weaver silks make up only a fraction of the repetitive sequences in silk proteins from Plectreurys andEuagrus. These molecules from basal Araneae hint at the potential wealth of protein designs yet to be discovered in spider silks.

  • * Present address: Department of Biology, University of California, Riverside, CA 92521, USA.

  • These authors contributed equally to this work.


Stay Connected to Science

Navigate This Article