Duplication and concerted evolution of MiSp-encoding genes underlie the material properties of minor ampullate silks of cobweb weaving spiders
© The Author(s). 2017
Received: 24 August 2016
Accepted: 24 February 2017
Published: 14 March 2017
Orb-web weaving spiders and their relatives use multiple types of task-specific silks. The majority of spider silk studies have focused on the ultra-tough dragline silk synthesized in major ampullate glands, but other silk types have impressive material properties. For instance, minor ampullate silks of orb-web weaving spiders are as tough as draglines, due to their higher extensibility despite lower strength. Differences in material properties between silk types result from differences in their component proteins, particularly members of the spidroin (spider fibroin) gene family. However, the extent to which variation in material properties within a single silk type can be explained by variation in spidroin sequences is unknown. Here, we compare the minor ampullate spidroins (MiSp) of orb-weavers and cobweb weavers. Orb-web weavers use minor ampullate silk to form the auxiliary spiral of the orb-web while cobweb weavers use it to wrap prey, suggesting that selection pressures on minor ampullate spidroins (MiSp) may differ between the two groups.
We report complete or nearly complete MiSp sequences from five cobweb weaving spider species and measure material properties of minor ampullate silks in a subset of these species. We also compare MiSp sequences and silk properties of our cobweb weavers to published data for orb-web weavers. We demonstrate that all our cobweb weavers possess multiple MiSp loci and that one locus is more highly expressed in at least two species. We also find that the proportion of β-spiral-forming amino acid motifs in MiSp positively correlates with minor ampullate silk extensibility across orb-web and cobweb weavers.
MiSp sequences vary dramatically within and among spider species, and have likely been subject to multiple rounds of gene duplication and concerted evolution, which have contributed to the diverse material properties of minor ampullate silks. Our sequences also provide templates for recombinant silk proteins with tailored properties.
Major ampullate silk has been the primary focus of silk research because it is relatively easy to harvest and has tensile strength rivaling that of steel [6, 7]. However, all silk fibers have impressive and unique material properties . For instance, minor ampullate silk of the orb-web weavers Argiope trifasciata, Argiope argentata, and Nephila clavipes have higher extensibility and lower strength than major ampullate silks from the same species, yet are equally tough [8–12]. Additionally, minor and major ampullate silks display different physical properties when wet. Supercontraction in water occurs with A. trifasciata and Nephila inaurata major ampullate silks, which allows them to become more extensible while maintaining their high strength . However, when minor ampullate silks from the same species are exposed to water, supercontraction does not occur and the mechanical properties do not change . Despite having consistent behaviors within species whether wet or dry, mechanical properties of minor ampullate silks varied more among species than did the properties of major ampullate silks [14, 15]. The consistency of minor ampullate silks’ mechanical properties when wet or dry, as well as their increased extensibility relative to draglines and greater variability among species, could allow for different applications of minor ampullate silk that would not be feasible for major ampullate silk. The material properties of minor ampullate silks made by cobweb weavers have yet to be measured, but given the different functions of this silk in cobweb weavers compared to in orb-web weavers, we predict divergence in the mechanical performance of these fibers.
The variation in physical properties between major and minor ampullate silks and among minor ampullate silks of different species likely results from sequence differences among their component proteins, termed spidroins (spider fibroins). Spidroins are extremely large (>200 kDa) and are made almost entirely of a highly repetitive region, which is flanked by short, conserved carboxy (C)- and amino (N)-terminal regions [16–21]. The repetitive region of many spidroins contain numerous subrepeats of short amino acid sequences, called motifs . Motifs found frequently in spidroin repetitive regions include GGX (where X refers to a subset of amino acids), (GA)n where n ≥ 2, An where n ≥ 4, and GPG [2, 18]. An and (GA)n form β-sheets, GGX forms 31-helices, and GPG forms β-spirals [2, 14, 18, 22–26]. Secondary structures of β-sheets likely confer strength, and β-spirals contribute to elasticity .
Major ampullate silks of all araneoid spiders examined thus far are made up of two proteins, MaSp1 and MaSp2, which are distinguished by a high GPG content in the latter but not the former [27, 28]. Similarly, N. clavipes minor ampullate silk is made up of two proteins, MiSp1 and MiSp2, although they are not as distinctly different from each other as MaSp1 and MaSp2 . All described minor ampullate spidroins (generalized as MiSp) have consensus repeats that are primarily made up of An, (GA)n and GGX motifs disrupted by serine and threonine-rich “spacer” regions [2, 14, 18, 29, 30]. The functions of the spacer regions are not well understood, but artificially made recombinant proteins composed solely of spacer regions tend to form oligomers and only recombinant MiSp proteins that include at least one spacer, repetitive region, and the C-terminal domain can form fibers . Both MaSp1 and MiSp are predicted to be composed of β-sheets connected by helical regions and amorphous chains [2, 14, 18]. The higher tensile strength of major ampullate silk may result from longer β-sheets made up of An repeats, in contrast to shorter β-sheets made up of An and (GA)n repeats in minor ampullate silk .
The large size and repetitiveness of spidroins make spidroin-encoding genes extremely difficult to sequence completely. The only full-length sequence of MiSp known is for Araneus ventricosus . Furthermore, even partial MiSp sequences have been characterized from only a few orb-web weavers [29, 30]. The full length sequence shows that MiSp repeat units are not as similar to each other as are the repeats of MaSp1 or MaSp2 repeats; and that A. ventricosus MiSp possesses exactly two spacers that interrupt the glycine and alanine rich motifs . The generality of these features for other MiSp-encoding sequences needs to be tested.
The only known sequences of MiSp in a cobweb weaver are partial sequences from Latrodectus hesperus [5, 32]. The known repetitive regions are similar between L. hesperus and orb-web weavers’ MiSp, with the main difference being length of spacers [30, 32]. However, because cobweb weavers use minor ampullate silks for prey wrapping, while orb-web weavers use them in the auxiliary spiral of their web , we predict different selective pressures on cobweb weaving MiSp that could consequently alter secondary structure and physical properties.
Here, we characterize complete or nearly complete MiSp sequences from five cobweb weaving spider species: L. hesperus, L. tredecimguttatus, L. geometricus, Steatoda grossa, and Parasteatoda tepidariorum (Theridiidae). We also measured mechanical properties of minor ampullate silks in L. hesperus, L. geometricus, and S. grossa. Our goals were to determine the extent of variation in MiSp sequences and minor ampullate silk properties within cobweb weaving spiders, compare this variation to that found in orb-web weaving spiders, and identify any relationship between sequence and material property variation across both groups. We found evidence for at least two MiSp loci in each cobweb weaving species, and that one copy is more highly expressed than the other in minor ampullate glands in L. hesperus and L. geometricus. Phylogenetic analyses suggested extensive gene duplication and concerted evolution of MiSp-encoding loci contributed to variation in MiSp sequences. Finally, we found significant differences in mechanical properties among minor ampullate silks that were partially explained by variation in MiSp sequences.
Multiple loci encode MiSp in cobweb weavers
We employed a series of molecular experiments to identify MiSp-encoding sequences in our five species of cobweb weavers. Additional file 1: Figure S1 provides an overview of these various methods.
Spidroins included in phylogenetic analyses of N- and C-terminal sequences
Nephila inaurata madagascariensis
Nephila inaurata madagascariensis
Primer sets designed to be specific to Lh MiSp_v1 versus Lh MiSp_v2 successfully amplified genomic DNA of three single individuals. Variant-specific PCR products were 99.3% identical to the targeted variant, but only 86.3% identical to each other on average at the 616 bp C-terminal/ 3’UTR region. Sequence chromatograms of Lh MiSp_v1 and MiSp_v2 amplifications exhibited double peaks in both directions in at least one individual (10 double peaks in one of three individuals amplified with Lh MiSp_v1 specific primers and one double peak in all three individuals amplified with Lh MiSp_v2 specific primers). The pattern of double peaks in one individual but not others for the Lh MiSp_v1 specific PCR product is consistent with allelic variation for Lh MiSp_v1 and confirms that Lh MiSp_v1 and Lh MiSp_v2 represent separate genomic loci.
L. tredecimguttatus and L. geometricus
Because these species are diploid, we expect no more than two alleles per locus within a single individual. Thus each species may possess as many as six loci encoding MiSp (11 variants identified in each species). However, because some PCR or cloning error could produce slight variants among sequences (despite the use of a proofreading enzyme) we conservatively considered the two distinct clusters to represent separate loci and completely sequenced one representative from each cluster (Fig. 2). We designated the completely sequenced clones from L. tredecimguttatus as Lt MiSp_v1 and Lt MiSp_v2, and the completely sequenced clones from L. geometricus as Lg MiSp_v1 and Lg MiSp_v2. Designation of MiSp variants with 1 or 2 is not intended to imply orthology among species, but simply reflects different genomic loci within species.
TOPO cloning and end sequencing of an ~5000 bp S. grossa PCR product that was generated with primers targeting complete MiSp resulted in ten identical clones from a single individual. When translated, these ~800 bp end sequences aligned well with our Latrodectus MiSp sequences, but contained numerous stop codons. This variant is likely a non-functional pseudogene, designated Sg MiSp_pseudo.
PCR amplification using primers designed from our ~3 kb C-terminal encoding S. grossa MiSp cDNA (designated Sg MiSp) failed to find any further variants. In contrast, TOPO cloning and sequencing of 20 N-terminal and adjacent repetitive MiSp encoding PCR products for a single S. grossa individual resulted in 20 unique sequences, each different from our ~500 bp N-terminal encoding cDNA. The clones (designated by clone numbers) grouped into three clusters (Additional file 1: Figure S3); within a cluster pairwise differences among sequences were <0.9% on average (0.13–1.5%) while between clusters pairwise differences were much greater (average pairwise distance = 10%). Nucleotide differences within a cluster could reflect alleles or multiple loci, but could also be attributable to PCR error because amplification was performed with Taq polymerase alone, which does not have proofreading capability, in contrast to the high fidelity polymerases used to amplify the Latrodectus species (see Methods). Conservatively, the three clusters represent three true variants. Thus there is a minimum of two loci that encode MiSp in S. grossa.
We found partial length N and C-terminal MiSp-encoding cDNAs from P. tepidariorum (designated Pt MiSp, see Table 1 for accessions) that were nearly identical (>98%) to the 5’ and 3’ ends, respectively, of an Augustus-predicted gene model on Scaffold 853 in the publicly available genome. This gene model was predicted to span 10292 bp, but included four gaps (4353 bp of assembled sequence, 5939 bp of unknown). Characterization of a TOPO clone containing an ~2.3 kb PCR product for P. tepidariorum MiSp (KX584004) resulted in a sequence that was 100% identical to the 104 bp of overlap with the C-terminal encoding region of Scaffold 853 and filled 2236 bp of the last gap. We refer to the combination of our TOPO clone with Scaffold 853 as Pt MiSp variant 1 (Pt MiSp_v1). The fully sequenced TOPO clone included a 1175 bp long intron with canonical donor and acceptor sites identified by aligning with our C-terminal encoding cDNA using SPIDEY . We also found BLASTN alignments of this 2.3 kb TOPO clone to three other scaffolds (954, 3280, and 3758) that may represent additional MiSp-encoding loci. The C-terminal encoding region of our TOPO clone aligned with 91% identity on Scaffold 954. The intron sequence aligned with 95% identity to Scaffold 3280 and 99% identity to Scaffold 3758. The repeat region flanking the intron (including 100 bases of intron) aligned with 89% identity to Scaffold 3280 and 90% identity to Scaffold 3758. The presence of two C-terminal encoding sequences that differ by 9% and are placed on separate scaffolds suggests that at least two loci encode MiSp in P. tepidariorum.
MiSp loci are expressed at different rates in cobweb weaving spiders
MiSp transcript abundance based on read counts in minor ampullate, major ampullate, and total silk gland RNA-seq libraries
Variable genomic structure of MiSp
All MiSp genes characterized from Latrodectus lacked introns interrupting coding sequence. While only L. hesperus MiSp_v1 included both the start and stop codons, the other MiSp sequences missed no more than 300 bp of the N or C-terminal coding regions on either end. Our P. tepidariorum MiSp genomic TOPO clone that includes the C-terminal encoding region has an intron of 1175 bp. For comparison, orb-weaver A. ventricosus MiSp has a large, single intron of 5628 bp .
Inspection of the L. hesperus fosmid genomic clone revealed a TATA box approximately 50 bases upstream of the full-length L. hesperus MiSp coding region. The nucleotide motif CACG found in all previously characterized spidroin genes was also found in L. hesperus MiSp, 11 bases upstream of the TATA box. The P. tepidariorum sequence on Scaffold 853 in the i5k genome possessed a TATA box 63 bases upstream of the MiSp coding region and a CACG motif 10 bases upstream of the TATA box. The A. ventricosus sequence has a TATA box 63 bases upstream of the full-length MiSp coding region, and a CACG motif 30 bases upstream of the TATA box . Polyadenylation signals were found 110 bases downstream of the stop codon for L. hesperus MiSp_v1, 242 bases downstream of the stop for P. tepidariorum MiSp_v1, and 82 bases downstream of the stop for A. ventricosus MiSp .
MiSp amino acid variation within and among species
Cobweb weaver MiSp length is highly variable within and among species (815 aa to 2751 aa, Additional file 1: Figure S4). For comparison, the completely sequenced L. hesperus MaSp1 is 3129 aa and MaSp2 is 3779 aa, and A. ventricosus MiSp is 1,766 aa . Glycine, alanine, and serine are the three most abundant amino acids for all MiSp variants, showing similar composition percentages to L. hesperus MaSp1 and MaSp2. Serine is found in both the spacer regions and the repeat regions, while glycine and alanine are predominantly found in the repeat region only. Proline is found in very low abundance for both L. hesperus MiSp variants, both L. tredecimguttatus MiSp variants, L. geometricus MiSp_v2, and P. tepidariorum MiSp_v1, but is more abundant in L. geometricus MiSp_v1 and is the fourth most abundant amino acid in S. grossa MiSp, surpassing L. hesperus MaSp2 proline composition (Additional file 1: Figure S4). Codon usage is variable among MiSp loci and species, but there is an overall bias toward A or T in the third position of the codon, especially for alanine and glycine (Additional file 1: Table S1).
Material properties of minor ampullate spidroins correlate with MiSp sequence variation
Properties of minor ampullate silk for cobweb and orb-web weaving species
Young’s Modulus (GPa)
Extensibility (ln mm/mm)
1.12 ± 0.47
3.9 ± 2.9
245.4 ± 120.5
0.57 ± 0.02
66.7 ± 46.4
1.09 ± 0.29
2.6 ± 1.3
174.4 ± 17.6
0.54 ± 0.02
43.6 ± 20.1
1.07 ± 0.26
2.1 ± 0.7
251.3 ± 106.2
0.74 ± 0.02
57.3 ± 20.7
Nephila clavipes a
2.5 ± 0.2
3.0 ± 0.6
346 ± 53
0.30 ± 0.15
Nephila clavipes b
960 ± 50
0.25 ± 0.03
Nephila inaurata c
1.8 ± 0.1
11.2 ± 0.7
1500 ± 200
0.46 ± 0.05
300 ± 50
Argiope trifasciata d,e
0.69 ± 0.10
8.9 ± 0.05
751.5 ± 40
0.44 ± 0.04
150 ± 12
Argiope trifasciata c
1.8 ± 0.1
10 ± 0.4
1040 ± 6
0.45 ± 0.02
240 ± 20
Argiope argentata e
1.1 ± 0.7
10.6 ± 1.2
923 ± 154
0.33 ± 0.03
137 ± 22
Araneus diadematus f
Araneus gemmoides b
1400 ± 100
0.22 ± 0.07
Phylogenetic relationships of spidroins among orb-web and cobweb weaving species suggest multiple duplication events
Our characterization of complete or almost complete encoding sequences for the minor ampullate spidroin (MiSp) in multiple cobweb weaving spider species (Theridiidae) demonstrate the presence of at least two MiSp loci in each species. The substantial variation of MiSp sequences within and among species and its relationship to variation in material properties has multiple implications for molecular evolution, spider ecology, and biomimetic applications through recombinant DNA technology.
Two MiSp-encoding loci have also been documented in the golden orb-weaver spider, Nephila clavipes (Nephilidae), and therefore the presence of multiple MiSp-encoding loci in spider genomes most likely dates minimally back to the common ancestor of theridiids and nephilids (e.g. Araneoidea). Our reconciliation of gene trees with species trees suggests even older duplication events (Fig. 7). However, the maintenance of two loci appears to involve complex molecular evolutionary processes including intergenic concerted evolution of the loci within species, and multiple gains and losses of individual MiSp copies. Although our reconciliation analyses inferred independent duplication events within each of our cobweb weaving species, the grouping of MiSp loci within species based on the terminal domains (Figs. 6 and 7), could also result from intergenic concerted evolution of the N- and C-terminal encoding regions. We favor the latter hypothesis because of the dramatic differences in the repetitive region within some species (Fig. 3). Concerted evolution of the N and C-terminal encoding regions could occur through non-homologous recombination between the loci, facilitated by their similar sequences, as proposed for the major ampullate spidroin paralogs, MaSp1 and MaSp2, of multiple species [2, 32, 41–43]. Within theridiids, the rate of concerted evolution appears to be faster than speciation, since relationships among each species’ pair of MiSp loci reflect species relationships (Figs. 6 and 7). Intergenic concerted evolution of terminal-encoding regions could be favored by selection because these regions are involved in assembly of multiple MiSp monomers into polymers and the conversion of the protein complex from a liquid to a solid [31, 44]. Highly similar terminal regions may be necessary for polymers to form.
Outside of theridiids, relationships among MiSp N and C-terminal domains are not congruent with species relationships. The low posterior probabilities and bootstrap support for these incongruous relationships suggest there is limited phylogenetic signal retained in the MiSp C-termini at the distant time scale of divergence of theridiids from other araneoid families (~170 million years ago, [36, 38, 39]). However, the consistent grouping of Nephila and theridiid MiSp C-terminal domains to the exclusion of araneid MiSp C-terminal domains suggests that nephilid and theridiid MiSp sequences are derived from a different ancient copy than the araneid MiSp sequences, as supported by our reconciliation analyses (Figs. 6 and 7). The loss of functional MiSp loci through multiple single nucleotide mutations is further supported by the presence of a MiSp pseudogene in the S. grossa genome. Many additional losses were inferred by our reconciliation analyses (Additional file 1: Figure S8), but because of incomplete spidroin sampling for most species we do not feel confident in estimating the extent of spidroin gene loss.
We found extensive variation in the MiSp repetitive regions between loci within a genome and among species (Fig. 3). MiSp variants in L. hesperus and L. tredecimguttatus, L. geometricus MiSp_v2, P. tepidariorum MiSp_v1, Nephila MiSp 1 & 2, and araneid MiSp sequences are similar in terms of amino acid motif composition. S. grossa MiSp and L. geometricus MiSp_v1 are especially divergent, with the difference between variants within L. geometricus being extremely striking (Fig. 3). The divergent repeats have an increased proline content, which likely occurred independently in S. grossa MiSp and L. geometricus MiSp_v1, based on gene tree relationships (Figs. 6 & 7). The mutation of alanine or glutamine codons to a proline codon requires only a single base change and the GPG motif is frequently found as GPGA or GPGQ, indicating that it could have evolved from (GA)n or (GQ)n motifs, which are the most common motifs in all other MiSp sequences (Fig. 3). Intragenic concerted evolution could rapidly proliferate these mutations throughout a single gene as has been proposed for other spidroin paralogs [16, 17, 45, 46]. The near identity of spacer sequences within each of our cobweb weaver MiSp variants (Fig. 4) supports the hypothesis that intragenic concerted evolution is common, as also found in orb-web weaver MiSp [30, 31]. However, intragenic concerted evolution must be offset by some other molecular processes, since the remainder of the repetitive regions are not as homogenized as the spacers (Fig. 3). The simple amino acid motifs found in MiSp such as (GA)n are encoded by microsatellite-like sequences and could thus be prone to high rates of slipped strand mispairing. It is also possible that excessive homogenization of MiSp repetitive regions adversely affects its function and is eliminated by selection.
The high proportion of the GPG amino acid motif in S. grossa MiSp correlates with higher extensibility in S. grossa minor ampullate silk fibers (Fig. 5c, Table 3). GPG content is probably not the only predictor of extensibility, however, since Latrodectus minor ampullate fibers were more extensible than orb-web weavers’ even though GPG content for L. hesperus is similar to the orb-web weavers. Furthermore, we did not find a significant difference in extensibility between L. hesperus and L. geometricus minor ampullate silk fibers despite L. geometricus MiSp_v1 having a relatively high percentage of GPG motifs (Fig. 3). The MiSp_v2: MiSp_v1 transcript ratio in L. geometricus suggests that the GPG-rich MiSp_v1 is in lower abundance in L. geometricus (Table 2), which could explain why there is not a significant increase in extensibility of its minor ampullate silk fibers in comparison to L. hesperus. Cobweb weaving spiders use MiSp in their prey wrapping silks , and evolution of extensibility-conferring motifs could potentially allow for capturing different prey types by S. grossa compared to other cobweb weavers, although little is known about the ecology or diet of this species. It is also possible that L. geometricus modulates expression of its two MiSp-encoding loci in response to prey availability. Experimentally manipulating prey type has been shown to affect proline content of major ampullate fibers in Nephila, potentially as a result of plastic changes in relative expression levels of the proline-poor MaSp1 and the proline-rich MaSp2 [47, 48]. Variation among individuals in MaSp1 and MaSp2 transcript abundance has also been demonstrated for black widows . Analogous experiments have not been done for minor ampullate fibers. However, the variation in MiSp_v2: MiSp_v1 ratios that we found between L. geometricus individuals (Table 2) suggests plasticity of expression is possible.
Our six complete or nearly complete single exon MiSp-containing clones can also serve as templates for recombinant proteins. Due to the dramatic differences in length and amino acid content of some of the proteins encoded by these loci, it may be possible to spin artificial fibers with custom-made properties. For instance, the GPG-containing L. geometricus MiSp_v1 could be used to make more extensible fibers, while the longer alanine-rich L. hesperus MiSp_v1 may make stiffer fibers.
We found that intragenic concerted evolution within MiSp-encoding genes likely led to rapid proliferation of proline replacements for alanine or glutamine in MiSp protein sequences independently in at least two species. For one species, the proliferation of proline coincides with higher extensibility of minor ampullate silks. This could allow cobweb weavers to access new prey types or the ability to modulate mechanical properties of their silks through altering expression levels of MiSp gene copies. Our multiple nearly complete MiSp sequences also provide various templates for tailored biomimetic applications through recombinant DNA technologies.
Individual L. hesperus were collected in Riverside, California and Tucson, Arizona. L. geometricus individuals were collected in San Diego, California. L. tredecimguttatus and S. grossa were obtained from SpiderPharm (Yarnell, Arizona). P. tepidariorum were obtained from a laboratory culture founded with spiders collected near Cologne, Germany and purchased from SpiderPharm.
Genomic library clone
We screened an L. hesperus genomic library with PCR for clones containing MiSp (see  for library construction and screening protocols). Primers used in screening were designed from the C-terminal encoding region of a partial-length L. hesperus MiSp cDNA (HM752571; primers listed in Additional file 1: Table S3). An ~49 kb MiSp containing clone was shotgun sequenced and assembled to 8× coverage by Qiagen (Hilden, Germany).
PCR amplification and cloning of Latrodectus genomic MiSp
We amplified full-length L. hesperus MiSp from genomic DNA using primers designed from the 3’ UTR of the C-terminal encoding MiSp cDNA (HM752571) and the N-terminal region of another partial MiSp cDNA (HM752570). We amplified genomic MiSp from L. tredecimguttatus and L. geometricus first using primers designed from L. hesperus MiSp N and C-terminal cDNAs and then with internal species-specific primers (primers in Additional file 1: Table S3). Genomic PCR was performed with 1 unit of AccuPrimeTM Taq DNA Polymerase High Fidelity (Invitrogen, Carlsbad, CA) according to manufacturer’s instructions. Cycling conditions were 40 cycles of 94 °C for 30 s, 50–60 °C (depending on primer pair) for 45 s, and 68 °C for 10 min.
PCR products were gel-excised and cloned using the TOPO®-TA Cloning® Kit (Invitrogen). Clones were screened by PCR amplification of MiSp N and/or C-terminal encoding region, and MiSp positive clones were sequenced with Sp6, T7, M13F, or M13R universal primers. End sequencing of TOPO clones indicated that, within both L. geometricus and L. tredecimguttatus, clones clustered into two distinct groups (Additional file 1: Figure S2). For both L. tredecimguttatus and L. geometricus, the longest clones representing each cluster were completely sequenced. End sequences and restriction digest patterns of L. hesperus TOPO clones were very similar. Thus, one clone was chosen for complete sequencing. Clones chosen for complete sequencing were subjected to random transposon insertion with the GPS-1 Genome Priming System (New England Biolabs, Ipswich, MA). Position of transposon insertion was mapped with restriction digests and clones were sequenced from both ends of the transposon. Sequences were assembled with Sequencher v. 4.9 (GeneCodes).
Steatoda grossa and P. tepidariorum cDNA libraries
We constructed cDNA libraries from S. grossa total silk glands and P. tepidariorum major and minor ampullate silk glands following procedures detailed in Garb et al. (2010). In brief, total RNA was extracted with TRIzol® (Invitrogen) and the RNeasy Mini Kit (Qiagen, Valencia, CA). mRNA was isolated with oligo-(dT)-tagged magnetic beads (Invitrogen). cDNA was synthesized with SuperScript®III (Invitrogen), fractionated by size with ChromaSpin 1000 columns (Clontech, Mountain View, CA), blunt-end ligated into pZErOTM-2 (Invitrogen), and electroporated into TOP10 Escherichia coli cells (Invitrogen). Approximately 2000 colonies per library were arrayed and stored at −80 °C. The libraries were screened by gel electrophoresis of plasmid DNA . Plasmids with inserts >500 bp were sequenced with T7 or SP6 universal primers. We identified silk protein-encoding sequences by conceptual translations and comparisons to published sequences with BLASTX .
An S. grossa cDNA clone (designated Sg MiSp) with a 3 kb insert containing the C-terminal encoding portion of MiSp was completely sequenced by random insertion of transposons with the EZ-Tn5 < Tet > kit (Epicentre). Transposon mapping, sequencing, and assembly were performed as for TOPO clones above.
Amplification and cloning of S. grossa and P. tepidariorum MiSp
We attempted to amplify complete MiSp encoding sequences from S. grossa genomic DNA using primers designed from conserved regions of Latrodectus MiSp N-terminal encoding sequences and from the C-terminal encoding region of the S. grossa MiSp cDNA (Additional file 1: Table S3). S. grossa genomic DNA was amplified with 1 unit of Phusion® High-Fidelity DNA Polymerase (New England Biolabs), 200 μM each dNTP, 0.5 μM each primer, 1X Phusion® GC buffer, and 3% DMSO. Cycling conditions were 40 cycles of 98 °C for 5 s, 58 °C for 15 s, and 72 °C for 2.5 min. PCR products were gel-excised and cloned using the TOPO®-Blunt Cloning® Kit (Invitrogen). End sequencing of these clones revealed numerous premature stop codons and thus clones were not completely sequenced.
In order to obtain longer N-terminal MiSp encoding sequences from S. grossa, we amplified S. grossa genomic DNA using forward primers designed from the conserved region of Latrodectus MiSp N-terminal encoding sequences or from the S. grossa N-terminal MiSp cDNA, and a reverse primer designed from the repetitive region of the S. grossa C-terminal cDNA (Additional file 1: Table S3). Genomic DNA was amplified with 1 unit GoTaq Polymerase (Promega), 1X GoTaq Buffer, 200 μM each dNTP, 0.5 μM each primer, and 1.5 mM MgCl2 with 45 cycles of 30s at 94 °C, 45 s at 54.2 °C or 61 °C, and 1.5 or 2 min at 72 °C. PCR products from the two amplifications were separately gel-excised and cloned using the TOPO®-TA XL PCR Cloning ® Kit (Invitrogen). TOPO clones from each reaction were amplified with M13 forward and reverse primers and sequenced with T7 or SP6 universal primers.
A MiSp gene was identified in the P. tepidariorum genome by BLASTN searches of our N- and C-terminal encoding MiSp cDNA sequences against the genome assembly  generated by the i5K initiative . We found alignments of both cDNAs to a region on Scaffold 853 that included an Augustus-predicted gene model (aug3.g7951.t3). Using the genomic sequence, we designed a reverse primer in the C-terminal encoding region and a forward primer in the adjacent repetitive region. Genomic DNA and cDNA were amplified as for S. grossa N-terminal encoding region but with 40 cycles of 30s at 94 °C, 45 s at 60.1 °C and 1 min at 72 °C. PCR products were gel-excised and sequenced. Genomic PCR products had sizes ranging from 250 bp to 2500 bp, while cDNA PCR products had sizes ranging from 250 bp to 1200 bp. Those bands that were too large to directly sequence were TOPO cloned using the TOPO-TA PCR Cloning Kit (Invitrogen) and completely sequenced by random insertion of transposons as for S. grossa TOPO clones above. Completed TOPO clones were compared to the P. tepidariorum genome by BLASTN.
Amino acid sequence characteristics
Amino acid composition and codon usage were calculated with codonW . Kyte-Doolittle hydropathy plots  with a window size of seven were generated online . Amino acid motifs GGX, An (n ≥ 4), (GA)n (n ≥ 2), and GPG were identified by word searches in Microsoft Word. Spacer regions were defined manually as regions that were not in the N or C-termini and were low in alanine and glycine and were manually aligned. Two authors (JVH and CYH) independently assigned spacer regions. Tandem repeats were identified by XSTREAM v1.4.8 using default parameters for moderate repeat degeneracy and high significance .
Transcript abundance estimation
Transcript abundance was estimated by aligning raw RNA-seq reads  (SRR1539569 and SRR1539569 for L. hesperus and L. geometricus respectively) to our full or almost full-length MiSp paralogs found in L. hesperus (Lh MiSp_v1 and MiSp_v2) and L. geometricus (Lg MiSp_v1 and MiSp_v2) using Bowtie 2 with default parameters, which searches for multiple alignments for each read but reports the one best alignment . RNA-seq reads were generated for mRNA isolated from major and minor ampullate glands of ~30 adult females and from all the silk glands (“total silk”) of a single adult female for both L. hesperus and L. geometricus. SAMtools v1.1 was used to report the number of sequence reads matching our reference MiSp sequences . To compare transcript abundance between loci within the same library, we first compared the total counts of reads aligned to the entire reference sequence. We did not correct for gene length because many of our libraries were heavily 3’-prime biased. However, to ensure that differences in gene length did not dramatically affect our estimates of the ratio of one MiSp locus to another, we also compared the counts that aligned to the last 500 bases of coding sequence (Table 2).
Relationships among spidroin paralogs
We added our theridiid and A. ventricosus  MiSp N and C-terminal coding sequences to an alignment of 29 spidroin termini analyzed previously (see ). N-terminal sequences were not available for many spidroins and thus we added 25 additional spidroin sequences to the C-terminal alignment to more comprehensively represent spidroin gene family diversity in our analyses (Table 1). Additional C-termini of spidroin paralogs were chosen for species for which MiSp has been characterized. The expanded amino acid alignment was manually edited and used to guide the nucleotide alignment in SeaView v. 4.2.7 [61, 62].
We conducted heuristic searches for maximum parsimony (MP) and maximum likelihood (ML) trees based on amino acid and nucleotide alignments in PAUP* v4.0b10 using tree bisection reconnection branch swapping and 1000 (MP) or 10 (ML) replicates of random stepwise addition of taxa . Support for clades recovered in MP analyses was evaluated with 1000 bootstrap pseudoreplicates and 10 random addition sequences per pseudoreplicate.
Bayesian analyses were carried out with MRBAYES v.3.1.2 [64, 65]. Optimal models of evolution were determined for nucleotide sequences with JMODELTEST v0.1.1  and for protein sequences with PROTTEST v2.4  for N and C-termini separately. Combined analysis of nucleotides employed a model partitioned by N and C-termini. Combined analysis of amino acids employed a mixed model, which allowed estimation of the optimal model of protein evolution during the Bayesian analysis. Default priors and Metropolis coupled, Markov-chain, Monte Carlo sampling procedures were executed for two independent runs of 10 million generations each, sampled every 1000th generation. Convergence was assessed every 1000th generation and the posterior distribution was considered adequately sampled if the standard deviation of split frequencies of these two runs was below 0.01. The mygalomorph spidroin, Bothriocyrtum californicum fibroin1 (Bc fibroin1), was set as the outgroup in all analyses.
Gene tree – species tree reconciliation
We inferred gene duplication events by reconciling each of our gene trees with two species trees (Fig. 1) using Notung v. 188.8.131.52 . Species trees were based on published phylogenies as follows. Relationships among families were based on Garrison et al.  and Dimitrov et al. . These studies agreed on all family-level relationships except the placement of Deinopidae and Uloboridae. The former placed Deinopidae sister to the RTA-clade (represented here by E. australis and A. aperta) with Uloboridae sister to Deinopidae plus the RTA-clade. The latter switched the placement of Uloboridae and Deinopidae. For relationships among theridiid species we used Liu et al.  and Garb et al. . Relationships among nephilid species were based on Kuntner et al. . Relationships among araneid genera were based on Dimitrov et al.  with relationships for Argiope species from Cheng and Kuntner .
Mechanical properties of minor ampullate fibers
We obtained minor ampullate silk from L. hesperus, L. geometricus, and S. grossa adult females within one week of being received or collected. Individual spiders were anesthetized with CO2 for 2–5 min. We then secured the spider ventral side up to a stereo microscope stage using Scotch ® tape, exposing the spinnerets. We manually pulled silk emanating from the minor ampullate spigots. Single fibers were taped to m-shaped cardstock (2.54 × 7.62 cm), with a single silk fiber suspended across each of the two rectangular notches (1 cm wide × 1.5 cm deep) in a collection card. Fibers were secured onto the cards using cyanoacrylate glue. We collected four to ten minor ampullate fibers from each of seven (L. hesperus and S. grossa) or eight (L. geometricus) individuals per species.
We measured silk fiber diameters using polarized light microscopy (as described in ). Tensile tests were conducted with a NanoBionix tensile tester (formerly MTS Systems Corp., Oak Ridge, TN; currently Keysight Technologies, Santa Rosa, CA) . In brief, each m-shaped card was cut in half, such that only one fiber segment was tested at one time. Each silk sample was mounted onto the NanoBionix and extended at a rate of 1% per second until failure. We defined strength as the true stress of the fiber at breakage. True stress is the force applied to the fiber divided by its cross-sectional area calculated from the original cross-sectional area assuming constant volume of the fiber during extension . We defined extensibility as the true strain at breakage. True strain is the natural log of the ratio of the instantaneous length of the fiber to the original gage length of the fiber. We also measured stiffness or Young’s modulus, the initial slope of the true stress-true strain curve, and toughness, the area under the true stress-true strain curve. Force-extension data were visualized using Testworks 4.0 software (MTS Systems Corp.). We tested for differences among species for the four tensile properties using multivariate analysis of variance (MANOVA). We used Tukey’s test for post-hoc pairwise comparisons. All statistical tests were carried out with SPSS 13.0 (SPSS Inc, Chicago, IL).
We compiled published tensile data for minor ampullate silks and fit the relationship between proportion of GPG amino acid motifs and extensibility to linear models implemented in R for each species for which at least one MiSp sequence and minor ampullate fiber tensile data were available. Minor ampullate silk was assumed to be composed only of MiSp and for each species all available MiSp motif percentages were averaged to estimate composition. To account for phylogenetic relationships and evolutionary distances among species we calculated phylogenetic independent contrasts using APE  implemented in R. We used estimates of divergence time from three published molecular clock studies [35, 38, 39] for branch lengths (see Additional file 1: Figure S6).
We thank Stephen Richards and colleagues at Baylor College of Medicine’s Human Genome Sequencing Center for access to the P. tepidariorum genome. We also thank Alistair McGregor, Mario Stanke, and colleagues for annotating this genome. Students in Genetics Lab (BIOL 221) at Washington and Lee University assisted with collecting sequence data.
This work was supported by the National Science Foundation (IOS-0951086 to NAA, IOS-0951061 to CYH), National Institutes of Health (F32 GM78875-1A to NAA; 1F32GM083661-01 and 1R15GM097714-01 to JEG), Army Research Office (W911NF-06-1-0455 and W911NF-11-1-0299 to CYH), and Washington and Lee University through Lenfest Summer Fellowships to NAA and Summer Research Scholarships to JVH and AKL. ERB was supported in part by a grant to Washington and Lee University from the Howard Hughes Medical Institute through the Precollege and Undergraduate Science Education Program (52007570).
Availability of data and materials
Sequences generated for this study are available in GenBank (Accessions: KX584003 - KX584055). Accessions for previously published sequences are reported in the manuscript text and in Table 1.
CYH and NAA conceived the study. JMV, ERB, JEG, EES, and NAA generated sequence data. AKL, MAC, SMC, and CYH measured tensile properties of minor ampullate silk fibers. JMV, ERB, MAC, THC, and NAA analyzed the data. JMV and NAA wrote the manuscript. All authors edited the manuscript and read and approved the final manuscript.
The authors declare that they have no competing interests.
Consent for publication
Ethics approval and consent to participate
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
- Foelix RF. Biology of Spiders. 3rd ed. New York: Oxford University Press; 2011.Google Scholar
- Gatesy J, Hayashi C, Motriuk D, Woods J, Lewis R. Extreme diversity, conservation, and convergence of spider silk fibroin sequences. Science. 2001;291:2603–5.View ArticlePubMedGoogle Scholar
- Guerette PA, Ginzinger DG, Weber BH, Gosline JM. Silk properties determined by gland-specific expression of a spider fibroin gene family. Science. 1996;272:112–5.View ArticlePubMedGoogle Scholar
- Vollrath F. Spider webs and silks. Sci Am. 1992;266:70–6.View ArticleGoogle Scholar
- Mattina CL, Reza R, Hu X, Falick AM, Vasanthavada K, McNary S, et al. Spider minor ampullate silk proteins are constituents of prey wrapping silk in the cob weaver Latrodectus hesperus†. Biochemistry (Mosc). 2008;47:4692–700.View ArticleGoogle Scholar
- Blackledge TA, Summers AP, Hayashi CY. Gumfooted lines in black widow cobwebs and the mechanical properties of spider capture silk. Zoology. 2005;108:41–6.View ArticlePubMedGoogle Scholar
- Gosline JM, DeMont ME, Denny MW. The structure and properties of spider silk. Endeavour. 1986;10:37–43.View ArticleGoogle Scholar
- Blackledge TA, Hayashi CY. Silken toolkits: biomechanics of silk fibers spun by the orb web spider Argiope argentata (Fabricius 1775). J Exp Biol. 2006;209:2452–61.View ArticlePubMedGoogle Scholar
- Hayashi CY, Blackledge TA, Lewis RV. Molecular and mechanical characterization of aciniform silk: Uniformity of iterated sequence modules in a novel member of the spider silk fibroin gene family. Mol Biol Evol. 2004;21:1950–9.View ArticlePubMedGoogle Scholar
- Liivak O, Flores A, Lewis R, Jelinski LW. Conformation of the polyalanine repeats in minor ampullate gland silk of the spider Nephila clavipes. Macromolecules. 1997;30:7127–30.View ArticleGoogle Scholar
- Pérez-Rigueiro J, Elices M, Llorca J, Viney C. Tensile properties of Argiope trifasciata drag line silk obtained from the spider’s web. J Appl Polym Sci. 2001;82:2245–51.View ArticleGoogle Scholar
- Swanson BO, Blackledge TA, Beltrán J, Hayashi CY. Variation in the material properties of spider dragline silk across species. Appl Phys A. 2006;82:213–8.View ArticleGoogle Scholar
- Guinea GV, Elices M, Plaza GR, Perea GB, Daza R, Riekel C, et al. Minor ampullate silks from Nephila and Argiope spiders: tensile properties and microstructural characterization. Biomacromolecules. 2012;13:2087–98.View ArticlePubMedGoogle Scholar
- Papadopoulos P, Ene R, Weidner I, Kremer F. Similarities in the structural organization of major and minor ampullate spider silk. Macromol Rapid Commun. 2009;30:851–7.View ArticlePubMedGoogle Scholar
- Stauffer SL, Coguill SL, Lewis RV. Comparison of physical properties of three silks from Nephila clavipes and Araneus gemmoides. J Arachnol. 1994;5–11.Google Scholar
- Ayoub NA, Garb JE, Tinghitella RM, Collin MA, Hayashi CY. Blueprint for a high-performance biomaterial: full-length spider dragline silk genes. PLoS One. 2007;2, e514.View ArticlePubMedPubMed CentralGoogle Scholar
- Ayoub NA, Garb JE, Kuelbs A, Hayashi CY. Ancient properties of spider silks revealed by the complete gene sequence of the prey-wrapping silk protein (AcSp1). Mol Biol Evol. 2013;30:589–601.View ArticlePubMedGoogle Scholar
- Hayashi CY, Shipley NH, Lewis RV. Hypotheses that correlate the sequence, structure, and mechanical properties of spider silk proteins. Int J Biol Macromol. 1999;24:271–5.View ArticlePubMedGoogle Scholar
- Motriuk-Smith D, Smith A, Hayashi CY, Lewis RV. Analysis of the conserved N-terminal domains in major ampullate spider silk proteins. Biomacromolecules. 2005;6:3152–9.View ArticlePubMedGoogle Scholar
- Sponner A, Schlott B, Vollrath F, Unger E, Grosse F, Weisshart K. Characterization of the protein components of Nephila clavipes dragline silk. Biochemistry (Mosc). 2005;44:4727–36.View ArticleGoogle Scholar
- Vasanthavada K, Hu X, Falick AM, La Mattina C, Moore AM, Jones PR, et al. Aciniform spidroin, a constituent of egg case sacs and wrapping silk fibers from the black widow spider Latrodectus hesperus. J Biol Chem. 2007;282:35088–97.View ArticlePubMedGoogle Scholar
- Holland GP, Jenkins JE, Creager MS, Lewis RV, Yarger JL. Quantifying the fraction of glycine and alanine in β-sheet and helical conformations in spider dragline silk using solid-state NMR. Chem Commun. 2008;43:5568–70.View ArticleGoogle Scholar
- Holland GP, Creager MS, Jenkins JE, Lewis RV, Yarger JL. Determining secondary structure in spider dragline silk by carbon- carbon correlation solid-state NMR spectroscopy. J Am Chem Soc. 2008;130:9871–7.View ArticlePubMedGoogle Scholar
- Jenkins JE, Creager MS, Lewis RV, Holland GP, Yarger JL. Quantitative correlation between the protein primary sequences and secondary structures in spider dragline silks. Biomacromolecules. 2009;11:192–200.View ArticleGoogle Scholar
- Jenkins JE, Creager MS, Butler EB, Lewis RV, Yarger JL, Holland GP. Solid-state NMR evidence for elastin-like β-turn structure in spider dragline silk. Chem Commun. 2010;46:6714–6.View ArticleGoogle Scholar
- van Beek JD, Hesst S, Vollrath F, Meier BH. The molecular structure of spider dragline silk: Folding and orientation of the protein backbone. Proc Natl Acad Sci U S A. 2002;99:10266–71.View ArticlePubMedPubMed CentralGoogle Scholar
- Hinman MB, Lewis RV. Isolation of a clone encoding a second dragline silk fibroin. Nephila clavipes dragline silk is a two-protein fiber. J Biol Chem. 1992;267:19320–4.PubMedGoogle Scholar
- Xu M, Lewis RV. Structure of a protein superfiber: spider dragline silk. Proc Natl Acad Sci. 1990;87:7120–4.View ArticlePubMedPubMed CentralGoogle Scholar
- Colgin MA, Lewis RV. Spider minor ampullate silk proteins contain new repetitive sequences and highly conserved non-silk-like “spacer regions.”. Protein Sci. 1998;7:667–72.View ArticlePubMedPubMed CentralGoogle Scholar
- Chen G, Liu X, Zhang Y, Lin S, Yang Z, Johansson J, et al. Full-length minor ampullate spidroin gene sequence. PloS One. 2012;7, e52293.View ArticlePubMedPubMed CentralGoogle Scholar
- Gao Z, Lin Z, Huang W, Lai CC, Fan J, Yang D. Structural characterization of minor ampullate spidroin domains and their distinct roles in fibroin solubility and fiber formation. PLoS ONE. 2013;8, e56142.View ArticlePubMedPubMed CentralGoogle Scholar
- Garb JE, Ayoub NA, Hayashi CY. Untangling spider silk evolution with spidroin terminal domains. BMC Evol Biol. 2010;10:243.View ArticlePubMedPubMed CentralGoogle Scholar
- Wheelan SJ, Church DM, Ostell JM. Spidey: A tool for mRNA-to-genomic alignments. Genome Res. 2001;11:1952–7.PubMedPubMed CentralGoogle Scholar
- Garb JE, González A, Gillespie RG. The black widow spider genus Latrodectus (Araneae: Theridiidae): Phylogeny, biogeography, and invasion history. Mol Phylogenet Evol. 2004;31:1127–42.View ArticlePubMedGoogle Scholar
- Liu J, May-Collado LJ, Pekár S, Agnarsson I. A revised and dated phylogeny of cobweb spiders (Araneae, Araneoidea, Theridiidae): a predatory cretaceous lineage diversifying in the era of the ants (Hymenoptera, Formicidae). Mol Phylogenet Evol. 2015;94:658–75.View ArticlePubMedGoogle Scholar
- Bond JE, Garrison NL, Hamilton CA, Godwin RL, Hedin M, Agnarsson I. Phylogenomics resolves a spider backbone phylogeny and rejects a prevailing paradigm for orb web evolution. Curr Biol. 2014;24:1765–71.View ArticlePubMedGoogle Scholar
- Coddington JA. Phylogeny and classification of spiders. In: Ubick D, Paquin P, Cushing PE, Roth V, editors. Spiders of North America: and Identification Manual. 2005. p. 18-24.Google Scholar
- Garrison NL, Rodriguez J, Agnarsson I, Coddington JA, Griswold CE, Hamilton CA, et al. Spider phylogenomics: untangling the Spider Tree of Life. PeerJ. 2016;4, e1719.View ArticlePubMedPubMed CentralGoogle Scholar
- Dimitrov D, Benavides LR, Arnedo MA, Giribet G, Griswold CE, Scharff N, et al. Rounding up the usual suspects: a standard target-gene approach for resolving the interfamilial phylogenetic relationships of ecribellate orb-weaving spiders with a new family-rank classification (Araneae, Araneoidea). Cladistics. 2016;online early.Google Scholar
- Starrett J, Garb JE, Kuelbs A, Azubuike UO, Hayashi CY. Early events in the evolution of spider silk genes. PLoS One. 2012;7, e38084.View ArticlePubMedPubMed CentralGoogle Scholar
- Ayoub NA, Hayashi CY. Multiple recombining loci encode MaSp1, the primary constituent of dragline silk, in widow spiders (Latrodectus: Theridiidae). Mol Biol Evol. 2008;25:277–86.View ArticlePubMedGoogle Scholar
- Beckwitt R, Arcidiacono S, Stote R. Evolution of repetitive proteins: spider silks from Nephila clavipes (Tetragnathidae) and Araneus bicentenarius (Araneidae). Insect Biochem Mol Biol. 1998;28:121–30.View ArticlePubMedGoogle Scholar
- Rising A, Johansson J, Larson G, Bongcam-Rudloff E, Engström W, Hjälm G. Major ampullate spidroins from Euprosthenops australis: multiplicity at protein, mRNA and gene levels. Insect Mol Biol. 2007;16:551–61.PubMedGoogle Scholar
- Ittah S, Cohen S, Garty S, Cohn D, Gat U. An essential role for the C-terminal domain of a dragline spider silk protein in directing fiber formation. Biomacromolecules. 2006;7:1790–5.View ArticlePubMedGoogle Scholar
- Chaw RC, Zhao Y, Wei J, Ayoub NA, Allen R, Atrushi K, et al. Intragenic homogenization and multiple copies of prey-wrapping silk genes in Argiope garden spiders. BMC Evol Biol. 2014;14:31.View ArticlePubMedPubMed CentralGoogle Scholar
- Hayashi CY, Lewis RV. Molecular architecture and evolution of a modular spider silk protein gene. Science. 2000;287:1477–9.View ArticlePubMedGoogle Scholar
- Blamires SJ, Chao I-C, Tso I-M. Prey type, vibrations and handling interactively influence spider silk expression. J Exp Biol. 2010;213:3906–10.View ArticlePubMedGoogle Scholar
- Tso I-M, Wu H-C, Hwang I-R. Giant wood spider Nephila pilipes alters silk protein in response to prey variation. J Exp Biol. 2005;208:1053–61.View ArticlePubMedGoogle Scholar
- Lane AK, Hayashi CY, Whitworth GB, Ayoub NA. Complex gene expression in the dragline silk producing glands of the Western black widow (Latrodectus hesperus). BMC Genomics. 2013;14:1.View ArticleGoogle Scholar
- Beuken E, Vink C, Bruggeman CA. Enhanced efficiency of cloning FACS ® -sorted mammalian cells. BioTechniques. 1998;24:750–2.Google Scholar
- Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J Mol Biol. 1990;215:403–10.View ArticlePubMedGoogle Scholar
- BLAST Query | i5k - App [Internet]. [cited 2014 Jun 3]. Available from: https://i5k.nal.usda.gov/webapp/blast/.
- i5K Consortium. The i5K Initiative: advancing arthropod genomics for knowledge, human health, agriculture, and the environment. J Hered. 2013;104:595–600.View ArticlePubMed CentralGoogle Scholar
- Peden J. codonW [Internet]. [cited 2015 Jul 7]. Available from: http://codonw.sourceforge.net/.
- Kyte J, Doolittle RF. A simple method for displaying the hydropathic character of a protein. J Mol Biol. 1982;157:105–32.View ArticlePubMedGoogle Scholar
- Kyte-Doolittle entry form [Internet]. [cited 2015 Jul 6]. Available from: http://gcat.davidson.edu/rakarnik/kyte-doolittle.htm.
- Newman AM, Cooper JB. XSTREAM: a practical algorithm for identification and architecture modeling of tandem repeats in protein sequences. BMC Bioinformatics. 2007;8:382.View ArticlePubMedPubMed CentralGoogle Scholar
- Clarke TH, Garb JE, Hayashi CY, Arensburger P, Ayoub NA. Spider transcriptomes identify ancient large-scale gene duplication event potentially important in silk gland evolution. Genome Biol Evol. 2015;7:1856–70.View ArticlePubMedPubMed CentralGoogle Scholar
- Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat Methods. 2012;9:357–9.View ArticlePubMedPubMed CentralGoogle Scholar
- Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, et al. The sequence alignment/map format and SAMtools. Bioinformatics. 2009;25:2078–9.View ArticlePubMedPubMed CentralGoogle Scholar
- Galtier N, Gouy M, Gautier C. SEAVIEW and PHYLO_WIN: two graphic tools for sequence alignment and molecular phylogeny. Comput Appl Biosci CABIOS. 1996;12:543–8.PubMedGoogle Scholar
- Gouy M, Guindon S, Gascuel O. SeaView version 4: a multiplatform graphical user interface for sequence alignment and phylogenetic tree building. Mol Biol Evol. 2010;27:221–4.View ArticlePubMedGoogle Scholar
- Swofford DL. PAUP* version 4.0 b10. Sunderland: Sinauer; 2002.Google Scholar
- Huelsenbeck JP, Ronquist F. MRBAYES: Bayesian inference of phylogenetic trees. Bioinformatics. 2001;17:754–5.View ArticlePubMedGoogle Scholar
- Ronquist F, Huelsenbeck JP. MrBayes 3: Bayesian phylogenetic inference under mixed models. Bioinformatics. 2003;19:1572–4.View ArticlePubMedGoogle Scholar
- Posada D. jModelTest: phylogenetic model averaging. Mol Biol Evol. 2008;25:1253–6.View ArticlePubMedGoogle Scholar
- Abascal F, Zardoya R, Posada D. ProtTest: selection of best-fit models of protein evolution. Bioinformatics. 2005;21:2104–5.View ArticlePubMedGoogle Scholar
- Durand D, Halldórsson BV, Vernot B. A hybrid micro-macroevolutionary approach to gene tree reconstruction. J Comput Biol. 2006;13:320–35.View ArticlePubMedGoogle Scholar
- Kuntner M, Arnedo MA, Trontelj P, Lokovšek T, Agnarsson I. A molecular phylogeny of nephilid spiders: evolutionary history of a model lineage. Mol Phylogenet Evol. 2013;69:961–79.View ArticlePubMedGoogle Scholar
- Cheng R-C, Kuntner M. Phylogeny suggests nondirectional and isometric evolution of sexual size dimorphism in argiopine spiders: nondirectional and isometric evolution of SSD. Evolution. 2014;68:2861–72.View ArticlePubMedGoogle Scholar
- Vollrath F, Madsen B, Shao Z. The effect of spinning conditions on the mechanics of a spider’s dragline silk. Proc R Soc Lond B Biol Sci. 2001;268:2339–46.View ArticleGoogle Scholar
- Paradis E, Claude J, Strimmer K. APE: Analyses of Phylogenetics and Evolution in R language. Bioinformatics. 2004;20:289–90.View ArticlePubMedGoogle Scholar
- Rising A, Hjälm G, Engström W, Johansson J. N-terminal nonrepetitive domain common to dragline, flagelliform, and cylindriform spider silk proteins. Biomacromolecules. 2006;7:3120–4.View ArticlePubMedGoogle Scholar
- Work RW. Dimensions, birefrigences, and force-elongation behavior of major and minor ampullate silk fibers from orb-web-spinning spiders - the effects of wetting on these properties. Text Res J. 1977;47:650–62.View ArticleGoogle Scholar