Skip to main content

Evolution of the avian β-defensin and cathelicidin genes



β-defensins and cathelicidins are two families of cationic antimicrobial peptides (AMPs) with a broad range of antimicrobial activities that are key components of the innate immune system. Due to their important roles in host defense against rapidly evolving pathogens, the two gene families provide an ideal system for studying adaptive gene evolution. In this study we performed phylogenetic and selection analyses on β-defensins and cathelicidins from 53 avian species representing 32 orders to examine the evolutionary dynamics of these peptides in birds.

Results and conclusions

Avian β-defensins are found in a gene cluster consisting of 13 subfamiles. Nine of these are conserved as one to one orthologs in all birds, while the others (AvBD1, AvBD3, AvBD7 and AvBD14) are more subject to gene duplication or pseudogenisation events in specific avian lineages. Avian cathelicidins are found in a gene cluster consisting of three subfamilies with species-specific duplications and gene loss. Evidence suggested that the propiece and mature peptide domains of avian cathelicidins are possibly co-evolving in such a way that the cationicity of the mature peptide is partially neutralised by the negative charge of the propiece prior to peptide secretion (further evidence obtained by repeating the analyses on primate cathelicidins). Negative selection (overall mean dN < dS) was detected in most of the gene domains examined, conserving certain amino acid residues that may be functionally crucial for the avian β-defensins and cathelicidins, while episodic positive selection was also involved in driving the diversification of specific codon sites of certain AMPs in avian evolutionary history. These findings have greatly improved our understanding of the molecular evolution of avian AMPs and will be useful to understand their role in the avian innate immune response. Additionally, the large dataset of β-defensin and cathelicidin peptides may also provide a valuable resource for translational research and development of novel antimicrobial agents in the future.


Defensins and cathelicidins are two families of cationic small peptides that have broad-spectrum antimicrobial activities against a wide range of bacterial, fungal or viral pathogens. These peptides are produced in a large variety of invertebrate and vertebrate organisms, representing an ancient form of host defense against microbes. In addition to their antimicrobial function, defensins and cathelicidins have also been found to exhibit diverse immunomodulatory activities, rendering them important components of both innate and adaptive immune systems [13]. With a key role in host defense against rapidly evolving pathogens, defensin and cathelicidin gene families provide an ideal system for studying adaptive molecular evolution [4]. Previous studies in mammalian lineages have revealed positive selection driving rapid divergence of these host defense peptides [4, 5]. Recent whole-genome sequence analysis of 48 bird species [6] has enabled us to perform a comprehensive comparative analysis on the avian lineages of defensins and cathelicidins, which will not only greatly improve our understanding of the evolutionary diversification of these ancient peptides over the past 100 million years through the avian radiation, but also provide a valuable resource for developing novel antibiotics to treat microbial infections in birds and other vertebrates [7, 8].

Among the three defensin subfamilies (α, β and θ) identified in vertebrates, only β-defensins have been found in birds [9, 10]. Avian β-defensins (AvBDs) have been previously described in the chicken (Gallus gallus), zebra finch (Taeniopygia guttata), mallard (Anas platyrhynchos) and crested ibis (Nipponia nippon) with a cluster of 14, 22, 18 and 14 closely linked β-defensin genes found in each genome, respectively [913]. AvBD genes usually consist of 3–4 exons [10]. The first exon contains the 5′-untranslated region and the others contain the open reading frame encoding three peptide domains, including a hydrophobic, leucine-rich signal peptide, a short propiece (absent in AvBD12) and the mature peptide. The mature β-defensin forms a characteristic β sheet-rich fold, which contains three disulfide bridges (Cys1-Cys5, Cys2-Cys4 and Cys3-Cys6) within a conserved six-cysteine framework. AvBDs have been shown to be expressed in a number of tissues with varying levels of expression [14, 15] and play a role in the innate host defense against viruses, Gram-positive and Gram-negative bacteria, and parasites in the digestive system [1621] and reproductive system [2226]. Evidence has also indicated that they may have similar regulatory functions as mammalian β-defensins in various immune processes, such as chemotaxis, wound-healing, and inducing or suppressing inflammatory responses [reviewed in 1]. Within-species genetic diversity and trans-species polymorphisms of AvBDs have been observed in passerine bird species [27, 28].

Cathelicidins are characterised by having a conserved cathelin domain and a highly variable mature peptide domain. Four avian cathelicidin genes have been identified in the chicken, designated CATHL1, CATHL2, CATHL3 and CATHB1 [2, 2931]. These genes share similar structures with mammalian cathelicidins, each comprising of four exons, encoding a prepropiece consisting of a signal peptide, the cathelin-like domain (propiece) and the mature peptide. One unusual feature found in the propiece of chicken CATHB1, is the presence of nine octamer repeats (PGLDGSXS) N-terminal to the cathelin domain [30], which is not seen in other cathelicidin genes. Mature avian cathelicidins are activated before secretion with the propiece cleaved off by serine proteases [32] and form an alpha-helix predominant structure with a kink induced by a glycine or proline close to the centre [3335]. All four chicken cathelicidins show high levels of expression in immune organs and gastrointestinal, respiratory and urogenital tracts, with CATHL1, 2 and 3 most highly expressed in the bone marrow and lung, and CATHB1 in the bursa of Fabricius [36]. In addition to broad antimicrobial activity against bacteria and fungi, avian cathelicidins also play a range of immunoregulatory roles, such as blocking lipopolysaccharide-induced cytokine expression [37] or inducing specific chemokine production [31].

In this paper, we studied the evolution of β-defensins and cathelicidins in 53 avian species (Table 1) and discussed the following issues: conservation and lineage-specific duplication/deletion of genes, conserved genomic organisation of β-defensin and cathelicidin gene clusters, coevolution between pro- and mature peptides in cathelicidins, and amino acid sites and evolutionary branches under selection.

Table 1 List of 53 examined bird species

Results and discussion

Avian β-defensins and cathelicidins

A total of 758 genes, including 714 β-defensins and 44 cathelicidins, were annotated in 53 avian genomes (scaffold number and gene coordinates provided in Additional file 1; nucleotide sequences in Additional file 2). The selected species represent 32 different avian orders spanning 52 genera (Table 1), last sharing a common ancestor around 114 million years ago (Fig. 1) [38].

Fig. 1

Phylogenetic tree showing evolutionary relationships between 53 examined bird species. The tree was generated via webserver [38]. Branch length indicates evolutionary time; numbers besides the nodes are estimated divergence time (million years before present). The number of genes detected to have undergone episodic diversifying selection over total number of annotated genes in each species is shown to the right of the species name

Annotated genes were designated based on their orthologous relationships with chicken β-defensins (AvBD1-AvBD14) and cathelicidins (CATHL1, CATHL2, CATHL3 and CATHB1). Phylogenetic analysis revealed 13 distinct AvBD groups, namely AvBD1, AvBD2, AvBD3, AvBD4, AvBD5, AvBD6 and AvBD7, AvBD8, AvBD9, AvBD10, AvBD11, AvBD12, AvBD13, and AvBD14 (Fig. 2a), while cathelicidin genes cluster into three clades, CATHL1 and CATHL3, CATHL2, and CATHB1 (Fig. 2b).

Fig. 2

Phylogenetic relationships of avian (a) β-defensin and (b) cathelicidin subfamilies. Bootstrap values higher than 40 % are shown next to the branches

Characteristics of AvBD gene subfamilies are summarised in Table 2 (alignments shown in Additional file 3). Ten genes (AvBD2, 4, 5, 7, 8, 9, 10, 11, 12 and 13) are conserved as one-to-one orthologues in most surveyed species, suggesting a high level of conservation of these genes for over 100 million years (Fig. 3). AvBD1 and AvBD3 are subject to lineage-specific expansion, with up to three AvBD1 paralogues found in the killdeer (Charadrius vociferus), saker falcon (Falco cherrug), medium ground finch (Geospiza fortis) and zebra finch; extensive AvBD3 duplications occurred in Passeriformes with up to 14 paralogues found in the white-throated sparrow (Zonotrichia albicollis). AvBD6 is a duplication of AvBD7, which has arisen within Galliformes after the divergence of family Odontophoridae and before Phasianidae, as it is present in chicken and turkey but absent in the northern bobwhite (Colinus virginianus). AvBD7 has degenerated into a pseudogene in the sunbittern (Eurpyga helias) and is missing in all three psittacines. AvBD14 was found in 23 species and has been degraded into a pseudogene in the orders Falconiformes, Passeriformes and Psittaciformes, and species Columbia livia, Gavia stellata, and Calypte anna.

Table 2 Characteristics of avian β-defensin subfamilies
Fig. 3

Schematic diagram of avian β-defensin gene clusters

The spacing between cysteines within the defensin motif is conserved within subfamilies with the overall consensus being C (4-7) C (3-6) C (7-10) C (5-6) CC, numbers representing the number of residues between cysteines (Table 2). The majority of subfamilies have 9–11 residues preceding the first cysteine of the defensin motif. Exceptions include the group of passeriforme AvBD3s immediately preceding AvBD5 which have 2 residues prior to the first cysteine in addition to having only 2 exons. AvBD1 and AVBD3 have the same spacing of cysteines but differ in the number of residues prior to the first cysteine.

Cathelicidins were only found in 21 surveyed species due to low assembly quality of the genomic regions. Similar to the 10 relatively conserved AvBDs, CATHL2, CATHL3 and CATHB1 have been conserved across a variety of avian orders (Fig. 4). Evidence suggested that CATHL3 has been reduced to a pseudogene in Falconiformes and lost in Passeriformes, yet duplicated to give rise to CATHL1 in Galliformes. Annotation results also suggested that CATHL2 and CATHL3 may have been lost in Sphenisciformes and Ciconiiformes, represented by the emperor penguin (Aptenodytes forsteri) and crested ibis (Nipponia nippon), respectively (Fig. 4), though this could be an artefact caused by high degree of assembly gaps in the genomic region. The octamer repeats feature found in chicken CATHB1 appeared to be unique to Galliformes (or Phasianidae), as it has only been seen in the chicken and turkey.

Fig. 4

Schematic diagram of avian cathelicidin gene clusters

Conserved synteny

Avian β-defensins cluster in a syntenic region flanked by cathepsin B (CTSB) and translocation associated membrane protein 2 (TRAM2) genes (Fig. 3). The gene order is highly conserved as CTSB, AvBD13, 12, 11, 10, 9, 8, 7, 2, 1, 3, 5, 4, 14, and TRAM2, with duplicated AvBD1 and 3 interspersed in the region between AvBD2 and 5. The strand orientations of AvBDs are also strictly maintained, with the exception of several inverted AvBD1 and 3 gene duplicates in Passeriformes. However, it should be noted that most of the genome assemblies surveyed were of draft quality with defensins found on up to 11 different scaffolds/contigs, thus there may be unrevealed lineage-specific gene rearrangements.

Similar to AvBDs, avian cathelicidins also form a conserved gene cluster, which is flanked by kelch-like family member 18 (KLHL18) and transforming growth factor beta regulator 4 (TBRG4) genes (Fig. 4). The majority of species share a conserved gene order of KLHL18, CATHL2, CATHL3, CATHB1 and TBRG4. Exceptions include an inversion of the region containing CATHL3 and CATHL2 in Galliformes, and CATHL3 and CATHB1 being arranged in the reverse order in the common cuckoo (Cuculus canorus).

Such clustering of homologous genes in a tightly linked fashion is rather common for immune genes. Other well-known examples include the major histocompatibility complex (MHC), immunoglobulins (Ig), Fc receptors (FcR) and killer-cell Ig-like receptors (KIR). These immune gene families are believed to be regularly refreshed via in cis duplication, resulting in related genes lying next to each other in linked array in the genome [39]. It has been suggested that immune genes clustering together may be biologically significant in that it may facilitate the coordinated expression of functionally related loci, and therefore has been selectively maintained [39, 40].

Net charge of mature peptides of β-defensins and cathelicidins

Disruption of microbial membranes is a major mechanism underlying antimicrobial activity of defensins and cathelicidins [41] and it has been demonstrated that the net charge of the mature peptide directly influences its antimicrobial potency [42]. The net charge of the mature peptide ranges from −2.9 to +10.0 in avian β-defensins and +4.0 to +12.0 in cathelicidins (Fig. 5). On average, cathelicidins have higher charges than the defensins. Six defensin subfamilies (AvBD5, 8, 9, 10, 11 and 12) have an average net charge lower than +4.0, with AvBD12 showing the lowest average charge (+0.1). AvBD3 and all cathelicidin subfamilies have an average charge higher than +6.0. The low net charge of certain AvBDs indicates that they may have lower activities in terms of direct killing microbes. By contrast, the highly cationic peptides, such as AvBD3 in the emperor penguin, Adélie penguin (Pygoscelis adeliae) and yellow-throated sandgrouse (Pterocles gutturalis) with a net charge of +10.0, and CATHB1 in the kea (Nestor notabilis) with a net charge of +12.0, may provide valuable templates for developing new antimicrobial agents. The mature peptide sequences of all avian β-defensins and cathelicidins are provided in Additional file 4 and net charges shown in Additional file 5.

Fig. 5

Comparison of net charges of putative avian β-defensin and cathelicidin mature peptides

Avian cathelicidins and the “charge balance hypothesis”

In mammalian α-defensins, Michaelson et al. [43] proposed that the anionic propiece plays a role in preventing autocytotoxicity by neutralising the cationicity of the mature peptide. The authors demonstrated a linear relationship between the net negative charge of the propiece and the positive charge of the mature peptide of seven α-defensins (two human and five rabbit genes). Hughes and Yeager [4, 44] provided further supportive evidence of this relationship (r = −0.742; p < 0.001) using 28 α-defensins from five mammalian species (mouse, rat, guinea pig, rabbit and human). This “charge balance hypothesis” is unlikely to apply to the β-defensins due to the short length of the propiece (0–7 amino acids in AvBDs). However, we explored the hypothesis in avian cathelicidins and revealed a similar, though weaker (r = −0.38; p = 0.03) association between the electrostatic charges of the propiece and mature peptides (Fig. 6a). All annotated avian cathelicidins with an intact coding sequence were used in the analysis (n = 24). While the mature form of all cathelicidins is highly cationic (8.51 ± 0.61), the propiece has an anionic character (−5.24 ± 1.13). The inactive form of CATHL2 before secretion has the lowest mean net charge (0.81 ± 0.62) among avian cathelicidins (overall mean = 3.27 ± 1.06), suggesting a better neutralising effect of the propiece in this subfamily. It should be noted that changes in cytoplasmic pH also affect the electrostatic charges of peptides, with higher pH resulting in lower charges (e.g. at pH 7.5, overall mean charge of presecretory avian cathelicidins = 2.79 ± 1.05). Similar level of correlation (r = −0.38; p = 0.02) was also found in 28 primate cathelicidin genes (Fig. 6a; accession numbers of primate genes listed in Additional file 6).

Fig. 6

Relationship between net charges of the propiece and mature peptide in avian and primate cathelicidins. a Separate linear regression lines were drawn for avian (y = −0.70x + 0.73; r = −0.38; p = 0.03) and primate CAMP (y = −0.32x − 0.18; r = −0.38; p = 0.02) genes. b Changes in the net charge of the propiece and mature peptide, as inferred to have occurred during evolution based on ancestral sequence reconstruction. Separate linear regression lines were drawn for avian (y = −0.42x + 0.17; r = −0.35; p = 0.01) and primate (y = −0.30x − 0.01; r = −0.36; p = 0.01) data points

Further analysis was performed on reconstructed ancestral sequences to infer changes that may have occurred in avian and primate cathelicidins over evolutionary time (Fig. 6b, Additional file 7) [44]. A significant negative correlation was observed between electrostatic property changes in the propiece and mature peptide in both groups (aves: r = −0.35, p = 0.01; primate: r = −0.36, p = 0.01). When substitutions increasing peptide charge occurred in the mature cathelicidin, charge in the propiece tended to either decrease or remain unchanged, whereas when the mature peptide became less cationic, the propiece tended to become less anionic. This is highly similar to what Hughes and Yeager observed in mammalian defensins [44].

Moreover, evidence of intra-molecular amino acid residue co-evolution was detected between two pairs of sites with electrostatic properties in the propiece and mature peptide domains of CATHB1 (highlighted in Additional file 3). Residue 122L/G/E/K and 226G/K/R, and 160R/Q/P and 222E/D/G/N showed 96.9 and 80.2 % probability of having been co-evolving, respectively (residue positions are based on chicken CATHB1).

These observations indicate that, similar to mammalian α-defensins, the propiece and mature peptide of avian and primate cathelicidins may have co-evolved in such a way that amino acid substitutions in both regions are selected and accumulated to balance the charge. Further experimental evidence will be needed to validate this hypothesis and elucidate the role of cathelicidin propiece in preventing autocytotoxicity.

Amino acid sites under selection

Negative selection was detected in a large proportion of amino acid sites in the examined avian β-defensin (11.4–40.9 %) and cathelicidin (9.7–24.7 %) genes (Fig. 7a, Additional file 3). In AvBD1, 2, 3, 8, 9, 11, 12 and 13, the overall mean rate of nonsynonymous nucleotide substitutions (dN) was significantly (at 0.05 nominal level) lower than that of synonymous substitutions (dS) in both mature peptide and the signal domains, suggesting an overall effect of negative selection on these genes (Fig. 7b, Table 3). In the other five AvBDs significant negative selection was detected only in the mature peptide, whereas the opposite pattern was observed in the avian cathelicidin genes with only the signal and propiece domain showing significantly higher dS than dN.

Fig. 7

Natural selection in avian β-defensins, cathelicidins, and three primate genes (DEFB1, DEFB4 and CAMP). a Proportion of codon sites under positive and negative selection in each gene; b Comparison between mean rates of synonymous and nonsynonymous substitutions (dN − dS) in the signal and propiece region and mature peptide domain

Table 3 Mean rates of synonymous (dS ± S.E.) and nonsynonymous (dN ± S.E.) nucleotide substitutions and test for overall selection in β defensin and cathelicidin genes

Despite the strong background of negative selection, evidence indicating specific amino acid sites subject to episodic diversifying selection was found in most studied avian genes (Fig. 8), though the effect of such selection was very weak in AvBD9, AvBD11 and AvBD13 and the three cathelicidins, with less than 4 % of total sites inferred to be positively selected. To reduce chances of false positive detections, only codon sites that were detected by multiple selection test methods were considered significant (see Methods section) [45, 46]. In the examined avian AMPs, most positively selected sites were found inside the mature peptide domain, except for AvBD5, AvBD13 and AvBD14 and all cathelicidins, in which more were found in the signal and propiece regions (Figs. 7a and 8). Within mature defensins, diversifying selection appears to mainly (68.3 % cases) affect those residues that are close to (within two residues) the conserved cysteines. For comparison, same analyses were performed on two primate β-defensin subfamilies, DEFB1 (n = 24) and DEFB4 (n = 10), and the primate cathelicidin CAMP (n = 28) (gene accession numbers provided in Additional file 6). Consistent with previous reports, our results suggested that positive selection has involved in driving the evolution of primate DEFB4 (coding for β-defensin 2) [47], whereas DEFB1 is highly conserved with no evidence of diversifying selection in primate lineages [48]. Taken together both positive and negative selection, AvBDs appear to have evolved under higher selective pressures and restraint, while the two examined mammalian β-defensins have evolved more neutrally in primates (Fig. 7a). Contrary to what was observed in avian cathelicidins, in primate CAMP a majority (58.3 %) of positively selected residues are located inside the mature peptide, resulting in a significantly higher overall dN than dS in the gene domain (Fig. 7).

Fig. 8

Positively selected sites in avian β-defensins and cathelicidins. Site-wise rates of synonymous and nonsynonymous substitutions (dN − dS) calculated using FEL are shown (cutoff at 5 for large |dN − dS|). Chicken genes are shown as reference sequence; dashes indicate gaps introduced during alignment with orthologues in other species; mature peptide domains are indicated by grey graph background. Sites under positive selection are indicated by plus signs

Due to the important role of electrostatic charge on antimicrobial potency of the mature peptide, positively charged amino acid residues are expected to have evolved under positive selection. However, in the examined AvBDs and cathelicidins, a number of such sites showed strong negative selection or neutral results. One possible explanation is that these cationic residues may have been selectively accumulated prior to the divergence of the studied bird lineages (~114 million years ago), and then have been conserved by negative selection through the long evolutionary history due to their significant role on peptide function. Further studies to include more distant taxa (such as reptiles) will help elucidate the evolutionary dynamics of the peptides.

Lineages subject to episodic diversifying selection

Evolutionary branches that were indicated to have experienced diversifying selection episodes at each gene are highlighted in Additional file 8. On average, 20.9 and 26.6 % of nodes over the course of evolution of AvBDs and cathelicidins, respectively, were detected with episodes of positive selection, with the highest percentage observed in AvBD7 (41.4 %) and lowest in AvBD13 (7.1 %). The later stages of evolution within avian lineages seem to have involved relatively stronger diversifying selection as compared to the more ancient branches, which is consistent with the observation that the examined gene families are generally well conserved across all avian orders and families. This result contradicts to what was previously detected in mammalian, particularly primate β-defensins, which involved more positive selection episodes in more ancient branches due to duplication and diversification of β-defensins in the early stages of mammalian evolution [5].

Several evolutionary nodes were detected to have undergone diversifying selection at multiple genes (Additional file 8). For example, AvBD5, 7, 10, 12 and 14 of Galliformes were inferred to be under diversifying selection between 84.5 and 37.9 million years before present (evolutionary time estimated based on branch length in Fig. 1), prior to the divergence between family Phasianidae (chicken and turkey) and Odontophoridae (northern bobwhite Colinus virginianus). Within the order Psittaciformes, AvBD3, 4 and 8 were estimated to have been subject to episodic positive selection between 79.8 and 55.3 million years ago before the divergence of the Puerto Rican parrot (Amazona vittata), budgerigar (Melopsittacus undulatus) and kea (Nestor notabilis), all belonging to the Psittacidae family. Similarly, positive selection appeared to have been involved in driving the diversification of AvBD1, 3, 4, 5, 8, 10 and 12 during the early radiations of Passeriformes. For each species, the number of genes indicated to have been affected by episodic diversifying selection is shown in Fig. 1.


In this study, we investigated the evolution of β-defensins and cathelicidins in 53 bird species. Both gene families form a generally conserved gene cluster in avian genomes with certain genes being more prone to duplication (AvBD1, AvBD3 and AvBD7) or pseudogenisation (AvBD14) events. Intense negative selection was detected in a majority of examined gene domains, likely accounting for the conservation of certain amino acid residues that are essential for the functioning of β-defensins and cathelicidins in birds. Evidence indicated that episodic positive selection also played a role in driving the diversification of specific residues of certain antimicrobial peptides in avian evolutionary history, contributing to high variability of gene sequences and electrostatic property of the peptides. Our results also revealed that selection may have acted on cathelicidins to maintain a balanced charge between the anionic propiece and cationic mature peptide over evolutionary time. This work not only has greatly improved our understanding of the molecular evolution of these host defense peptides, but also provides a valuable resource for potential translational research and development of novel antimicrobial agents.


Database search and gene nomenclature

Fifty-three bird genomes were searched for β-defensin and cathelicidin genes (GenBank Assembly IDs provided in Table 1). For each genome, four steps were taken to identify genes of interest: 1) An initial search was performed on a predicted protein/CDS database with BLAST programs using chicken genes as query sequences. Hits with E-value <0.1 were extracted from the database, aligned to chicken sequences with ClustalW [49], and manually examined to exclude false positives. 2) Then for each gene, a profile hidden Markov model (HMM) [50] was built from a peptide sequence alignment that includes all orthologues found in the previous step. The profile HMMs were searched against the predicted protein databases with HMMER3.1 programs [51] on a GALAXY platform. Hits with both E-values (full sequence and best 1 domain) <1 were extracted, aligned with previously found sequences, and manually checked to confirm real homologues. 3) Sequences identified in the previous steps were then used to BLAST search the whole genome to find any genes, gene fragments, or pseudogenes that are not included in the protein/CDS database. 4) All genomic scaffolds and contigs containing β-defensin or cathelicidin genes were extracted to study the genomic organisation of these genes. In addition, sequences containing CTSB, TRAM2, KLHL18 or TBRG4, the flanking genes of the β-defensin or cathelicidin gene clusters, were also extracted. Scaffolds and contigs were manually curated using Artemis [52].

Annotated sequences were named by tagging the gene name with a five-letter abbreviation as a suffix that distinguishes the species. For example, AvBD1_ACACH refers to gene AvBD1 of the rifleman (Aca nthisitta ch loris). Duplications of AvBD1 and AvBD3 were numbered 1.n and 3.n from AvBD2. Some AvBD1 and AvBD3 duplicates were found on isolated scaffolds and numbering of these defensins may not represent the actual position in the intact cluster. Previously used identification references for zebra finch duplicates [9] are included in the identification.

Mature peptide prediction and net charge estimation

Sequence features, such as signal peptide, propiece and mature peptide, within annotated genes were speculated based on functional domains in chicken defensins [10] and cathelicidins [30, 31, 33, 34]. The net charge of mature peptide was estimated as \( {\displaystyle \sum_i}{N}_i\frac{1}{1+{10}^{pH-pK{a}_i}}-{\displaystyle \sum_j}{N}_j\frac{1}{1+{10}^{pK{a}_j-pH}} \) with N i and pKa i being the number and pKa values of histidine (H), lysine (K) and arginine (R) residues and the N-terminus, and N j and pKa j the number and pKa of aspartic acid (D), glutamic acid (E), cysteine (C) and tyrosine (Y) residues and the C-terminus [53]. Lehninger’s set of pKa values were used [54] and intramolecular disulfide bond formation was taken into account in calculation.

Evolutionary analyses

Overall phylogenetic analyses between all avian β-defensins and cathelicidins (Fig. 2) were conducted in MEGA5 with a Maximum Likelihood method [55, 56] using four discrete categories for the Gamma distribution to model evolutionary rate differences among sites and 100 bootstrap replicates to infer the level of confidence on the phylogeny (support values lower than 40 % are not shown in the consensus tree) [57].

A range of evolutionary analyses were performed on each β-defensin and cathelicidin gene via the Datamonkey webserver [58], including: 1) Negative selection sites were detected using Fixed Effect Likelihood (FEL) [59] and Fast Unconstrained Bayesian Approximation for Inferring Selection (FUBAR) [60]. 2) Individual sites under positive selection were detected using three test methods, including Mixed Effects Model of Evolution (MEME) [61], FEL, and FUBAR. Codon sites found to be significant for positive or negative selection by more than two methods (MEME p < 0.05, FEL p < 0.1, and FUBAR posterior probability >0.9) were included in the analyses [45, 46]. 3) Individual branches with episodic diversifying selection were inferred by combining results from two analyses–MEME (emprical bayes factor >20) and branch-site REL (p < 0.05) [62]. 4) Intramolecular co-evolution of amino acid sites in cathelicidins were detected using the Spidermonkey/Bayesian Graphical Model [63]. Only sites that are involved in electrostatic properties and have more than three branches with nonsynonymous substitutions were included in the analysis for covariation. Assessment of overall mean rates of nonsynonymous (dN) and synonymous (dS) nucleotide substitutions in the signal and propiece region and the mature peptide domain, and the significance test of overall selection were calculated in MEGA5 [55] using the Kumar model [64] and 1000 bootstrap replicates to estimate standard errors.

Ancestral cathelicidin sequences were inferred using the Maximum Likelihood method [64] under the Whelan And Goldman model [65] in MEGA5. Rates among sites were treated as a Gamma distribution using five Gamma categories.


No human, human data, or animal was used in this study.

Availability of supporting data

GenBank assembly IDs of all 53 examined avian genomes are provided in Table 1. Scaffold and coordinates information of annotated genes are detailed in Additional file 1. Nucleotide sequences of all annotated genes are available in Additional file 2. Full-length amino acid sequence alignments and putative mature peptide sequence of each gene are provided in Additional files 3 and 4, respectively. GenBank accession numbers of primate genes used in analyses are listed in Additional file 6.



Antimicrobial peptide


Avian β-defensin; Abbreviations for species names are listed in Table 1


  1. 1.

    Cuperus T, Coorens M, van Dijk A, Haagsman HP. Avian host defense peptides. Dev Comp Immunol. 2013;41:352–69.

    CAS  Article  PubMed  Google Scholar 

  2. 2.

    van Dijk A, Molhoek EM, Bikker FJ, Yu PL, Veldhuizen EJA, Haagsman HP. Avian cathelicidins: paradigms for the development of anti-infectives. Vet Microbiol. 2011;153:27–36.

    Article  PubMed  Google Scholar 

  3. 3.

    Semple F, Dorin JR. β-defensins: multifunctional modulators of infection, inflammation and more? Journal of Innate Immunity. 2012;4:337–48.

    CAS  Article  PubMed  Google Scholar 

  4. 4.

    Hughes AL. Evolutionary diversification of the mammalian defensins. CMLS, Cell Mol Life Sci. 1999;56:94–103.

    CAS  Article  PubMed  Google Scholar 

  5. 5.

    Semple CAM, Taylor K, Eastwood H, Barran PE, Dorin JR. b-defensin evolution: selection complexity and clues for residues of functional importance. Biochem Soc Trans. 2006;34:257–62.

    CAS  Article  PubMed  Google Scholar 

  6. 6.

    Jarvis ED, Mirarab S, Aberer AJ, Li B, Houde P, Li C, et al. Whole-genome analyses resolve early branches in the tree of life of modern birds. Science. 2014;346:1320–31.

    PubMed Central  CAS  Article  PubMed  Google Scholar 

  7. 7.

    Higgs R, Lynn D, Cahalane S, Alaña I, Hewage C, James T, et al. Modification of chicken avian β-defensin-8 at positively selected amino acid sites enhances specific antimicrobial activity. Immunogenetics. 2007;59:573–80.

    CAS  Article  PubMed  Google Scholar 

  8. 8.

    van der Does AM, Bergman P, Agerberth B, Lindbom L. Induction of the human cathelicidin LL-37 as a novel treatment against bacterial infections. J Leukoc Biol. 2012;92:735–42.

    Article  PubMed  Google Scholar 

  9. 9.

    Hellgren O, Ekblom R. Evolution of a cluster of innate immune genes (beta-defensins) along the ancestral lines of chicken and zebra finch. Immunome Research. 2010;6:1745–7580.

    Article  Google Scholar 

  10. 10.

    Xiao Y, Hughes A, Ando J, Matsuda Y, Cheng J-F, Skinner-Noble D, et al. A genome-wide screen identifies a single beta-defensin gene cluster in the chicken: implications for the origin and evolution of mammalian defensins. BMC Genomics. 2004;5:56.

    PubMed Central  Article  PubMed  Google Scholar 

  11. 11.

    Huang Y, Li Y, Burt DW, Chen H, Zhang Y, Qian W, et al. The duck genome and transcriptome provide insight into an avian influenza virus reservoir species. Nat Genet. 2013;45:776–83.

    PubMed Central  CAS  Article  PubMed  Google Scholar 

  12. 12.

    Lynn DJ, Higgs R, Lloyd AT, O’Farrelly C, Hervé-Grépinet V, Nys Y, et al. Avian beta-defensin nomenclature: a community proposed update. Immunol Lett. 2007;110:86–9.

    CAS  Article  PubMed  Google Scholar 

  13. 13.

    Lan H, Chen H, Chen L-C, Wang B-B, Sun L, Ma M-Y, et al. The first report of a pelecaniformes defensin cluster: characterization of b-defensin genes in the crested ibis based on BAC libraries. Scientific Reports. 2014;4.

  14. 14.

    van Dijk A, Veldhuizen EJA, Haagsman HP. Avian defensins. Vet Immunol Immunopathol. 2008;124:1–18.

    Article  PubMed  Google Scholar 

  15. 15.

    Zhang G, Sunkara L. Avian antimicrobial host defense peptides: from biology to therapeutic applications. Pharmaceuticals. 2014;7:220–47.

    PubMed Central  CAS  Article  PubMed  Google Scholar 

  16. 16.

    Akbari MR, Haghighi HR, Chambers JR, Brisbin J, Read LR, Sharif S. Expression of antimicrobial peptides in cecal tonsils of chickens treated with probiotics and infected with salmonella enterica serovar typhimurium. Clin Vaccine Immunol. 2008;15:1689–93.

    PubMed Central  CAS  Article  PubMed  Google Scholar 

  17. 17.

    Crhanova M, Hradecka H, Faldynova M, Matulova M, Havlickova H, Sisak F, et al. Immune response of chicken gut to natural colonization by gut microflora and to salmonella enterica serovar enteritidis infection. Infect Immun. 2011;79:2755–63.

    PubMed Central  CAS  Article  PubMed  Google Scholar 

  18. 18.

    Derache C, Esnault E, Bonsergent C, Le Vern Y, Quéré P, Lalmanach A-C. Differential modulation of β-defensin gene expression by Salmonella Enteritidis in intestinal epithelial cells from resistant and susceptible chicken inbred lines. Dev Comp Immunol. 2009;33:959–66.

    CAS  Article  PubMed  Google Scholar 

  19. 19.

    Hong YH, Song W, Lee SH, Lillehoj HS. Differential gene expression profiles of β-defensins in the crop, intestine, and spleen using a necrotic enteritis model in 2 commercial broiler chicken lines. Poult Sci. 2012;91:1081–8.

    CAS  Article  PubMed  Google Scholar 

  20. 20.

    Milona P, Townes CL, Bevan RM, Hall J. The chicken host peptides, gallinacins 4, 7, and 9 have antimicrobial activity against Salmonella serovars. Biochem Biophys Res Commun. 2007;356:169–74.

    CAS  Article  PubMed  Google Scholar 

  21. 21.

    van Dijk A, Veldhuizen EJA, Kalkhove SIC, Tjeerdsma-van Bokhoven JLM, Romijn RA, Haagsman HP. The β-defensin gallinacin-6 is expressed in the chicken digestive tract and has antimicrobial activity against food-borne pathogens. Antimicrob Agents Chemother. 2007;51:912–22.

    PubMed Central  Article  PubMed  Google Scholar 

  22. 22.

    Abdel Mageed AM, Isobe N, Yoshimura Y. Immunolocalization of avian β-defensins in the hen oviduct and their changes in the uterus during eggshell formation. Reproduction. 2009;138:971–8.

    CAS  Article  PubMed  Google Scholar 

  23. 23.

    Das SC, Isobe N, Yoshimura Y. Expression of toll-like receptors and avian β-defensins and their changes in response to bacterial components in chicken sperm. Poult Sci. 2011;90:417–25.

    CAS  Article  PubMed  Google Scholar 

  24. 24.

    Hervé-Grépinet V, Réhault-Godbert S, Labas V, Magallon T, Derache C, Lavergne M, et al. Purification and characterization of avian β-defensin 11, an antimicrobial peptide of the hen egg. Antimicrob Agents Chemother. 2010;54:4401–9.

    PubMed Central  Article  PubMed  Google Scholar 

  25. 25.

    Michailidis G, Avdi M, Argiriou A. Transcriptional profiling of antimicrobial peptides avian β-defensins in the chicken ovary during sexual maturation and in response to Salmonella enteritidis infection. Res Vet Sci. 2012;92:60–5.

    CAS  Article  PubMed  Google Scholar 

  26. 26.

    Yoshimura Y, Ohashi H, Subedi K, Nishibori M, Isobe N. Effects of age, egg-laying activity, and Salmonella-inoculation on the expressions of gallinacin mRNA in the vagina of the hen oviduct. J Reprod Dev. 2006;52:211–8.

    CAS  Article  PubMed  Google Scholar 

  27. 27.

    Hellgren O. Allelic variation at innate immune genes (avian β-defensins), within a natural population of great tits. J Avian Biol. 2015;46:113–8.

    Article  Google Scholar 

  28. 28.

    Hellgren O, Sheldon BC. Locus-specific protocol for nine different innate immune genes (antimicrobial peptides: β-defensins) across passerine bird species reveals within-species coding variation and a case of trans-species polymorphisms. Mol Ecol Resour. 2011;11:686–92.

    CAS  Article  PubMed  Google Scholar 

  29. 29.

    Bhunia A, Mohanram H, Bhattacharjya S. Lipopolysaccharide bound structures of the active fragments of fowlicidin-1, a cathelicidin family of antimicrobial and antiendotoxic peptide from chicken, determined by transferred nuclear overhauser effect spectroscopy. Pept Sci. 2009;92:9–22.

    CAS  Article  Google Scholar 

  30. 30.

    Goitsuka R, Chen C-LH, Benyon L, Asano Y, Kitamura D, Cooper MD. Chicken cathelicidin-B1, an antimicrobial guardian at the mucosal M cell gateway. Proc Natl Acad Sci. 2007;104:15063–8.

    PubMed Central  CAS  Article  PubMed  Google Scholar 

  31. 31.

    van Dijk A, Molhoek EM, Veldhuizen EJA, Bokhoven JLMT-V, Wagendorp E, Bikker F, et al. Identification of chicken cathelicidin-2 core elements involved in antibacterial and immunomodulatory activities. Mol Immunol. 2009;46:2465–73.

    Article  PubMed  Google Scholar 

  32. 32.

    Zanetti M. The role of cathelicidins in the innate host defenses of mammals. Current Issues in Molecular Biology. 2005;7:179–96.

    CAS  PubMed  Google Scholar 

  33. 33.

    Bommineni YR, Dai H, Gong Y-X, Soulages JL, Fernando SC, DeSilva U, et al. Fowlicidin-3 is an α-helical cationic host defense peptide with potent antibacterial and lipopolysaccharide-neutralizing activities. FEBS J. 2007;274:418–28.

    CAS  Article  PubMed  Google Scholar 

  34. 34.

    Xiao Y, Dai H, Bommineni YR, Soulages JL, Gong Y-X, Prakash O, et al. Structure–activity relationships of fowlicidin-1, a cathelicidin antimicrobial peptide in chicken. FEBS J. 2006;273:2581–93.

    CAS  Article  PubMed  Google Scholar 

  35. 35.

    Xiao Y, Herrera AI, Bommineni YR, Soulages JL, Prakash O, Zhang G. The central kink region of fowlicidin-2, an α-helical host defense peptide, is critically involved in bacterial killing and endotoxin neutralization. Journal of Innate Immunity. 2009;1:268–80.

    CAS  Article  PubMed  Google Scholar 

  36. 36.

    Achanta M, Sunkara L, Dai G, Bommineni Y, Jiang W, Zhang G. Tissue expression and developmental regulation of chicken cathelicidin antimicrobial peptides. Journal of Animal Science and Biotechnology. 2012;3:15.

    PubMed Central  CAS  Article  PubMed  Google Scholar 

  37. 37.

    Xiao Y, Cai Y, Bommineni YR, Fernando SC, Prakash O, Gilliland SE, et al. Identification and functional characterization of three chicken cathelicidins with potent antimicrobial activity. J Biol Chem. 2006;281:2858–67.

    CAS  Article  PubMed  Google Scholar 

  38. 38.

    Jetz W, Thomas GH, Joy JB, Hartmann K, Mooers AO. The global diversity of birds in space and time. Nature. 2012;491:444–8.

    CAS  Article  PubMed  Google Scholar 

  39. 39.

    Trowsdale J, Parham P. Mini-review: defense strategies and immunity-related genes. Eur J Immunol. 2004;34:7–17.

    CAS  Article  PubMed  Google Scholar 

  40. 40.

    Rached LA, McDermott MF, Pontarotti P. The MHC big bang. Immunol Rev. 1999;167:33–45.

    CAS  Article  Google Scholar 

  41. 41.

    Shai Y. Mode of action of membrane active antimicrobial peptides. Biopolymers. 2002;66:236–48.

    CAS  Article  PubMed  Google Scholar 

  42. 42.

    Klüver E, Schulz-Maronde S, Scheid S, Meyer B, Forssmann W-G, Adermann K. Structure–activity relation of human β-defensin 3: influence of disulfide bonds and cysteine substitution on antimicrobial activity and cytotoxicity. Biochemistry. 2005;44:9804–16.

    Article  PubMed  Google Scholar 

  43. 43.

    Michaelson D, Rayner J, Couto M, Ganz T. Cationic defensins arise from charge-neutralized propeptides: a mechanism for avoiding leukocyte autocytotoxicity? J Leukoc Biol. 1992;51:634–9.

    CAS  PubMed  Google Scholar 

  44. 44.

    Hughes AL, Yeager M. Coordinated amino acid changes in the evolution of mammalian defensins. J Mol Evol. 1997;44:675–82.

    CAS  Article  PubMed  Google Scholar 

  45. 45.

    Grueber CE, Wallis GP, Jamieson IG. Episodic positive selection in the evolution of avian toll-like receptor innate immunity genes. PLoS One. 2014;9, e89632.

    PubMed Central  Article  PubMed  Google Scholar 

  46. 46.

    Azarian T, Lo Presti A, Giovanetti M, Cella E, Rife B, Lai A, et al. Impact of spatial dispersion, evolution, and selection on Ebola Zaire Virus epidemic waves. Scientific Reports. 2015;5:10170.

    PubMed Central  CAS  Article  PubMed  Google Scholar 

  47. 47.

    Boniotto M, Tossi A, DelPero M, Sgubin S, Antcheva N, Santon D, et al. Evolution of the beta defensin 2 gene in primates. Genes and Immunity. 2003;4:251–7.

    CAS  Article  PubMed  Google Scholar 

  48. 48.

    Del Pero M, Boniotto M, Zuccon D, Cervella P, Spanò A, Amoroso A, et al. β-Defensin 1 gene variability among non-human primates. Immunogenetics. 2002;53:907–13.

    Article  PubMed  Google Scholar 

  49. 49.

    Thompson JD, Higgins DG, Gibson TJ. CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position specific gap penalties and weight matrix choice. Nucleic Acids Res. 1994;22:4673–80.

    PubMed Central  CAS  Article  PubMed  Google Scholar 

  50. 50.

    Eddy SR. Profile hidden Markov models. Bioinformatics. 1998;14:755–63.

    CAS  Article  PubMed  Google Scholar 

  51. 51.

    Eddy SR. Accelerated profile HMM searches. PLoS Comput Biol. 2011;7, e1002195.

    PubMed Central  CAS  Article  PubMed  Google Scholar 

  52. 52.

    Carver T, Berriman M, Tivey A, Patel C, Böhme U, Barrell BG, et al. Artemis and ACT: viewing, annotating and comparing sequences stored in a relational database. Bioinformatics. 2008;24:2672–6.

    PubMed Central  CAS  Article  PubMed  Google Scholar 

  53. 53.

    Kozlowski LP. Calculation of protein isoelectric point. Accessed March 2015.

  54. 54.

    Lehninger AL, Nelson DL, Cox MM. Lehninger principles of biochemistry. 6th ed. New York: W.H. Freeman; 2013.

    Google Scholar 

  55. 55.

    Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S. MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol Biol Evol. 2011;28:2731–9.

    PubMed Central  CAS  Article  PubMed  Google Scholar 

  56. 56.

    Hasegawa M, Fujiwara M. Relative efficiencies of the maximum likelihood, maximum parsimony, and neighbor-joining methods for estimating protein phylogeny. Mol Phylogenet Evol. 1993;2:1–5.

    Article  PubMed  Google Scholar 

  57. 57.

    Felsenstein J. Confidence limits on phylogenies: an approach using the bootstrap. Evolution. 1985;39:783–91.

    Article  Google Scholar 

  58. 58.

    Delport W, Poon AFY, Frost SDW, Kosakovsky Pond SL. Datamonkey 2010: a suite of phylogenetic analysis tools for evolutionary biology. Bioinformatics. 2010;26:2455–7.

    PubMed Central  CAS  Article  PubMed  Google Scholar 

  59. 59.

    Kosakovsky Pond SL, Frost SDW. Not So different after All: a comparison of methods for detecting amino acid sites under selection. Mol Biol Evol. 2005;22:1208–22.

    Article  PubMed  Google Scholar 

  60. 60.

    Murrell B, Moola S, Mabona A, Weighill T, Sheward D, Kosakovsky Pond SL, et al. FUBAR: a fast, unconstrained bayesian approximation for inferring selection. Mol Biol Evol. 2013;30:1196–205.

    PubMed Central  CAS  Article  PubMed  Google Scholar 

  61. 61.

    Murrell B, Wertheim JO, Moola S, Weighill T, Scheffler K, Kosakovsky Pond SL. Detecting individual sites subject to episodic diversifying selection. PLoS Genet. 2012;8, e1002764.

    PubMed Central  CAS  Article  PubMed  Google Scholar 

  62. 62.

    Kosakovsky Pond SL, Murrell B, Fourment M, Frost SDW, Delport W, Scheffler K. A random effects branch-site model for detecting episodic diversifying selection. Mol Biol Evol. 2011.

  63. 63.

    Poon AFY, Lewis FI, Pond SLK, Frost SDW. An evolutionary-network model reveals stratified interactions in the V3 loop of the HIV-1 envelope. PLoS Comput Biol. 2007;3, e231.

    PubMed Central  Article  PubMed  Google Scholar 

  64. 64.

    Nei M, Kumar S. Molecular evolution and phylogenetics. New York: Oxford University Press; 2000.

    Google Scholar 

  65. 65.

    Whelan S, Goldman N. A general empirical model of protein evolution derived from multiple protein families using a maximum-likelihood approach. Mol Biol Evol. 2001;18:691–9.

    CAS  Article  PubMed  Google Scholar 

  66. 66.

    Schneider TD, Stephens RM. Sequence logos: a new way to display consensus. Nucleic Acids Res. 1990;18:6097–100.

    PubMed Central  CAS  Article  PubMed  Google Scholar 

  67. 67.

    Beitz E. TEXshade: shading and labeling multiple sequence alignments using LATEX2e. Bioinformatics. 2000;16:135–9.

    CAS  Article  PubMed  Google Scholar 

Download references


DWB and WG are supported by Biotechnology and Biological Science Research Council (BBSRC), UK. RK is supported by the Wellcome Trust, UK. YC and KB are supported by the Australian Research Council (ARC).

Author information



Corresponding authors

Correspondence to Yuanyuan Cheng or David W. Burt.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

DWB and KB had the idea, guided and supervised analyses and revised drafts. YC and MDP both carried out gene annotation, data analyses and draft writing. YC also prepared figures, Table 3 and additional files. MDP also made Table 1 and 2 and proof checked the figures. WG generated protein/CDS sequences from the genomes and prepared databases for BLAST and HMMER programs. RK set up the GALAXY platform, built HMM sequence models, and optimised workflows for database searches.

Yuanyuan Cheng and Michael Dennis Prickett contributed equally to this work.

Additional files

Additional file 1:

List of annotated β-defensin and cathelicidin genes in 53 avian species with scaffold and coordinates information. (XLSX 47 kb)

Additional file 2:

Nucleotide sequences of annotated avian β-defensin and cathelicidin genes (in fasta format). (FAS 204 kb)

Additional file 3:

Sequence alignments of avian β-denfensins and cathelicidins. The ruler is based on residue positions of chicken sequences. Dots represent identical residues in comparison with the chicken sequence in the top; dashes are gaps or missing data. Functionally significant conserved residues are boxed. Plus and minus signs on top of the alignment indicate sites under positive and negative selection, respectively. Letters ‘a’ and ‘b’ above the CATHB1 alignment indicate two pairs of amino acid residues with evidence of intra-molecular co-evolution. A sequence logo was generated at the bottom of the alignment to show the frequency of certain amino acid substitution at a position, indicated by the height of the letter [66]. The labelled alignments and sequence logo were produced with TEXshade [67]. (PDF 1538 kb)

Additional file 4:

Putative mature peptide sequences from annotated avian β-defensin and cathelicidin genes (in fasta format). (FAS 42 kb)

Additional file 5:

Net charge of putative avian β-defensin and cathelicidin mature peptides. (XLSX 30 kb)

Additional file 6:

List of primate genes used in analyses with Genbank accession numbers. (XLSX 11 kb)

Additional file 7:

Reconstructed avian and primate ancestral cathelicidin sequences and inferred charge changes between nodes. (PDF 299 kb)

Additional file 8:

Phylogenetic trees of avian β-defensin and cathelicidin genes. Branches with evidence suggesting episodic diversifying selection are highlighted in red. (PDF 1444 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Cheng, Y., Prickett, M.D., Gutowska, W. et al. Evolution of the avian β-defensin and cathelicidin genes. BMC Evol Biol 15, 188 (2015).

Download citation


  • Amino Acid Site
  • Mature Peptide
  • Emperor Penguin
  • Avian Genome
  • Fixed Effect Likelihood