- Research article
- Open Access
Evolutionary genomics of LysM genes in land plants
© Zhang et al; licensee BioMed Central Ltd. 2009
- Received: 23 March 2009
- Accepted: 3 August 2009
- Published: 3 August 2009
The ubiquitous LysM motif recognizes peptidoglycan, chitooligosaccharides (chitin) and, presumably, other structurally-related oligosaccharides. LysM-containing proteins were first shown to be involved in bacterial cell wall degradation and, more recently, were implicated in perceiving chitin (one of the established pathogen-associated molecular patterns) and lipo-chitin (nodulation factors) in flowering plants. However, the majority of LysM genes in plants remain functionally uncharacterized and the evolutionary history of complex LysM genes remains elusive.
We show that LysM-containing proteins display a wide range of complex domain architectures. However, only a simple core architecture is conserved across kingdoms. Each individual kingdom appears to have evolved a distinct array of domain architectures. We show that early plant lineages acquired four characteristic architectures and progressively lost several primitive architectures. We report plant LysM phylogenies and associated gene, protein and genomic features, and infer the relative timing of duplications of LYK genes.
We report a domain architecture catalogue of LysM proteins across all kingdoms. The unique pattern of LysM protein domain architectures indicates the presence of distinctive evolutionary paths in individual kingdoms. We describe a comparative and evolutionary genomics study of LysM genes in the plant kingdom. One of the two groups of tandemly arrayed plant LYK genes likely resulted from an ancient genome duplication followed by local genomic rearrangement, while the origin of the other group of tandemly arrayed LYK genes remains obscure. Given that no animal LysM motif-containing genes have been functionally characterized, this study provides clues to the functional characterization of plant LysM genes and is also informative for evolutionary and functional studies of animal LysM genes.
- Genome Duplication
- Domain Architecture
- Nodulation Factor
- Angiosperm Plant
The Lysin motif (LysM), usually 42–48 amino acids in length, is a ubiquitous modular cassette found in virtually every living organism except for Archaea [1, 2]. X-ray crystallography and homology modeling of LysM motifs revealed a symmetrical βααβ topology with the two α helices residing on one side of a two-stranded antiparallel β-sheet [1, 3, 4]. A detailed domain folding study indicated that the LysM domain folds in a stepwise and robust manner. In bacteria, LysM proteins are generally involved in bacterial cell wall degradation by anchoring enzymatic domains to the cell wall through binding to peptidoglycan (PGN), a linear polymer of alternating β1→4 linked N-acetyl-muramic acid and N-acetyl-glucosamine (GlcNAc) [6, 7]. A LysM-containing receptor-like protein (LYP) lacking a kinase domain, CEBiP, was shown in rice to biochemically bind chitin, a β1→4 linked homopolymer of N-acetylglucosamine. In Arabidopsis, LysM-containing receptor-like kinases (LYKs) were genetically defined as receptors for chitin [9, 10]. Chitin is a major component of the fungal cell wall and an established pathogen-associated molecular pattern (PAMP). Hence, these LYKs were implicated in the plant defense response to fungal pathogens. In contrast, other LYKs (e.g., NFR1, NFR5) were genetically defined as receptors for the rhizobial nodulation factor [11–13], an acylated, substituted chitin oligomer. In this case, these receptors are coupled to a complex plant developmental pathway leading to the formation of a novel organ, the nodule, in which nitrogen-fixing, symbiotic bacteria reside. Although no direct evidence exists in vertebrates, the LysM domain is presumably capable of binding peptidoglycan, chitin and structurally-related oligosaccharide molecules in these organisms.
In a previous study, we categorized LysM motifs across kingdoms into a minimum of 11 distinctive types. The phylogenetic gene family topology based on LysM motif sequences revealed several multiple-kingdom clades. Interestingly, several bacterial-rooted LysM clades contained sequences from fungi, insects, plants and animals. This suggested that some LysM motif patterns may have origins predating the divergence of fungal, plant and animal kingdoms, while other LysM motifs may have originated in a convergent manner. Besides the great diversity among LysM motifs, the numbers of LysM motifs within individual LysM proteins range from one to twelve. Moreover, LysM motifs are often associated with a diversity of other protein domains. The diversity of the LysM motifs and associated domains makes it possible to create a catalog of distinguishable LysM protein domain architectures. In the Pfam database, LysM motifs are associated with over 50 domains and LysM proteins display 241 domain architectures (http://pfam.sanger.ac.uk/family?acc=PF01476). Although the LysM domain is associated with a variety of protein domains across kingdoms, it is intriguing that some associations are distinct to particular kingdoms. For example, LYK proteins are only present in plants [1, 2]. Nevertheless, the evolutionary dynamics of LysM protein architectures in individual kingdoms, especially the plant kingdom, have not been clearly defined.
The last six years have seen a great leap forward in our understanding of the biological functions of plant LysM proteins. At the same time, a large number of uncharacterized plant LysM-protein sequences were deposited in public databases. Little is known regarding the origins, evolution, common gene and protein features, and comparative genomics of these plant LysM genes. A multiple-dimension "atlas" (within and across species, considering phylogenetic context) of plant LysM genes is needed to better understand this important gene family in the entire plant kingdom.
The ubiquity of LysM genes across kingdoms, especially within the plant kingdom, makes it an appropriate candidate gene family to study the evolution and impact of polyploidy on plant genomes. The genomes of soybean (Glycine max) and poplar (Populus trichocarpa) have a greater number of LYK genes than Arabidopsis, rice and Medicago truncatula, probably due to the influence of an additional genome duplication in each of the Glycine and Populus genera. Tandemly arrayed LYK gene pairs were identified in legume and poplar plants, but not in rice, suggesting that these tandem gene duplications or local genome rearrangements occurred only in certain plant lineages.
In this study, we report a domain architecture catalogue of LysM proteins across kingdoms and reconstruct evolutionary scenarios of LysM genes in the plant kingdom. While a simple core domain architecture of LysM proteins is conserved across kingdoms, each individual kingdom has a unique array of domain architectures. Compared to green algae such as Chlamydomonas, the moss Physcomitrella possesses a more diverse set of LysM proteins, including LYK, LYP, and LysM-containing F-box proteins. More importantly, these LysM architectures persist throughout all major plant lineages. In contrast, gymnosperms appear to have lost diversity among the LysM proteins. We calculated majority-ruled parsimony phylogenies of plant LysM genes and present associated gene, protein and genomic features along with the phylogenies. We also investigated the relative timing and patterns of large-scale duplications of LYK genes. Our analysis shows that a block of tandemly arrayed LYK genes likely resulted from an ancient genome duplication followed by local genome rearrangements. This study will provide clues to functional characterization of plant LysM genes and be informative to functional studies of animal LysM genes.
LysM domain architectures across kingdoms
Besides LYK, we identified other architectures that are unique to and common in the plant kingdom (Figure 1). The F-box+LysM combination is found not only in mosses and angiosperm plants but also in green algae. LYP and extracellular LysM (LysMe) are the other two common architectures that are unique to plants. Interestingly, angiosperm plants have apparently lost a few architectures present in green algae and mosses, such as peptidase+LysM, C2+LysM, LysM+B3, and LysM+thioredoxin (Figure 1). This suggests that the overall diversity of LysM protein architectures has decreased in angiosperm plants.
The copy numbers of intracellular non-secretory LysM genes (LysMn) are similar in the ten plant genomes we examined. The same holds true for F-box+LysM genes and LYP genes (Figure 1). In particular, the F-box+LysM gene appears to be single-copy in these plant genomes, except for poplar and soybean, each with two copies (Figure 1). In contrast, there is clearly an expansion of the LYK gene family in virtually all genomes examined, and of the LysMe genes in the poplar and soybean genomes. The LYK gene family has expanded in the angiosperms; for example, there are 2 LYKs in Physcomitrella (bryophytes), 5 LYKs in Selaginella (lycophytes), and 21 LYKs in Glycine max (Figure 1).
Phylogenies and comparative genomics of LysM genes
To better illustrate the relationships among the various LYK proteins, Figure 2 shows the gene and genomic features of each clade in conjunction with the phylogenetic tree. Most LYKs in subclade IA have a conserved domain architecture, with 3 LysM motifs and a non-functional kinase domain lacking the P-loop and the activation motif. Similarly, clade V contains only one LysM motif. Most LYKs in clades I and II have a simple gene structure with only one exon, while LYKs in clades V and VI possess a complex gene structure with more than 10 exons. However, most of these introns are embedded in the region encoding the kinase domain, while the LysM domains are encoded by an uninterrupted exon (data not shown). Interestingly, 28 out of 79 LYKs in this study are tandemly arrayed in several genomes. That is, 35% of LYKs are tandemly arrayed, which is higher than the average level of tandemly arrayed genes in Arabidopsis (~16%), rice (~14%) and poplar (11%) [14–16]. Subclade IA LYKs are oriented head-to-head with LYKs in subclade IIA, while LYKs in subclade VIA are oriented in a head-to-tail manner with LYKs in subclade VIB. However, tandemly arrayed LYKs were not identified in monocots, suggesting that tandem duplication of these genes may have occurred after the split of monocot and dicot plants.
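The orientation terms used for tandemly arrayed genes can be made concrete with a short sketch. The Python function below is an illustration only: it assumes the usual convention that the "head" of a gene is its 5' end and that the two genes are given in order of increasing chromosomal coordinate; the function name and strand encoding ("+" and "-") are hypothetical and are not taken from the analysis pipeline of this study.

```python
def pair_orientation(first_strand, second_strand):
    """Classify two adjacent genes by their relative orientation.

    first_strand / second_strand: "+" or "-" for the gene with the lower
    and the higher chromosomal coordinate, respectively (assumed encoding).
    The "head" of a gene is taken to be its 5' end.
    """
    valid = {"+", "-"}
    if first_strand not in valid or second_strand not in valid:
        raise ValueError("strands must be '+' or '-'")
    if first_strand == second_strand:
        # Both genes are transcribed in the same direction.
        return "head-to-tail"
    if first_strand == "-" and second_strand == "+":
        # 5' ends face each other (divergent transcription).
        return "head-to-head"
    # Remaining case: "+" then "-", 3' ends face each other.
    return "tail-to-tail"


if __name__ == "__main__":
    for strands in (("-", "+"), ("+", "+"), ("+", "-")):
        print(strands, pair_orientation(*strands))
```

Under this convention, a subclade IA gene paired head-to-head with a subclade IIA gene would lie on opposite strands with the 5' ends adjacent, whereas the VIA and VIB genes would share a strand.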
Microsynteny of orthologous LysM genes
LYK gene duplications and genome polyploidy
It has been estimated that most, if not all, flowering plants have one or more genome duplication events in their history [17–19]. The great expansion of the LYK gene family (Figures 1 and 2) is likely due to two rounds of independent genome doubling events followed by functional speciation. The LYK phylogenetic topology clearly reveals several rounds of gene duplications, especially the duplication event giving rise to clades I and II (since the subclades IA and IIA are tandemly arrayed in their genomes). However, little is known regarding the timing of these duplications. To better understand the plant genome doubling events, we inferred the relative chronological order of these LYK duplication events by measuring the synonymous nucleotide substitutions per site (Ks) of duplicated genes, which is a common practice in dating gene duplication events [18–23].
We noticed 16 pairs of soybean LysM genes from the four phylogenetic topologies (Figures 2, 3, 4 and 5) that are highly similar at the protein sequence level (see Additional file 1). These likely represent homeologous gene pairs resulting from the most recent genome doubling event in soybean. The pairwise synonymous distances between these homeologous pairs fall within 0–0.40 and have a peak at 0.13 ± 0.03 (Figure 7d). This is close to the value of 0.188 reported by Schlueter et al. (2005). Assuming a rate of 6.1 synonymous substitutions per site per billion years, the values for the LysM genes place this duplication event at approximately 14 mya, which is consistent with previous studies [19, 23, 25].
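The step from a synonymous distance to an approximate duplication age can be sketched as follows. This minimal Python example assumes the widely used molecular-clock relation T = Ks / (2r), where r is the synonymous substitution rate per site per year and the factor of two reflects that both duplicates accumulate substitutions independently. The function name, the default rate expressed as 6.1e-9 per site per year, and the use of this exact relation are assumptions of the illustration; the text does not spell out the calculation, so the output should not be expected to reproduce the reported date exactly.

```python
def ks_to_age_mya(ks, rate_per_site_per_year=6.1e-9):
    """Convert a pairwise synonymous distance (Ks) into an approximate age.

    Assumes a constant molecular clock and T = Ks / (2 * r); returns the
    age in millions of years (mya).
    """
    if ks < 0:
        raise ValueError("Ks must be non-negative")
    if rate_per_site_per_year <= 0:
        raise ValueError("rate must be positive")
    years = ks / (2.0 * rate_per_site_per_year)
    return years / 1e6


if __name__ == "__main__":
    # Peak value and its reported spread (0.13 +/- 0.03), plus the
    # previously published value of 0.188 for comparison.
    for ks in (0.10, 0.13, 0.16, 0.188):
        print(f"Ks = {ks:.3f} -> ~{ks_to_age_mya(ks):.1f} mya")
```

Because the estimate scales linearly with Ks and inversely with the assumed rate, the choice of clock rate dominates the uncertainty of any date obtained this way.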
LysM protein domain architectures
The LysM motif is a ubiquitous protein module found in association with a wide variety of protein domains, thus creating a tremendous diversity of domain architectures. Interestingly, only a simple core LysM architecture is conserved across different kingdoms and each individual kingdom has its own characteristic architectures (Figure 1), suggesting that LysM genes in different kingdoms have undergone distinct evolution and diversification, presumably reflecting functional selection. Recently, Onaga and Taira (2008) reported the isolation of one LysM protein (PrChiA) from the fern Pteris ryukyuensis, consisting of 2 LysM motifs and a Glyco_hydro_18 domain. However, our analysis finds that this specific protein architecture is limited to the fungal kingdom. The 2 LysM motifs of this protein are 95% identical to each other, whereas the 2 LysM motifs in a given plant LysM protein usually differ from each other. Moreover, a majority-ruled parsimony topology based on LysM motif sequences (about 42 aa in length) clearly shows that the 2 LysM motif sequences of PrChiA are separated from plant LysM motif sequences and lie in the same clade as fungal LysM motif sequences (see Additional file 2). It is not uncommon to isolate fungal genes from RNA preparations derived from plants and, therefore, further study of the origin of this proposed fern gene is warranted. This case exemplifies the value of domain architecture and phylogenetic studies as an aid to predicting gene function and distribution.
Although LysM proteins presumably bind oligomers of N-acetyl glucosamine (e.g., chitin) and peptidoglycan, it is not surprising to find the LysM motif associated with other carbohydrate-binding modules given the fact that a complex array of carbohydrates exists in nature. These carbohydrate-binding modules include the B_lectin domain (http://pfam.sanger.ac.uk/family?acc=PF01453), the chitin-binding domain in chitinases, and the WSC (cell-wall integrity and stress response component) domain (http://pfam.sanger.ac.uk/family?acc=PF01822). The presence of these domains with the LysM domain is consistent with the pattern that these proteins are involved in binding polysaccharides, including those of complex structure.
Physcomitrella, a bryophyte and arguably the most primitive plant whose genome has been sequenced, possesses a few plant-specific LysM architectures. The LYK genes, one example, are consistent with the recent report of a number of receptor-like kinases in liverworts. Proteins with a LysM domain and a predicted F-box domain are highly conserved in the plant kingdom and may play a role in regulating the stability of as yet unknown protein substrates. It is interesting to speculate that these are glycoproteins recognized by the LysM domain. Interestingly, this domain arrangement is also present in the genome of Chlamydomonas, a green alga that diverged from land plants over 1 billion years ago. This indicates that the role for these proteins, presumably in glycoprotein degradation, can be traced back to Chlamydomonas and other primitive species as well. This notion is supported by the fact that green algae and moss genomes contain ubiquitin-proteasome elements, such as ubiquitin, ubiquitin-conjugating genes, and other F-box genes (http://genome.jgi-psf.org/Chlre3/Chlre3.home.html; http://genome.jgi-psf.org/Phypa1_1/Phypa1_1.home.html).
A few architectures that are present exclusively in primitive plants were progressively and apparently permanently lost during angiosperm evolution (the lack of fern and gymnosperm genome sequences precludes us from examining this trend in these lineages). Among these lost architectures, two, peptidase+LysM and C2+LysM, are also present in the genome of Chlamydomonas, suggesting the existence of common ancestors for these LysM proteins in nature. These LysM proteins appear to have been lost during the change from the unicellular, aquatic life style to a terrestrial, vascular life style. However, the nature of the peptidase domain in Chlamydomonas (peptidase_C1) is different from that in Physcomitrella (peptidase_M23). In fact, LysM+peptidase_M23 proteins are present in cyanobacteria (data not shown) and, therefore, may have been introduced into primitive plants by horizontal gene transfer.
LysM gene evolution and plant genome duplications
Only one copy of F-box+LysM genes was retained in most plant genomes after many rounds of polyploidy events. Similarly, the copy numbers of LysMn and LYP genes remain roughly the same across different plant genomes. This suggests that increased gene copy number may have deleterious effects and that the dosage of these genes is under tight regulation. In contrast, the LYK gene family underwent several successive rounds of expansion, especially in flowering plants. Indeed, the expansion of the LYK gene family can already be found in primitive plant lineages; that is, from the 2 LYK genes in Physcomitrella (bryophytes) to 5 LYK genes in the Selaginella (lycophytes) genome (Figure 1). This pattern of differential expansion of LysM genes could be due to different rates of gene duplication of individual LysM gene families in plants. However, it is more likely due to different rates of gene retention following gene duplications. For example, LYK genes, compared to other LysM gene families, appear to have been retained more frequently following gene duplications.
The phylogenies of LysM genes, especially LYK genes, reveal several rounds of gene duplication, consistent with whole genome duplications (Figures 2, 3, 4 and 5). The timing of these duplications can be estimated from the accumulation of synonymous substitutions in the duplicated sequences. Pairwise synonymous distance shows that the LYK subclades VIA and VIB likely resulted from the large-scale gene duplication shared by legume plants estimated at 54 mya [19, 23, 25]. Genome duplications around the time window of 50 mya were also reported in other flowering plants including grasses and Solanaceae plants. Pairwise synonymous estimates suggest that the splits between the LYK subclades IA and IB and between IIA and IIB may also be the outcome of the same round of large-scale duplication estimated at 300 mya (Figure 7). The Arabidopsis genome appears to have undergone ancient genome duplications around this time window [21, 22]. Moreover, our data suggest that this genome-wide duplication is shared in flowering plants and may have pre-dated the divergence of gymnosperm and angiosperm plants ~300 mya.
A few LYK subclades are tandemly arrayed in plant genomes (Figure 2). These arrangements could arise from either a local tandem duplication or a genome doubling followed by local rearrangement. The split of subclades IA and IIA was estimated at 300 mya, probably upon or shortly after the divergence of gymnosperm and angiosperm plants. However, the tandem array pattern is observed only in Rosid plants (Figure 2). This pattern may be conserved in dicot plants but, lacking data from Asterids and Caryophyllids, a firm conclusion is not possible. The fact that the gene duplication likely occurred earlier than the emergence of eudicot plants favors the hypothesis that the common ancestors of the subclades IA and IIA resulted from a genome duplication, with their current tandemly arrayed positions arising later due to local rearrangement. The data do not support a conclusion that the common ancestors of subclades IA and IIA resulted from local tandem duplication and were de-associated in monocots and re-associated again in dicots. In contrast, the data do suggest that the tandemly arrayed VIA and VIB resulted from a recent large-scale duplication estimated at 54 mya.
Functional characterization of LysM genes
The LysM domain was first described in bacterial enzymes whose role is to degrade the peptidoglycan cell wall [6, 7]. In this case, the LysM domain anchors these enzymes to the peptidoglycan, a polymer structurally similar to chitin. Hence, the discovery of the LysM domain in proteins genetically identified as the receptor for the bacterial chitooligosaccharide nodulation factor immediately suggested that the LysM domain mediated interaction with the oligosaccharide [9–13]. Such a role is also consistent with the finding of the LysM domain in LYKs or LYPs implicated in the recognition of chitin, a well-known fungal PAMP, which elicits the plant innate immunity response [9, 10]. These data clearly suggest a similar oligosaccharide binding role for the LysM domains found in other LysM protein families. Recently, peptidoglycan was shown to trigger typical PAMP-elicited immune responses in plants, while chitin was identified as a PAMP active on animal cells [30, 31]. It is very likely that LysM domain-containing receptors are involved in mediating these responses.
Uniformity of LysM gene nomenclature
A total of 201 LysM genes were identified from 10 plant species in this study. Considering that LysM genes are ubiquitous in the plant kingdom, one could imagine that innumerable LysM genes exist in plants. Naming these genes could be a great challenge. Indeed, there is already a diversity of names given to various LysM genes [8–11, 13, 32] and this creates confusion within the research community. In order to reduce this confusion, we incorporated all reported names of LysM genes in the phylogeny (Figures 2 and 3). Previously, we proposed a uniform nomenclature for all LysM domain-containing proteins and we used this same nomenclature in this study. This nomenclature system reflects subcellular localizations, phylogenetic relationships, and biological functions. We recommend adoption of this nomenclature as a way to reduce confusion in anticipation of an increasing interest in this important group of plant proteins.
We report a domain architecture catalogue of LysM proteins across all kingdoms and describe a comparative and evolutionary genomics study of LysM genes in the plant kingdom. Our data show that LysM-containing proteins display a wide range of domain architectures. Each individual kingdom appears to have evolved a distinct array of domain architectures, suggesting the presence of distinctive evolutionary paths in individual kingdoms. We show that early plant lineages acquired four characteristic domain architectures and progressively lost several primitive domain architectures. The LYK gene family apparently underwent intensive expansion, while the dosages of other types of plant LysM genes stay roughly the same throughout the entire plant kingdom. One of the two groups of tandemly arrayed plant LYK genes likely resulted from an ancient genome duplication followed by local genomic rearrangement, while the origin of the other group of tandemly arrayed LYK genes remains obscure. We defined the orthologous and paralogous relationships of plant LysM genes based on sequence alignment, phylogenetic topology, microsynteny, and nucleotide substitution levels. We also identified 16 pairs of putative homeologous genes in soybean. This study will provide clues to the functional characterization of plant LysM genes and be informative for functional studies of animal LysM genes.
The mining of LysM sequences was performed as previously described. The genome databases in this study are: Arabidopsis (Arabidopsis thaliana; http://www.arabidopsis.org/); rice (Oryza sativa; http://rice.plantbiology.msu.edu/osa1.shtml#); maize (Zea mays; http://magi.plantgenomics.iastate.edu/downloadall.html); poplar (Populus trichocarpa; http://genome.jgi-psf.org/Poptr1_1/Poptr1_1.home.html); Medicago truncatula (http://www.medicago.org/genome/); Lotus japonicus (http://www.kazusa.or.jp/lotus/); Vitis vinifera (http://www.genoscope.cns.fr/externe/GenomeBrowser/Vitis/); soybean (Glycine max; http://www.phytozome.net/soybean); pine (Pinus taeda; http://www.conifergdb.org/software.php); moss (Physcomitrella patens; http://genome.jgi-psf.org/Phypa1_1/Phypa1_1.home.html); spikemoss (Selaginella moellendorffii; http://genome.jgi-psf.org/Phypa1_1/Phypa1_1.home.html); green algae (Chlamydomonas reinhardtii; http://genome.jgi-psf.org/Chlre3/Chlre3.home.html). The final LysM protein sequences, CDS sequences, LysM positions, gene and intron structures, and other predicted features were compiled into Additional file 3.
LysM domain architectures
The LysM domain architectures were extracted from the Pfam database (http://pfam.sanger.ac.uk/family?acc=PF01476). For architectures in the prokaryotic, fungal and animal kingdoms, only those identified in more than 5 sequences were kept to draw the diagram of LysM domain architectures (Figure 1). The domain structures of all plant LysM proteins were analyzed with Pfam (http://pfam.sanger.ac.uk/) and InterProScan (http://www.ebi.ac.uk/Tools/InterProScan/). Signal peptides and transmembrane domains were predicted using SignalP (http://www.cbs.dtu.dk/services/SignalP/), with both Neural Network and Hidden Markov Models, and TMHMM (http://www.cbs.dtu.dk/services/TMHMM-2.0/), respectively.
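The filtering rule described above (keep only architectures seen in more than 5 sequences) can be sketched in a few lines of Python. This is an illustration under stated assumptions rather than the procedure actually used: the input format (a mapping from protein identifier to an ordered list of domain names), the "+"-joined architecture label, and the function name are all hypothetical.

```python
from collections import Counter


def common_architectures(proteins, threshold=5):
    """Count domain architectures and keep those seen in more than
    `threshold` sequences.

    proteins: dict mapping a protein identifier to its ordered list of
    domain names (assumed input format).
    Returns a dict {architecture_label: number_of_sequences}.
    """
    counts = Counter("+".join(domains) for domains in proteins.values())
    return {arch: n for arch, n in counts.items() if n > threshold}


if __name__ == "__main__":
    # Toy, made-up data: six LysM+LysM proteins and five LysM+Peptidase_M23.
    toy = {f"p{i}": ["LysM", "LysM"] for i in range(6)}
    toy.update({f"q{i}": ["LysM", "Peptidase_M23"] for i in range(5)})
    print(common_architectures(toy))
```

Treating an architecture as the ordered sequence of domain names means that LysM+B3 and B3+LysM are counted separately, which matches the way architectures are named in the text.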
Alignment, phylogeny, and synonymous distance
Protein sequences were aligned using MUSCLE 3.6 with a FASTA output format and manually edited using Jalview. Majority-ruled parsimony trees were generated using the program protpars of PHYLIP, with maximum likelihood branch lengths calculated using TREE-PUZZLE. Bootstrap values were calculated using the program seqboot of the PHYLIP package. All trees were viewed and printed into PDF format using A Tree Viewer. For calculation of synonymous distance, codon-aligned nucleic acid sequences were created using RevTrans 1.4 (http://www.cbs.dtu.dk/services/RevTrans/). Synonymous nucleotide substitutions per site were calculated using the program yn00 of the PAML package. The histograms of calculated Ks values were plotted and descriptive statistics were displayed using Minitab 15.0.
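The general idea behind a codon-aligned nucleotide alignment, in which a protein alignment is used as a scaffold for the corresponding coding sequences, can be illustrated with a short Python sketch. This is not the RevTrans implementation: the function name is hypothetical, the gap character is assumed to be "-", and the sketch does not check that each codon actually translates to the aligned residue.

```python
def codon_align(aligned_protein, cds, gap="-"):
    """Thread an unaligned coding sequence onto its aligned protein.

    Each residue in the aligned protein consumes the next codon of the
    CDS; each alignment gap is expanded to three gap characters. Any
    trailing nucleotides (for example a stop codon) are ignored.
    """
    n_residues = sum(1 for ch in aligned_protein if ch != gap)
    if len(cds) < 3 * n_residues:
        raise ValueError("CDS is too short for the aligned protein")
    pieces = []
    pos = 0
    for ch in aligned_protein:
        if ch == gap:
            pieces.append(gap * 3)
        else:
            pieces.append(cds[pos:pos + 3])
            pos += 3
    return "".join(pieces)


if __name__ == "__main__":
    # Toy example: protein "MK" aligned with one gap, CDS with a stop codon.
    print(codon_align("M-K", "ATGAAATAA"))
```

Keeping codons intact in this way is what allows synonymous and non-synonymous sites to be compared column by column in the downstream Ks calculation.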
Genomic sequences surrounding selected LYP and LysMn genes, about 1–2 Mb in length, were extracted from the above databases. These stretches of genomic sequence were annotated using FGENESH, with a dicot species model and Arabidopsis matrix for dicot plants and a monocot species model and rice matrix for rice. The annotated protein sequences were compiled into a peptide sequence database using the BLAST program. Repetitive sequences were excluded from the databases. BLASTp was used to compare protein identity and similarity against the database with an E-value cutoff of 1e-20 and a percent identity cutoff of 35% between species and 40% within the same species. The gene and intron symbols were drawn using GenePicPipe Synteny Grapher (http://www.medicago.org/genome/rpg1/). The microsynteny maps were finally drawn in Adobe Illustrator 10.0.
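The hit-filtering criteria stated above (E-value cutoff of 1e-20, and percent identity of 35% between species or 40% within a species) can be expressed as a small Python sketch. The record type, field names and function name below are hypothetical, and treating the cutoffs as inclusive is an assumption of this illustration; the text does not say how boundary values were handled or how the BLAST output was parsed.

```python
from dataclasses import dataclass


@dataclass
class Hit:
    """Minimal, hypothetical record for one BLASTp hit."""
    query_species: str
    subject_species: str
    percent_identity: float
    evalue: float


def passes_cutoffs(hit, max_evalue=1e-20,
                   min_identity_between=35.0, min_identity_within=40.0):
    """Return True if a hit meets the E-value and identity cutoffs.

    The identity cutoff depends on whether query and subject come from
    the same species (40%) or from different species (35%).
    """
    same_species = hit.query_species == hit.subject_species
    min_identity = min_identity_within if same_species else min_identity_between
    return hit.evalue <= max_evalue and hit.percent_identity >= min_identity


if __name__ == "__main__":
    hits = [
        Hit("soybean", "Medicago", 37.0, 1e-30),  # kept: between species
        Hit("soybean", "soybean", 37.0, 1e-30),   # dropped: within species
        Hit("soybean", "Medicago", 60.0, 1e-10),  # dropped: weak E-value
    ]
    for h in hits:
        print(h, passes_cutoffs(h))
```

The stricter within-species threshold reflects the expectation that genuine paralogs from a recent duplication remain more similar than orthologs separated by speciation.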
We thank Drs. Randy Shoemaker and Myron Peto for critical reading of this manuscript. This work was funded by a grant to GS from the US Department of Energy, Energy Biosciences Program, Office of Basic Energy Sciences (grant No. DE-FG02-02ER15309), and supported by USDA-CSREES to the National Center for Soybean Biotechnology.
- Bateman A, Bycroft M: The structure of a LysM domain from E. coli membrane-bound lytic murein transglycosylase D (MltD). J Mol Biol. 2000, 299 (4): 1113-1119. 10.1006/jmbi.2000.3778.View ArticlePubMedGoogle Scholar
- Zhang XC, Wu XL, Findley S, Wan JR, Libault M, Nguyen HT, Cannon SB, Stacey G: Molecular evolution of lysin motif-type receptor-like kinases in plants. Plant Physiol. 2007, 144 (2): 623-636. 10.1104/pp.107.097097.PubMed CentralView ArticlePubMedGoogle Scholar
- Bielnicki J, Devedjiev Y, Derewenda U, Dauter Z, Joachimiak A, Derewenda ZS: B. subtilis ykuD protein at 2.0 A resolution: insights into the structure and function of a novel, ubiquitous family of bacterial enzymes. Proteins. 2006, 62: 144-151. 10.1002/prot.20702.PubMed CentralView ArticlePubMedGoogle Scholar
- Mulder L, Lefebvre B, Cullimore J, Imberty A: LysM domains of Medicago truncatula NFP protein involved in Nod factor perception. Glycosylation state, molecular modeling and docking of chitooligosaccharides and Nod factors. Glycobiol. 2006, 16 (9): 801-809. 10.1093/glycob/cwl006.View ArticleGoogle Scholar
- Nickson AA, Stoll KE, Clarke J: Folding of a LysM domain: entropy-enthalpy compensation in the transition state of an ideal two-state folder. J Mol Biol. 2008, 380 (3): 557-569. 10.1016/j.jmb.2008.05.020.PubMed CentralView ArticlePubMedGoogle Scholar
- Jerse AE, Yu J, Tall BD, Kaper JB: A genetic locus of enteropathogenic Escherichia coli necessary fro the production of attaching and effacting lesions on tissue culture cells. Proc Nat Acad Sci USA. 1990, 87: 7839-7843. 10.1073/pnas.87.20.7839.PubMed CentralView ArticlePubMedGoogle Scholar
- Ponting CP, Aravind L, Schultz J, Bork P, Koonin EV: Eukaryotic signalling domain homologues in archaea and bacteria. Ancient ancestry and horizontal gene transfer. J Mol Biol. 1999, 289 (4): 729-745. 10.1006/jmbi.1999.2827.View ArticlePubMedGoogle Scholar
- Kaku H, Nishizawa Y, Ishii-Minami N, Akimoto-Tomiyama C, Dohmae N, Takio K, Minami E, Shibuya N: Plant cells recognize chitin fragments for defense signaling through a plasma membrane receptor. Proc Natl Acad Sci USA. 2006, 103 (29): 11086-11091. 10.1073/pnas.0508882103.PubMed CentralView ArticlePubMedGoogle Scholar
- Miya A, Albert P, Shinya T, Desaki Y, Ichimura K, Shirasu K, Narusaka Y, Kawakami N, Kaku H, Shibuya N: CERK1, a LysM receptor kinase, is essential for chitin signaling in Arabidopsis. Proc Nat Acad Sci USA. 2007, 104 (49): 19613-19618. 10.1073/pnas.0705147104.PubMed CentralView ArticlePubMedGoogle Scholar
- Wan J, Zhang XC, Neece D, Ramonell KM, Clough S, Kim SY, Stacey MG, Stacey G: A LysM receptor-like kinase plays a critical role in chitin signaling and fungal resistance in Arabidopsis. Plant Cell. 2008, 20 (2): 471-481. 10.1105/tpc.107.056754.PubMed CentralView ArticlePubMedGoogle Scholar
- Limpens E, Franken C, Smit P, Willemse J, Bisseling T, Geurts R: LysM domain receptor kinases regulating rhizobial Nod factor-induced infection. Science. 2003, 302 (5645): 630-633. 10.1126/science.1090074.View ArticlePubMedGoogle Scholar
- Madsen EB, Madsen LH, Radutoiu S, Olbryt M, Rakwalska M, Szczyglowski K, Sato S, Kaneko T, Tabata S, Sandal N, et al: A receptor kinase gene of the LysM type is involved in legume perception of rhizobial signals. Nature. 2003, 425 (6958): 637-640. 10.1038/nature02045.View ArticlePubMedGoogle Scholar
- Radutoiu S, Madsen LH, Madsen EB, Felle HH, Umehara Y, Gronlund M, Sato S, Nakamura Y, Tabata S, Sandal N, et al: Plant recognition of symbiotic bacteria requires two LysM receptor-like kinases. Nature. 2003, 425 (6958): 585-592. 10.1038/nature02039.View ArticlePubMedGoogle Scholar
- Project IRGS: The map-based sequence of the rice genome. Nature. 2005, 436: 793-800. 10.1038/nature03895.View ArticleGoogle Scholar
- Rizzon C, Ponger L, Gaut BS: Striking similarities in the genomic distribution of tandemly arrayed genes in Arabidopsis and rice. PLoS Computational Biol. 2006, 2 (9): e115-10.1371/journal.pcbi.0020115.View ArticleGoogle Scholar
- Tuskan GA, Difazio S, Jansson S, Bohlmann J, Grigoriev I, Hellsten U, Putnam N, Ralph S, Rombauts S, Salamov A, et al: The genome of black cottonwood, Populus trichocarpa (Torr. & Gray). Science. 2006, 313 (5793): 1596-1604. 10.1126/science.1128691.View ArticlePubMedGoogle Scholar
- Blanc G, Wolfe KH: Widespread paleopolyploidy in model plant species inferred from age distributions of duplicate genes. Plant Cell. 2004, 16 (7): 1667-1678. 10.1105/tpc.021345.PubMed CentralView ArticlePubMedGoogle Scholar
- Bowers JE, Chapman BA, Rong J, Paterson AH: Unravelling angiosperm genome evolution by phylogenetic analysis of chromosomal duplication events. Nature. 2003, 422: 433-438. 10.1038/nature01521.View ArticlePubMedGoogle Scholar
- Pfeil BE, Schlueter JA, Shoemaker RC, Doyle JJ: Placing paleopolyploidy in relation to taxon divergence: A phylogenetic analysis in legumes using 39 gene families. Syst Biol. 2005, 54 (3): 441-454. 10.1080/10635150590945359.
- Lynch M, Conery JS: The evolutionary fate and consequences of duplicate genes. Science. 2000, 290: 1151-1155. 10.1126/science.290.5494.1151.
- Vision TJ, Brown DG, Tanksley SD: The origins of genomic duplications in Arabidopsis. Science. 2000, 290: 2114-2117. 10.1126/science.290.5499.2114.
- Blanc G, Hokamp K, Wolfe KH: A recent polyploidy superimposed on older large-scale duplications in the Arabidopsis genome. Genome Res. 2003, 13: 137-144. 10.1101/gr.751803.
- Schlueter JA, Dixon P, Granger C, Grant D, Clark L, Doyle JJ, Shoemaker RC: Mining EST databases to resolve evolutionary events in major crop species. Genome. 2004, 47 (5): 868-876. 10.1139/g04-047.
- Yang YW, Lai KN, Tai PY, Li WH: Rates of nucleotide substitution in angiosperm mitochondrial DNA sequences and dates of divergence between Brassica and other angiosperm lineages. J Mol Evol. 1999, 48: 597-604. 10.1007/PL00006502.
- Lavin M, Herendeen PS, Wojciechowski MF: Evolutionary rates analysis of Leguminosae implicates a rapid diversification of lineages during the tertiary. Syst Biol. 2005, 54 (4): 575-594. 10.1080/10635150590947131.
- Onaga S, Taira T: A new type of plant chitinase containing LysM domains from a fern (Pteris ryukyuensis): roles of LysM domains in chitin binding and antifungal activity. Glycobiology. 2008, 18 (5): 414-423. 10.1093/glycob/cwn018.
- Ponting CP, Hofmann K, Bork P: A latrophilin/CL-1-like GPS domain in polycystin-1. Curr Biol. 1999, 9 (16): R585-R588. 10.1016/S0960-9822(99)80379-0.
- Sasaki G, Katoh K, Hirose N, Suga H, Kuma K, Miyata T, Su ZH: Multiple receptor-like kinase cDNAs from liverwort Marchantia polymorpha and two charophycean green algae, Closterium ehrenbergii and Nitella axillaris: Extensive gene duplications and gene shufflings in the early evolution of streptophytes. Gene. 2007, 401 (1–2): 135-144. 10.1016/j.gene.2007.07.009.
- Bowe LM, Coat G, dePamphilis CW: Phylogeny of seed plants based on all three genomic compartments: Extant gymnosperms are monophyletic and Gnetales' closest relatives are conifers. Proc Natl Acad Sci USA. 2000, 97: 4092-4097. 10.1073/pnas.97.8.4092.
- Gust AA, Biswas R, Lenz HD, Rauhut T, Ranf S, Kemmerling B, Gotz F, Glawischnig E, Lee J, Felix G, et al: Bacteria-derived peptidoglycans constitute pathogen-associated molecular patterns triggering innate immunity in Arabidopsis. J Biol Chem. 2007, 282: 32338-32348. 10.1074/jbc.M704886200.
- Da Silva CA, Hartl D, Liu W, Lee C, Elias JA: TLR-2 and IL-17A in chitin-induced macrophage activation and acute inflammation. J Immunol. 2008, 181: 4279-4286.
- Arrighi J, Barre A, Ben Amor B, Bersoult A, Soriano L, Mirabella R, Carvalho-Niebel F, Journet E, Gherardi M, Huguet T, et al: The Medicago truncatula lysin motif-receptor-like kinase gene family includes NFP and new nodule-expressed genes. Plant Physiol. 2006, 142: 265-279. 10.1104/pp.106.084657.
- Edgar RC: MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004, 32 (5): 1792-1797. 10.1093/nar/gkh340.
- Clamp M, Cuff J, Searle SM, Barton GJ: The Jalview Java alignment editor. Bioinformatics. 2004, 20: 426-427. 10.1093/bioinformatics/btg430.
- Felsenstein J: PHYLIP (Phylogeny Inference Package), version 3.6. Distributed by the author. 2000, Seattle: Department of Genetics, University of Washington.
- Schmidt HA, Strimmer K, Vingron M, von Haeseler A: TREE-PUZZLE: maximum likelihood phylogenetic analysis using quartets and parallel computing. Bioinformatics. 2002, 18 (3): 502-504. 10.1093/bioinformatics/18.3.502.
- Zmasek CM, Eddy SR: ATV: display and manipulation of annotated phylogenetic trees. Bioinformatics. 2001, 17 (4): 383-384. 10.1093/bioinformatics/17.4.383.
- Yang Z: PAML: a program package for phylogenetic analysis by maximum likelihood. Computer Applications in BioSciences. 1997, 13: 555-556.
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.