Common peptides shed light on evolution of Olfactory Receptors
© Gottlieb et al; licensee BioMed Central Ltd. 2009
Received: 08 October 2008
Accepted: 05 May 2009
Published: 05 May 2009
Olfactory Receptors (ORs) form the largest multigene family in vertebrates. Their evolution and expansion in vertebrate genomes have been the subject of many studies. In this paper we apply a motif-based approach to this problem in order to uncover evolutionary characteristics.
We extract deterministic motifs from ORs belonging to ten species using the MEX (Motif Extraction) algorithm, thus defining Common Peptides (CPs) characteristic of ORs. We identify species-specific CPs and show that their relative abundance is high only in fish and frog, suggesting relevance to water-soluble odorants. We estimate the origins of CPs according to the tree of life and track the gains and losses of CPs through evolution. We identify major CP gain in tetrapods and major losses in reptiles. Although the number of human ORs is less than half of the number of ORs in other mammals, the fraction of lost CPs is only 11%.
By examining the positions of CPs along the OR sequence, we find two regions that expanded only in tetrapods. Using CPs we are able to establish remote homology relations between ORs and non-OR GPCRs.
Selecting CPs according to their evolutionary age, we bicluster ORs and CPs for each species. Clean biclustering emerges when using relatively novel CPs. Evolutionary age is used to track the history of CP acquisition in the collection of mammalian OR families within HORDE (Human Olfactory Receptor Data Explorer).
The CP method provides a novel perspective that reveals interesting traits in the evolution of olfactory receptors. It is consistent with previous knowledge, and provides finer details. Using available phylogenetic trees, evolution can be rephrased in terms of CP origins.
Supplementary information is also available at http://adios.tau.ac.il/ORPS
Odor recognition in vertebrates is mediated by a large superfamily of olfactory receptor (OR) genes, G-protein coupled receptors (GPCRs) with seven trans-membrane domains [1, 2]. Whole genome studies discovered hundreds of intact ORs in vertebrate genomes, with repertoires ranging in size from ~100 in fishes to ~1000 in mouse [3–5] and .
A recent study of OR evolutionary dynamics indicated the existence of nine ancestral genes common to fish and tetrapods, of which only two are found in birds and mammals. Specifically one of these, known as Class II, has expanded enormously in mammals . Several studies have applied computational sequence analysis and phylogeny methods to study the evolution of the OR repertoire in vertebrates [6, 7]. One of these studies  used motifs to analyze human and mouse OR repertoires, focusing on classification of the motifs into classes and classification of the ORs using these motifs as features.
We adopt a different motif-based approach that extracts deterministic motifs, i.e. peptides, and explores their appearance along OR evolution. We apply the motif extraction algorithm MEX , the efficacy of which has been previously demonstrated in the study of enzymes , to 4027 OR sequences of 10 vertebrates. A short explanation of MEX is also provided in the Methods section. The union of all motifs leads to a list of 2717 MEX-derived peptides, to be referred to as Common Peptides (CPs). These motifs can be mapped onto specific locations on the seven trans-membrane domains.
Following CP occurrences on ORs of different species we can trace the development of these domains with evolution. Using the Tree of Life, we perform an ancestral reconstruction of CPs and determine their evolutionary ages.
For each species we perform biclustering of the matrix of CP occurrences on ORs. Choosing CP groups according to their evolutionary age we get different clustering patterns.
The use of CPs for studying OR sequences enables us to explore aspects of OR evolution that differ from those uncovered by phylogenetic methods. Owing to the deterministic nature of our motifs, it also enables us to uncover some fine details of OR groups that were previously studied using regular-expression motifs (see also ).
CP mapping on the Tree of Life
Table 1. Distribution of 3983 OR sequences, total CPs and species-specific CPs according to species. Columns: species; number of ORs; number of observed CPs; number of species-specific CPs; percentage of species-specific CPs.
All CPs are tested for their occurrence on all ORs, irrespective of which species lead to their extraction. We define species-specific CPs as CPs observed only in one species.
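The definition of species-specific CPs can be illustrated with a minimal sketch. The function names and the toy sequences below are hypothetical and are not taken from the study; the sketch only assumes that a CP "occurs" on an OR when it is an exact substring of the sequence.

```python
def cp_species_map(cps, ors_by_species):
    """Map each CP to the set of species in which it occurs on at least one OR."""
    found = {cp: set() for cp in cps}
    for species, sequences in ors_by_species.items():
        for cp in cps:
            if any(cp in seq for seq in sequences):
                found[cp].add(species)
    return found


def species_specific(cps, ors_by_species):
    """Return {species: [CPs observed only in that species]}."""
    result = {species: [] for species in ors_by_species}
    for cp, species_set in cp_species_map(cps, ors_by_species).items():
        if len(species_set) == 1:
            result[next(iter(species_set))].append(cp)
    return result


# Toy data: hypothetical sequences, not real ORs.
toy_ors = {
    "zebrafish": ["MAYDRYVAICKPLHY", "MPYDRYLAICHPLRY"],
    "mouse": ["MAYDRYVAICNPLLY", "MSFDRYVAVCHPLHY"],
}
toy_cps = ["YDRYVAIC", "AICKPL", "DRYVAV"]
specific = species_specific(toy_cps, toy_ors)
```

In this toy case the first peptide occurs in both species and is therefore not species-specific, while each of the other two is found in a single species only.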
On average an OR is matched by 48 CPs, covering 147 amino acids on its sequence. Some CPs partially overlap with one another. The total number of CPs found in sequences of one species (column 3 in Table 1) is highly correlated (Pearson correlation = 0.9) with the number of ORs per species (column 2 in Table 1).
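Because CPs may partially overlap, the number of covered amino acids is the size of the union of all matched positions rather than the sum of CP lengths. The following sketch shows one way to count both quantities for a single sequence; the function names and the toy sequence are assumptions made for illustration.

```python
def find_all(sequence, peptide):
    """Yield every start index of peptide in sequence, including overlapping hits."""
    start = sequence.find(peptide)
    while start != -1:
        yield start
        start = sequence.find(peptide, start + 1)


def cp_coverage(sequence, cps):
    """Return (number of matching CPs, number of residues covered by at least one CP)."""
    covered = set()
    matching = 0
    for cp in cps:
        hits = list(find_all(sequence, cp))
        if hits:
            matching += 1
        for start in hits:
            covered.update(range(start, start + len(cp)))
    return matching, len(covered)


# Toy data: a hypothetical sequence and hypothetical peptides.
toy_or = "MAYDRYVAICKPLHYTTIM"
toy_cps = ["MAYDRY", "DRYVAICKP", "PLHYT", "GNLLI"]
n_cps, n_residues = cp_coverage(toy_or, toy_cps)
```

Here three of the four peptides match, and their overlapping occurrences cover 16 distinct residues even though their lengths sum to 20.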
The percentage of species-specific CPs is particularly high in fish and frog (although less than 6% of the pufferfish CPs are pufferfish-specific, the percentage of fish-specific CPs, counting both fish, is 18%). The percentage drops significantly to an average of 2% in the other species, with human having the smallest number of species-specific CPs. This finding might be attributed to the difference between the aquatic environment, characteristic of fish and of the amphibian frog X. tropicalis, which remains aquatic in its adult life (see  and ), and the terrestrial environments characteristic of the other species: presumably CPs were lost, together with their ORs (groups δ, ε, ζ and η in ), in terrestrial species that developed later.
We identify "novel CPs" as those that exist in the current ancestor/species but did not exist in previous ancestors, and "lost CPs" as those that do not exist in the current ancestor/species but did exist in the previous ancestor. CPs that date back to previous ancestors are referred to as "conserved CPs".
The analysis detects one major addition of novel CPs in the ancestor of tetrapods, A2. Judging by  the branch length between A1 and A3 is about the same as that between A3 and A6. 47% of the CPs at A6 are novel with regard to A3, whereas 75% of the CPs at A3 are novel with regard to A1. We may thus conclude that the main expansion of OR CPs took place at, or before, A3.
Reptiles have suffered major losses of CPs, a trend that was further increased in chicken. Another major loss occurred in pufferfish.
Interestingly, while humans lost more than half of their ORs relative to other mammals, they lost only 11% of the CPs existing in A6. This suggests that some redundancy in mammalian ORs has been removed by OR pseudogenization in human. This result is surprising considering that the human intact OR repertoire contains far fewer subfamilies than those of other mammals (according to the HORDE classification system ). For example, there are 242 and 227 subfamilies in mouse and dog respectively, but only 175 subfamilies in human. Investigating subfamilies of mouse and dog ORs that are not matched by human subfamilies, we nonetheless find many of their CPs (68% of mouse CPs and 35% of dog CPs) elsewhere in other human subfamilies. In other words, from the CP perspective the similarity between human and mouse or dog is larger than that indicated by sequence similarity, which is the basis of the subfamily classifications.  hypothesize that the reduced sense of smell in human could correlate with the loss of functional genes. The high co-occurrence of CPs in functional human, mouse and dog genes hints, however, that the reduction of the human OR repertoire may not necessarily cause loss of functionality.
CPs that make a difference
Table: CPs specific to water-dwelling species. CPs facing the extracellular side of the membrane are in bold. Column: # of occurrences.
Table: CPs specific to land-dwelling species. CPs facing the extracellular side of the membrane are in bold. Column: # of occurrences.
GPCR remote homologies
Figures S1 and S2 are histograms of the same kind for chicken and mouse respectively.
In figures S3–S5 we study the loci of OR CPs on non-OR GPCRs in chicken and mammals respectively. Sharp peaks in mammals correspond to known motifs . No sharp peaks are observed in chicken.
Locations of CPs on the OR sequence
Figure 4 indicates four regions which are highly populated with CPs throughout vertebrate evolution. These regions are marked using a threshold drawn at 60% sequence population in zebrafish, displayed in figure 4B. All four regions reside in the interface between the transmembrane domains and the intracellular regions (IL1–3 and the C-terminal). These regions may be connected to structural constraints in the interface that binds the G-proteins. Figures displaying OR coverage by position for all other species, ranging from frog to human, look very similar (figures S10, S11 [see Additional file 1]). We observe that CPs within some regions have developed much higher coverage only in tetrapods. These regions are marked in figure 4D. They are: the end of the N-terminal, the interface between extracellular loop 1 (EL1) and TM1 and TM2, and the middle of extracellular loop 2 (EL2). Most of the newly emerged regions face the extracellular side of the membrane. This imposes structural constraints on the regions connected to odorant binding and might be specific to airborne odorants.
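The thresholding of per-position coverage can be sketched as follows. This is a minimal illustration under stated assumptions: positions are already normalized to a common coordinate system, the input is one set of CP-covered positions per OR, and a position counts as highly populated when its fraction is at least the threshold (whether the original analysis used "at least" or "more than" 60% is not stated). All names and the toy data are hypothetical.

```python
def position_coverage(covered_positions_per_or, length):
    """Fraction of ORs in which each normalized position is covered by some CP."""
    n = len(covered_positions_per_or)
    counts = [0] * length
    for covered in covered_positions_per_or:
        for pos in covered:
            counts[pos] += 1
    return [c / n for c in counts]


def regions_above(fractions, threshold=0.6):
    """Return inclusive (start, end) intervals where the fraction is >= threshold."""
    regions, start = [], None
    for pos, frac in enumerate(fractions):
        if frac >= threshold and start is None:
            start = pos
        elif frac < threshold and start is not None:
            regions.append((start, pos - 1))
            start = None
    if start is not None:
        regions.append((start, len(fractions) - 1))
    return regions


# Toy data: covered positions for five hypothetical ORs over ten positions.
toy = [{0, 1, 2, 7, 8}, {1, 2, 3, 8, 9}, {1, 2, 8, 9}, {2, 3, 4, 9}, {1, 2, 8, 9}]
fractions = position_coverage(toy, 10)
regions = regions_above(fractions, threshold=0.6)
```

Comparing the intervals obtained for different species is then a matter of running the same function on each species' OR set.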
CP-space reveals internal clusters
Using biclustering, we obtain simultaneous co-occurrences of ORs and CPs for each species. This provides a powerful visualization and allows the study of evolutionary trends across species. Details of the biclustering algorithm and its application are found in the Methods section.
We perform the analysis using different sets of CPs characterized by their evolutionary ages.
We identify ten clusters in zebrafish using ancestral (A1) CPs and six using zebrafish-novel CPs. Each of the latter six clusters matches one of the former clusters. The detailed cluster assignments are displayed in the supplementary material [see Additional file 1].
Novel CPs emerge from speciation and duplication events occurring after the split of fish from A1. We find 10 ORs that have no CPs novel to zebrafish or to the fish common ancestor (A8). This can serve as a first estimate of the number of ORs that existed in A1. They reside in the OR clusters indicated by red circles in Figure 5A.
Classification of zebrafish ORs into groups has been studied by  and . Both found eight groups with different OR membership (four groups of  and one of  contain only one OR each). Biclusters of novel CPs (Figure 5B) map perfectly to some groups (groups δ, ζ and η of ), while some groups are further split to reveal finer details (e.g. groups δ and ζ of  and group E of  are split into two biclusters). The 10 ORs which contain no novel CPs have members only from groups δ, θ and κ of . For the mapping between our clusters and the groups of  and , see additional files 2, 3 and 4.
The biclustering algorithm allows us also to differentiate between the different zebrafish clusters. The assumption is that OR clusters which relate to recent ancestry might also bear functional similarity. While some of the CPs that differentiate between the OR clusters are conserved remnants of duplication events, other CPs represent segments of these ORs that might contribute to a common functionality of the OR cluster. A table of the CPs of each cluster is provided [see Additional file 5].
Pufferfish has few novel CPs. Biclusters formed using CPs belonging to A1 look similar to the ones displayed in Figure 5A. The biclustering of pufferfish appears in figure S16 [see Additional file 1].
Unlike in zebrafish, not all the A1 and A2 conserved CPs form identifiable biclusters in frog. This suggests that they have been subjected to a higher mutation rate than observed in zebrafish, which may relate to the appearance of class II ORs in frog . The clusters in figure 6c relate to the groups γ and δ of , [see Additional file 4].
Chicken and lizard have too few novel A3 and A7 CPs to construct biclusters. The novel CPs of chicken form one big cluster, while the novel CPs of lizard form small disjoint clusters. CPs novel to A1 and A2 also show a difference between chicken and lizard: while the former reveals a robust big cluster, the latter shows no clusters at all. This implies a large number of recent duplications in chicken. The biclusterings of chicken and lizard appear in figures S17–S18 [see Additional file 1].
Biclusters in mammals are displayed in figures S19–S23 [see Additional file 1]. Biclusters are significant for CPs novel to A3–A6. They can be mapped to class I (fish-like) and class II (mammals-like) ORs, and to families of the Human Olfactory Receptor Data Explorer (HORDE). The mapping appears in Additional files 6, 7, 8, 9, 10, 11 and 12.
Novel CPs and mammalian families
Discussion and conclusion
We use CPs extracted by MEX (Motif Extraction algorithm) to study evolutionary processes in olfactory receptors. Such conserved CPs are known to have biological importance  and are expected to play structural and functional roles in olfactory receptors. Having extracted such CPs from ten species, we use evolutionary constraints to further employ the extracted CPs in making sense of the complex relationships of ORs of different species with one another.
The evolutionary perspective is obtained by applying the parsimony principle to a tree of life accommodating the studied species. It allows us to construct an ancestral phyletic pattern of the presence or absence of CPs in internal nodes of the tree. Using this construction, we show that the number of species-specific CPs is relatively high in fish and frog, but remains fixed in terrestrial species. The species-specific CPs in the aquatic species might be related to ORs detecting water-soluble odorants. We observe a major emergence of CPs in the ancestor of tetrapods and major losses of CPs in pufferfish and in chicken. A surprising result stemming from this mapping is that although humans lost half of the intact mammalian ORs, they lost only 11% of the conserved CPs, suggesting a controlled process of loss of redundant ORs. In other words, the potential odorant recognition of humans may have suffered only minor damage from the severe diminution of their OR repertoire.
CPs that differentiate between water-dwelling species and terrestrial species have potential biological significance and are candidates for further biochemical studies.
We show that some of the OR-extracted CPs exist in the general GPCR population, demonstrating the ancient origin of ORs and several other GPCRs.
The point that OR history stretches back to fish was made by  who claimed that 85%–90% of the frog, chicken, mouse and human OR repertoires were constructed from duplication of a single fish OR of group γ, Dr3OR5.4. One or more of these 35 fish group γ CPs are also observed in 98% of the tetrapod ORs. This is larger than the coverage observed for CPs in any other fish ORs. These 35 CPs are also almost exclusively located in the five most conserved positions in figure 3 (boundary between IL1 and TM2, boundary between IL2 and TM3, middle of EL2, boundary between IL3 and TM6 and TM7). We point out, however, that major changes have occurred in other nodes of evolutionary history. By studying loci of CPs we identify two regions that show high CP coverage starting from tetrapods: the N-terminal and the middle of the second extracellular loop. This might imply that these regions are important for the adaptation of ORs to airborne odorants.
Gene multiplication events are most naturally exhibited by the existence of clusters of ORs. Using the evolutionary separation into novel and conserved CPs, we are able to demonstrate clean OR clusters. This is done by applying a biclustering algorithm to matrices associating CPs with ORs within species: clean clusters emerge when novel CPs are being employed. Results vary with increasing evolutionary age of the species in question. Our biclustering results of the species studied by [6, 21] (zebrafish, frog and chicken) generally support their phylogenetic models, but provide finer OR grouping and a cleaner selection of the responsible ancestor (where CP formation has occurred). Finally, we are able to use the CP analysis to provide developmental details of OR families of the Human Olfactory Receptor Data Explorer (HORDE).
For the described study we selected a set of 4027 intact olfactory receptors (ORs) from ten vertebrate species including pufferfish (Takifugu rubripes), zebrafish (Danio rerio), frog (Xenopus tropicalis), chicken (Gallus gallus), lizard (Anolis carolinensis), platypus (Ornithorhynchus anatinus), opossum (Monodelphis domestica), dog (Canis familiaris), mouse (Mus musculus) and human (Homo sapiens).
All mammalian, chicken and lizard OR sequences are available at the HORDE . OR sequences of fish and frog were taken from the study of . Lizard and Platypus ORs appear in . The number of ORs for each species is listed in Table 1.
MEX is a motif extraction algorithm introduced by  as part of a method for grammar induction from texts and was later used on proteins . A given set of proteins is represented as different paths over a graph that consists of 20 vertices, corresponding to the 'alphabet' of 20 amino-acids. MEX proceeds by looking for convergence of many paths onto strings of amino-acids, and the subsequent divergence from such strings. The latter are defined as motifs if both convergence and divergence obey some statistical conditions. These conditions are imposed on context-dependent variable-order Markov chains that are constructed out of the data-paths. The algorithm has two parameters, η and α, specifying the amount of convergence/divergence and its statistical significance given the number of paths involved in the process. More information can be found on the website .
In the present analysis we ran MEX on the proteins of each species separately, using the parameter values η = 0.9 and α = 0.01. We restricted ourselves to peptides of length 5 amino-acids or more and appearing in at least 4 ORs. These peptides were merged into one list, where duplicates and peptides containing other peptides were removed. The resulting non-redundant list contains 2717 Common Peptides (CPs). Each of the CPs was then searched on the ORs of all species. CPs that appear only in the ORs of one of the studied species are defined to be species-specific.
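The post-processing of MEX output described above can be sketched as follows. This does not implement MEX itself; it only illustrates the length and occurrence filters and the merging step. The function names are hypothetical, and the sketch assumes one reading of the redundancy rule: when one peptide contains another, the longer one is dropped and the shorter one is kept.

```python
def filter_motifs(motifs, sequences, min_len=5, min_ors=4):
    """Keep motifs of at least min_len residues that occur in at least min_ors sequences."""
    kept = []
    for motif in motifs:
        if len(motif) < min_len:
            continue
        n_hits = sum(1 for seq in sequences if motif in seq)
        if n_hits >= min_ors:
            kept.append(motif)
    return kept


def merge_non_redundant(per_species_motifs):
    """Union per-species lists, dropping duplicates and peptides that contain another peptide."""
    unique = sorted(set().union(*per_species_motifs), key=lambda p: (len(p), p))
    kept = []
    for peptide in unique:  # shortest first, so contained peptides are seen before containers
        if not any(short in peptide for short in kept):
            kept.append(peptide)
    return kept


# Toy data: hypothetical sequences and motifs, not real MEX output.
toy_seqs = ["AMAYDRYV", "MAYDRYLL", "QQMAYDRY", "MAYDRYAA", "PLHYT"]
filtered = filter_motifs(["MAYDRY", "PLHYT", "DRY"], toy_seqs)
merged = merge_non_redundant([["PMYFFL", "MAYDRY"], ["MAYDRY", "MAYDRYVAIC", "LHTPMY"]])
```

In the toy run, `PLHYT` is rejected because it occurs in a single sequence, `DRY` because it is too short, and `MAYDRYVAIC` is removed from the merged list because it contains `MAYDRY`.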
Fitting CPs to the tree of life and phylogenetic analysis
We used the tree of life web project, available at  to construct the relationships between the species. The relations between the species are consistent with the tree of life of . Dog, Mouse and Human were put under one common ancestor according to the tree of life web project, although there are other possible ancestral orders based on different sets of genes (see also [27, 28]). Trying other arrangements for Dog, Mouse and Human did not alter the derived conclusions. The assessment of CP origins uses Wagner parsimony, as implemented by the Phylogeny Inference Package (PHYLIP) computer programs. Similar results are also obtained by Dollo parsimony.
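To make the parsimony step concrete, the sketch below reconstructs the presence (1) or absence (0) of a single CP at internal nodes using the classic two-pass Fitch procedure, which for a two-state character minimizes the same count of 0→1 and 1→0 changes as Wagner parsimony. This is not the PHYLIP implementation: the tie-breaking rule, the node names and the toy tree are assumptions chosen for illustration, and they do not reproduce the A1–A8 labelling of the study.

```python
def fitch_sets(tree, leaf_states, sets):
    """Bottom-up pass: compute the Fitch state set for every node.

    tree is a leaf name (str) or a tuple (name, left_subtree, right_subtree).
    """
    if isinstance(tree, str):
        sets[tree] = {leaf_states[tree]}
        return sets[tree]
    name, left, right = tree
    a = fitch_sets(left, leaf_states, sets)
    b = fitch_sets(right, leaf_states, sets)
    sets[name] = (a & b) or (a | b)  # intersection if non-empty, otherwise union
    return sets[name]


def assign_states(tree, sets, parent_state, states):
    """Top-down pass: keep the parent's state when allowed, else pick one from the set."""
    name = tree if isinstance(tree, str) else tree[0]
    if parent_state in sets[name]:
        states[name] = parent_state
    else:
        states[name] = min(sets[name])  # assumed tie-break: prefer absence (0)
    if not isinstance(tree, str):
        for child in tree[1:]:
            assign_states(child, sets, states[name], states)


def reconstruct(tree, leaf_states):
    """Return {node name: 0 or 1} for all leaves and internal nodes."""
    sets, states = {}, {}
    fitch_sets(tree, leaf_states, sets)
    assign_states(tree, sets, None, states)
    return states


# Toy tree with hypothetical internal node names.
toy_tree = ("root",
            ("fish", "zebrafish", "pufferfish"),
            ("tetrapods", "frog", ("amniotes", "chicken", "mouse")))
toy_leaves = {"zebrafish": 0, "pufferfish": 0, "frog": 1, "chicken": 1, "mouse": 1}
states = reconstruct(toy_tree, toy_leaves)
```

For this toy CP, which is present in frog, chicken and mouse but absent from both fish, the reconstruction places a single gain on the branch leading to the tetrapod ancestor.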
Since some CPs differ by only one amino acid from others, we have also checked whether loss and gain of a CP on any internal node corresponds to a mutation of a single amino-acid (interpreted as a loss of the CP) into another amino-acid (interpreted as a gain of a CP). We have found that the number of such events is negligible (1 such event in an ancestral node on average and 7 on average in the species, occurring mainly in chicken and lizard).
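A check of this kind can be sketched as a search for pairs of lost and gained CPs that differ at exactly one position. The sketch below is a simplification that only compares peptides of equal length; how the original analysis treated peptides of different lengths is not stated, and the function names and toy peptides are hypothetical.

```python
def differ_by_one(cp_a, cp_b):
    """True if the two peptides have equal length and exactly one mismatching residue."""
    return len(cp_a) == len(cp_b) and sum(x != y for x, y in zip(cp_a, cp_b)) == 1


def substitution_events(lost, gained):
    """Pairs (lost CP, gained CP) at one node that could reflect a single substitution."""
    return sorted((l, g) for l in lost for g in gained if differ_by_one(l, g))


# Toy data: hypothetical peptides lost and gained at one node.
events = substitution_events({"MAYDRY", "PLHYT"}, {"MAFDRY", "LHTPMY"})
```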
Following parsimony estimation, each internal node A1–A8, and each species, has a list of CPs associated with it. We identify "novel CPs" as those that exist in the current ancestor/species but did not exist in previous ancestors, and "lost CPs" as those that do not exist in the current ancestor/species but did exist in the previous ancestor. CPs that date back to previous ancestors are referred to as "conserved CPs".
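These definitions translate directly into set operations. The sketch below compares a node only with its immediate parent, which is a simplifying assumption; the function name and toy peptides are hypothetical.

```python
def classify_cps(parent_cps, node_cps):
    """Split CPs into novel, lost and conserved relative to the immediate parent node."""
    parent, node = set(parent_cps), set(node_cps)
    return {
        "novel": node - parent,      # present now, absent in the parent
        "lost": parent - node,       # present in the parent, absent now
        "conserved": node & parent,  # present in both
    }


def lost_fraction(parent_cps, node_cps):
    """Fraction of the parent's CPs that are absent from the node."""
    parent = set(parent_cps)
    return len(parent - set(node_cps)) / len(parent)


# Toy data: hypothetical CP lists for a parent node and one descendant.
toy_parent = ["MAYDRY", "PLHYT", "LHTPMY", "PMYFFL"]
toy_node = ["MAYDRY", "PLHYT", "LHTPMY", "KICFLL"]
classes = classify_cps(toy_parent, toy_node)
fraction = lost_fraction(toy_parent, toy_node)
```

A statement such as "human lost 11% of the CPs existing in A6" corresponds to `lost_fraction` evaluated with the A6 list as the parent and the human list as the node.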
Normalizing CP positions
Each CP contains a set of positions relative to the start of each OR. Due to variable N-Terminal length and gaps, we needed to normalize the different positions of each CP appearing in different ORs. We normalized the OR relative positions using ClustalW2 (available at ). We first aligned the five sequences used in  to construct a profile (replacing MOR257-1 that was not available in our set with MOR257-10). Each OR was then aligned to this profile.
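Once an OR has been aligned to the profile, normalizing a CP position amounts to mapping a residue index in the ungapped sequence to a column of the alignment. The sketch below assumes the gapped, aligned sequence is already available as a string with `-` for gaps (the alignment itself, done with ClustalW2 in the study, is not reproduced here). The function names and toy alignments are hypothetical.

```python
def column_map(aligned_sequence, gap="-"):
    """Map each residue index of the ungapped sequence to its alignment column."""
    return [col for col, char in enumerate(aligned_sequence) if char != gap]


def normalized_hits(aligned_sequence, cp, gap="-"):
    """Alignment columns at which occurrences of cp start in the ungapped sequence."""
    raw = aligned_sequence.replace(gap, "")
    cols = column_map(aligned_sequence, gap)
    hits = []
    start = raw.find(cp)
    while start != -1:
        hits.append(cols[start])
        start = raw.find(cp, start + 1)
    return hits


# Toy data: two hypothetical ORs with N-terminal regions of different lengths.
hits_a = normalized_hits("--MAYDRYV", "MAYDRY")
hits_b = normalized_hits("MSMAYDRY-", "MAYDRY")
```

The same peptide starts at raw position 0 in the first toy sequence and at raw position 2 in the second, but both occurrences map to the same alignment column.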
Biclustering is performed on the ORs of each species, using subsets of CPs, each subset corresponding to a different origin on the tree of life. Each OR is represented by a binary vector that signifies the presence or absence of each of the CPs on its sequence. To reduce noise, we first removed all ORs having fewer than 5 CPs from the relevant tree of life node. We then removed CPs that appear in fewer than 5 ORs from the remaining set. ORs left with no CPs after the previous removal were also removed. We used the bipartite spectral graph partitioning algorithm of . Initially designed for documents and words, this biclustering algorithm handles sparse data well and produces biclusters of ORs and CPs. We augmented the algorithm to produce clearer bicluster images. This was achieved by applying single-linkage hierarchical clustering to each produced bicluster and sorting each bicluster according to the hierarchical clustering, thus handling less homogeneous clusters better. This augmentation does not alter the assignment of ORs and CPs to biclusters, but merely provides better visualization of the biclusters.
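The noise-removal steps that precede biclustering can be sketched as follows. The spectral partitioning and the hierarchical sorting are not shown; the sketch only builds the filtered binary OR-by-CP matrix. Function names, parameter names and toy data are hypothetical, and the toy run uses thresholds of 2 instead of 5 to stay small.

```python
from collections import Counter


def filter_for_biclustering(or_to_cps, min_cps_per_or=5, min_ors_per_cp=5):
    """Apply the three removal steps and return the surviving {OR: set of CPs}."""
    # Step 1: drop ORs with fewer than min_cps_per_or CPs.
    kept = {o: set(c) for o, c in or_to_cps.items() if len(c) >= min_cps_per_or}
    # Step 2: drop CPs appearing in fewer than min_ors_per_cp of the remaining ORs.
    counts = Counter(cp for cps in kept.values() for cp in cps)
    frequent = {cp for cp, n in counts.items() if n >= min_ors_per_cp}
    kept = {o: c & frequent for o, c in kept.items()}
    # Step 3: drop ORs left with no CPs.
    return {o: c for o, c in kept.items() if c}


def to_binary_matrix(or_to_cps):
    """Return (OR names, CP names, 0/1 matrix with one row per OR)."""
    ors = sorted(or_to_cps)
    cps = sorted(set().union(*or_to_cps.values()))
    matrix = [[1 if cp in or_to_cps[o] else 0 for cp in cps] for o in ors]
    return ors, cps, matrix


# Toy data: hypothetical OR and CP labels.
toy = {"OR1": {"cpA", "cpB", "cpC"}, "OR2": {"cpA", "cpB"},
       "OR3": {"cpC"}, "OR4": {"cpA", "cpD"}}
kept = filter_for_biclustering(toy, min_cps_per_or=2, min_ors_per_cp=2)
ors, cps, matrix = to_binary_matrix(kept)
```

A matrix of this form could then be passed to a spectral co-clustering implementation, for example scikit-learn's `SpectralCoclustering`, whose documentation describes it as based on Dhillon's 2001 algorithm; whether that matches the implementation used in the study is not stated in the text.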
1. Firestein S: How the olfactory system makes sense of scents. Nature. 2001, 413: 211-218. 10.1038/35093026.
2. Mombaerts P: Genes and ligands for odorant, vomeronasal and taste receptors. Nat Rev Neurosci. 2004, 5: 263-278. 10.1038/nrn1365.
3. Glusman G, Yanai I, Rubin I, Lancet D: The complete human olfactory subgenome. Genome Res. 2001, 11: 685-702. 10.1101/gr.171001.
4. Olender T, Fuchs T, Linhart C, Shamir R, Adams M, Kalush F, Khen M, Lancet D: The Canine Olfactory Subgenome. Genomics. 2004, 83: 361-372. 10.1016/j.ygeno.2003.08.009.
5. Zhang X, Firestein S: The olfactory receptor gene superfamily of the mouse. Nat Neurosci. 2002, 5: 124-133.
6. Niimura Y, Nei M: Evolutionary dynamics of olfactory receptor genes in fishes and tetrapods. Proc Natl Acad Sci USA. 2005, 102: 6039-6044. 10.1073/pnas.0501922102.
7. Aloni R, Olender T, Lancet D: Ancient genomic architecture for mammalian olfactory receptor clusters. Genome Biol. 2006, 7: R88. 10.1186/gb-2006-7-10-r88.
8. Liu AH, Zhang X, Stolovitzky GA, Califano A, Firestein SJ: Motif-based construction of a functional map for mammalian olfactory receptors. Genomics. 2003, 81: 443-456. 10.1016/S0888-7543(03)00022-3.
9. Solan Z, Horn D, Ruppin E, Edelman S: Unsupervised learning of natural languages. Proc Natl Acad Sci. 2005, 102: 11629-11634. 10.1073/pnas.0409746102.
10. Kunik V, Solan Z, Edelman S, Ruppin E, Horn D: Motif Extraction and Protein Classification. Proceedings of the 2005 IEEE Computational Systems Bioinformatics Conference (CSB'05). 2005.
11. Segal L, Lapidot M, Solan Z, Ruppin E, Pilpel Y, Horn D: Nucleotide variation of regulatory motifs may lead to distinct expression patterns. Bioinformatics. 2007, 23 (13): i440-i449. 10.1093/bioinformatics/btm183.
12. AmphibiaWeb: Information on amphibian biology and conservation. [web application]. Berkeley, California. 2008, [http://amphibiaweb.org/]
13. Tinsley RC, Kobel HR: The Biology of Xenopus. 1996, Oxford: Oxford Scientific Press, 41-43.
14. Ciccarelli FD, Doerks T, Mering Cv, Creevey CJ, Snel B, Bork P: Toward Automatic Reconstruction of a Highly Resolved Tree of Life. Science. 2006, 311 (5765): 1283-1287. 10.1126/science.1123061.
15. The HORDE Project. [http://bioportal.weizmann.ac.il/HORDE]
16. Rouquier S, Blancher A, Giorgi D: The olfactory receptor gene repertoire in primates and mouse: Evidence for reduction of the functional fraction in primates. PNAS. 2000, 97 (6): 2870-2874. 10.1073/pnas.040580197.
17. Lagerström MC, Hellström AR, Gloriam DE, Larsson TP, Schiöth HB, Fredriksson R: The G Protein-Coupled Receptor Subset of the Chicken Genome. PLoS Comput Biol. 2006, 2 (6): e54. 10.1371/journal.pcbi.0020054.
18. Bjarnadóttir TK, Gloriam DE, Hellstrand SH, Kristiansson H, Fredriksson R, Schiöth HB: Comprehensive repertoire and phylogenetic analysis of the G protein-coupled receptors in human and mouse. Genomics. 2006, 88: 263-273. 10.1016/j.ygeno.2006.04.001.
19. Parker M, Wong Y, Parker S: An ion-responsive motif in the second transmembrane segment of rhodopsin-like receptors. Amino Acids. 2008, 35 (1):
20. Menashe I, Aloni R, Lancet D: A probabilistic classifier for olfactory receptor pseudogenes. BMC Bioinformatics. 2006, 7: 393. 10.1186/1471-2105-7-393.
21. Alioto TS, Ngai J: The odorant receptor repertoire of teleost fish. BMC Genomics. 2005, 6: 173. 10.1186/1471-2164-6-173.
22. Freitag J, Ludwig G, Andreini I, Rössler P, Breer H: Olfactory receptors in aquatic and terrestrial vertebrates. J Comp Physiol. 1998, 183 (5): 635-650. 10.1007/s003590050287.
23. Meroz Y, Horn D: Biological roles of specific peptides in enzymes. Proteins: Structure, Function, and Bioinformatics. 2008, 72 (2): 606-612. 10.1002/prot.21951.
24. Warren WC, Hillier LW, Graves JAM, Birney E, Ponting CP, Grützner F, Belov K, Miller W, Clarke L, Chinwalla AT, et al: Genome analysis of the platypus reveals unique signatures of evolution. Nature. 2008, 453 (7192): 175-183. 10.1038/nature06936.
25. The ADIOS project. [http://adios.tau.ac.il/]
26. Tree of Life web project. [http://www.tolweb.org/tree/]
27. Mammalian Phylogeny. [http://whozoo.org/mammals/mammalianphylo.htm]
28. Arnason U, dagger JA, Adegoke D, Bodin K, Born EW, Esa YB, Gullberg A, Nilsson M, Short RV, Xu X, et al: Mammalian mitogenomic relationships and the root of the eutherian tree. Proceedings of the National Academy of Science. 2002, 99: 8151-8156. 10.1073/pnas.102164299.
29. Lunter G: Dog as an Outgroup to Human and Mouse. PLoS Comput Biol. 2007, 3 (4): e74. 10.1371/journal.pcbi.0030074.
30. EBI Tools: ClustalW2. [http://www.ebi.ac.uk/Tools/clustalw2/index.html]
31. Man O, Gilad Y, Lancet D: Prediction of the odorant binding site of olfactory receptor proteins by human-mouse comparisons. Protein Sci. 2004, 13: 240-254. 10.1110/ps.03296404.
32. Dhillon IS: Co-clustering documents and words using bipartite spectral graph partitioning. Proceedings of the ACM SIGKDD Conference. 2001, 269-274.
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.