Figure 4 | BMC Evolutionary Biology

From: Coiled-coil protein composition of 22 proteomes – differences and common themes in subcellular infrastructure and traffic control

Flowchart of sequence comparison and clustering. Coiled-coil predictions were generated with the program MultiCoil [42]; output processing and coiled-coil domain selection were performed as described for the ARABI-COIL database [11]. The prediction data were used to generate a set of sequences with the coiled-coil domains masked out. The masked sequences were then used as queries against the unmasked sequences in an all-against-all Smith-Waterman sequence comparison (SW Search). The P-scores from this comparison were used for clustering of the output.
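The masking and clustering steps of the flowchart can be illustrated with a minimal Python sketch. It is not the authors' implementation. The function names, the mask character `X`, the 1-based inclusive domain coordinates, the single-linkage grouping, and the rule that a higher P-score means greater similarity are all assumptions made for illustration; the caption does not specify any of them. The Smith-Waterman search itself is not shown, so the scored pairs are supplied by the caller.

```python
from typing import Dict, Iterable, List, Tuple


def mask_domains(sequence: str,
                 domains: Iterable[Tuple[int, int]],
                 mask_char: str = "X") -> str:
    """Replace residues inside each predicted coiled-coil domain with mask_char.

    Assumption: domains are 1-based, inclusive (start, end) intervals.
    """
    residues = list(sequence)
    for start, end in domains:
        if start < 1 or end > len(residues) or start > end:
            raise ValueError(f"invalid domain interval: ({start}, {end})")
        for i in range(start - 1, end):
            residues[i] = mask_char
    return "".join(residues)


def cluster_by_score(ids: Iterable[str],
                     scored_pairs: Iterable[Tuple[str, str, float]],
                     threshold: float) -> List[List[str]]:
    """Group sequences by single linkage over pairwise scores.

    Assumption: two sequences are linked when score >= threshold, i.e. a
    higher score means greater similarity. Flip the comparison if the
    scores in use run the other way.
    """
    ids = list(ids)
    parent: Dict[str, str] = {i: i for i in ids}

    def find(x: str) -> str:
        # Follow parent links to the cluster root, compressing the path.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for a, b, score in scored_pairs:
        if score >= threshold:
            parent[find(a)] = find(b)

    clusters: Dict[str, List[str]] = {}
    for i in ids:
        clusters.setdefault(find(i), []).append(i)
    return sorted(sorted(members) for members in clusters.values())
```

For example, `mask_domains("MKTLEELQKRA", [(4, 8)])` masks residues 4 through 8 and leaves the flanking residues intact, so that a later comparison is driven by the non-coiled-coil regions of the query rather than by the coiled-coil regions themselves.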

