Skip to main content

Table 1 Characteristics of the datasets used

From: The origins of the evolutionary signal used to predict protein-protein interactions

 

SP-70 L POS

SP-70 L NEG

UP-50 L POS

UP-50 L NEG

UP-70 L POS

UP-70 L NEG

Number of pairs

42

92

86

201

65

107

Mean (± stdev) number of sequences per alignment

15 ± 7

14 ± 4.4

32 ± 13

25 ± 12

25 ± 11

20 ± 7.8

Median number of sequences per alignment

12

12

33

24

23

19

Full MSA

Mean genetic distance

1.297

1.694

1.567

1.779

1.394

1.548

Alignment Length

582

687

865

1008

755

796

Dataset containing maximum of 20% gapped columns in a MSA

Mean genetic distance

1.198

1.581

1.369

1.556

1.253

1.391

Alignment Length

420

502

491

598

516

540