Analysis of disruptive selection in subdivided populations

Background Analytical methods have been proposed to determine whether there are evolutionarily stable strategies (ESS) for a trait of ecological significance, or whether there is disruptive selection in a population approaching a candidate ESS. These criteria do not take into account all consequences of small patch size in populations with limited dispersal. Results We derive local stability conditions which account for the consequences of small and constant patch size. All results are derived from considering Rm, the overall production of successful emigrants from a patch initially colonized by a single mutant immigrant. Further, the results are interpreted in term of concepts of inclusive fitness theory. The condition for convergence to an evolutionarily stable strategy is proportional to some previous expressions for inclusive fitness. The condition for evolutionary stability stricto sensu takes into account effects of selection on relatedness, which cannot be neglected. It is function of the relatedness between pairs of genes in a neutral model and also of a three-genes relationship. Based on these results, I analyze basic models of dispersal and of competition for resources. In the latter scenario there are cases of global instability despite local stability. The results are developed for haploid island models with constant patch size, but the techniques demonstrated here would apply to more general scenarios with an island mode of dispersal. Conclusions The results allow to identity and to analyze the relative importance of the different selective pressures involved. They bridge the gap between the modelling frameworks that have led to the Rm concept and to inclusive fitness.


Background
Various criteria have been proposed to compute the stable states of the evolutionary dynamics of traits of ecological significance. Previous works ("adaptive dynamics", e.g., [1][2][3][4][5][6]) have highlighted the need to distinguish different kinds of stability. A strategy is convergence stable if the population evolves towards it by allelic substitutions. A convergence stable strategy is evolutionarily stable (noninvasible) if rare deviants are selected against. Otherwise, there is disruptive selection, and "branching" of the distribution of phenotypes in the population may occur [3,6].
When fitness can be evaluated exactly, the different kinds of stability can be evaluated. However, in many cases approximations are useful, either because exact results are not available or because they are too complex to allow better understanding of evolution. This occurs when populations are structured in patches occupied by a small number of individuals. In such a case, a widely used measure of fitness effects is inclusive fitness. Inclusive fitness measures fitness effects as the effect of a deviant strategy on the fitness of an individual which expresses this strategy, plus the effect on the fitness of an individual when the strategy is expressed by other individuals in the patch, the latter effect being weighted by a measure of genetic similarity of individuals within a patch [7]. Although the inclusive fitness approach often allows to identify selective pressures, it is desirable to integrate it in a more general framework where the different kinds of dynamics are distinguished [8]. Can inclusive fitness be used to compute convergence stability, evolutionary stability, or both? Some works made no distinction between the concepts of convergence and of evolutionary stability [9], while others have found that inclusive fitness is suitable for evaluating convergence stability but not for evolutionary stability [10,11]. There have been some attempts to derive evolutionary stability conditions using inclusive fitness concepts (see [10] and references therein) but further insight into the above issues has been limited by a dearth of well-established results which could be compared to some alternative approach. This paper will provide such results, using the R m concept introduced in ref. [12]. R m is the overall production of successful emigrants from a patch, descended from a single mutant immigrant. Ref. [12] presents an exact numerical method to compute R m in complex metapopulation models (also used in ref. [13]), but analytical conditions for convergence and evolutionary stability can also be deduced from R m . In this paper I show how this can be done. In particular, a new result is the analytical condition for local invasibility versus non-invasibility (i.e. evolutionary stability) of a convergence stable strategy for the island model of dispersal. Kin selection effects are taken into account in this computation, as the kin interactions that occur in the patch all the way from colonization to local allele extinction. Thus, we should also be able to recover known inclusive fitness expressions from R m , but how we can do that is not a priori obvious. We will see that inclusive fitness can be derived as a measure of local convergence stability from R m . But the evolutionary stability condition can also be understood in terms of the concepts of inclusive fitness theory.
Rather than considering the complex metapopulation model of ref. [12], I will consider discrete-time models which assume a constant number N of haploid adults per patch. This should help to see the logic of the method. Within this setting, I will analyze some basic models widely considered in previous works, dealing with the evolution of dispersal and with disruptive selection under competition for resources.

Results
We consider here models where N adults reproduce within each of a large ("infinite") number of patches. A large number of juveniles are produced by each adult. A fraction of them disperse, in which case they disperse randomly over all patches, following an "island" or "global" mode of dispersal. The juveniles then compete for access to reproduction so that exactly N of them survive this competition in each patch. No other exact assumption about reproduction, competition and dispersal is done at this stage (this is done later in applications).

The number of successful emigrants R m
We consider R m , the overall production of successful emigrants from a patch, descended from a single mutant immigrant in this patch. By successful I mean that the individual settles as one of the N adults in a patch. As originally described [12], R m counts individuals before the dispersal cost is paid and competition occurs, so that it counts the number of emigrant descendants, successful or not, of each emigrant, successful or not. Both definitions are effectively equivalent.
The mutant is considered rare enough at the metapopulation level that no further mutant immigrant occurs in the patch and no mutant will be encountered by emigrants. R m is a generalization of the net reproductive rate R 0 (or lifetime reproductive success) often considered in demography (e.g., [14,15]). However, R m does not count the number of offspring one generation later, but the number of successful emigrants from a patch, descended over several generations from one successful immigrant in this patch. R m is an appropriate fitness measure because all successful emigrants are equivalent in terms of their fitness expectations. The number of successful emigrants is counted until local extinction of the family descended from the immigrant. Even without local patch extinction, local extinction of the family occurs due to the recurrent inflow of other immigrants.

Exact computation of R m
For this computation, we follow the distribution of the number of descendants of the immigrant mutant over successive generations. N offspring are sampled independently from a large number of juveniles. Let π j be the probability that a randomly chosen offspring comes from one of the j mutants in the patch in the previous generation. π j is a function of the mutant and resident strategies (for examples, see the Applications below). Then, the probabilities a kj that there are k mutant descendants from j mutant parents in the patch are binomial terms: These transition probabilities define a Markov chain with one absorbing state (local extinction of the mutant allele). Now let A = (a kj ) for k = 1, ..., N, j = 1, ..., N (i.e., mutant allele being present). For j > 0, the probability of having j mutants in the patch in generation t is given by p j ≡ (A t i) j , where i is a vector representing the initial distribution of the mutant: a single mutant is represented as i ≡ (1,0,...,0) T , where T denotes transpose. In each generation, the expected number of successful emigrant gametes of each mutant adult in a patch with j mutants is written g j . The number of successful emigrant mutant gametes produced by the patch in generation t is therefore h j ≡ jg j . Let h ≡ (h j ). The expected outflow of successful mutants from the patch can be written Therefore, summing over generations we obtain the fitness measure R m in the form where B ≡ (I -A) -1 . This is a simple discrete time version of eq. 6 in ref. [12].
R m is a function of the resident strategy z and of the mutant strategy, whose deviation from z is denoted δ.
Local stability conditions are given in terms of the firstand second-order derivatives of the fitness measure [4][5][6]. We use the prime notation for differentiation with respect to δ throughout the paper. A strategy z* is a candidate ESS (evolutionarily stable strategy) if = 0 at z*. A candidate ESS z* is locally convergence stable if > 0 at z <z* and < 0 at z >z*, so that selection brings a population expressing strategy z ≠ z* closer to z*. Hence, a strategy is locally convergence stable if d /dz < 0 at z*, where is evaluated at δ = 0 before the derivation with respect to z is carried out. It is locally non-invasible if < 0 at z* (for the borderline case = 0, see [16]). In the following sections, we provide analytical expressions for and .
A technical device underlying results below is the expansion of derivatives of h j and π j in terms of powers of j.
Results will rely upon the highest order term in these expansions being j 2 for the first-order derivatives with respect to δ, and j 3 for the second-order derivatives. This holds because, whether interactions between individuals are additive or not, h j and π j can be expanded in terms of δ times the sum (or average) of individual phenotypes in the patch, plus δ 2 times the sum of products of pairs of individual phenotypes, plus δ 2 times the sum of products of three phenotypes, and so on. Dominance of allelic effects in diploid populations raises an exception to this pattern (see Discussion).

and relatedness
Let w j (δ) be the expected number of adult offspring (emigrant or not) of a mutant parent among j mutant parents. The expected number of offspring jw j of the j mutants in the next generation is simply the sum of the expected number of successful emigrants (h j ) and of locally settled offspring (Nπ j ). That is, In the Appendix, it is shown that can be written as where d is the probability that an adult is an immigrant in a population following the resident strategy, and are the coefficients of j and j 2 /N in the expansion for , and F is the probability that two individuals sampled with replacement in a patch have a common ancestor in the patch.
Simple and well-known arguments are available to compute F [17,18], so it is useful to express in terms of F. Between two successive generations t and t + 1, F obeys the recursion (6) where 1/N is the probability that the same individual is sampled twice, (1 -d) 2 is the probability that two different individuals are both non-immigrants, and then F(t) is the probability that their parent(s) was or were from the same family, i.e. descend from the same immigrant. Thus the stationary value of F is The definition of F as the probability that two individuals descend from the same immigrant is also a definition of the genetic structure parameters used in population genetics (e.g., [17]), which are often used as relatedness coeffi- cients in kin selection theory (e.g., [19,20]). Hence F is a relatedness coefficient. Indeed we can write = W IF /d (8) where W IF is the inclusive fitness considered in other works (see the dispersal example for a concrete illustration). The 1/d factor means that R m measures effects of selection over an average of 1/d generations, in contrast to W IF which measures them over one generation.
is a function of the resident strategy z. As noted above, serves to find candidate ESSs z* such that = 0 at z*. Local convergence stability is determined by the sign of around z*, which may be deduced from the sign of d /dz at z*. As the Applications below show, it may not be necessary to compute d /dz to obtain the sign of and thus to check convergence stability.
We now consider invasion by mutants near a candidate ESS z*. To express we generalize the κ notation introduced above. Let (f), (f) and (f) be the coefficients of j, j 2 /N and j 3 /N 2 in the expansion for some quantity f. We obtain where K is the probability that three individuals sampled with replacement in the same patch descend from the same immigrant. K can be computed by the same methods as F, as detailed in the Appendix. All elements here are evaluated at δ = 0. A proof is presented in the Appendix, and evaluation of this formula in the Applications was checked against numerical evaluation of R m by eq. 3. The second line of eq. 9 includes fitness effects depending on changes in relatedness due to selection (see Discussion). In general, π j is of the order of 1/N, F is of the order of 1/ N and K of the order of 1/N 2 , so the whole second line of eq. 9 is of the order of 1/N and cannot be easily neglected relative to the remainder of .

Applications
Here we apply the above results first to a simple model of evolution of dispersal [21]. Although disruptive selection has been found in other dispersal models (e.g., [22][23][24]), it is not expected to occur in this model, which serves mainly to compare the present approach with a previous computation of stable strategies. Next we consider two models of resource competition, one of which was previously considered in ref. [25]. These are small patch, limited dispersal versions of a class of models of competition widely considered in previous works: see e.g. [16,[25][26][27][28][29], and references therein.

Dispersal
The trait under selection is the dispersal probability of juveniles, and there is a survival cost c of dispersal: the survival probability of dispersed juveniles is (1 -c) times that of philopatric ones. For mutants with dispersal z + δ, the probability that an individual is the philopatric descendant of a mutant parent is This expression is obtained as the ratio of the relative number of mutant juveniles that do not disperse [which is 1 -(z + δ) for each of j mutant parents] to the relative number of juveniles which come in competition in the patch. The latter is the number of philopatric juveniles, 1 -(z + δ) for each of the j mutant parents and 1 -z for each of the N -j non-mutants, plus the relative number of immigrant juveniles from other patch, which is Nz(1 -c) as parents in other patches are not mutants and their juveniles pay the cost of dispersal.
Likewise, the number of successful emigrant offspring of each deviant parent is because each deviant parent contributes a relative number (1 -c)(z + δ) of emigrant juveniles which compete with a relative number N [1 -z + (1 -c)z] = N(1 -zc) of competing juveniles in every other patch, and N adult offspring are sampled out of these juveniles (hence N disappears).
From these expressions and from eq. 4, one obtains and so that = 0 only for z* = (F -c)/(F -c 2 ), a formula known since ref. [21]. Together with eq. 7 and the relationship d = (1 -c)z/(1 -cz) between the immigration probability d and the emigration probability z, this yields Explicitly evaluating d /dz at the candidate ESS would be cumbersome, but it is easily verified that is positive (resp. negative) as z approaches 0 (resp. 1). Since does not vanish between 0 and z*, must increase (resp. decrease) from 0 when z decreases (resp. increases) from z*. Hence, z* is locally convergence stable. Further, at z*, From this formula, < 0, i.e. the candidate ESS is indeed an ESS, since cK <F 2 at the candidate ESS. The latter result is not obvious, but can be verified as follows. Using eq. 40 to eliminate the three-genes relationship K, the sign of is seen to depend on the sign of the factors F -c (which is positive) and c 3 (N 2 -1) + cF -N 2 F 3 . The latter is always negative for 0 <c < 1 and N ≥ 1, because the third The equivalence with the "direct fitness" method previously used to obtain eq. 14 and related results (e.g., [20,30]) can be verified as follows. According to this method, one considers the fitness function which describes the expected number of offspring of an individual with dispersal probability z • , in a patch of individuals with mean dispersal z 0 in a population with resident dispersal z [20,30,31]. Since jw j = jw (z + δ, z + jδ/N), and are equal to the two partial derivatives of w, yielding The term in brackets is the inclusive fitness [20]. One would also obtain (with π 1 (0) = w p (z*, z*)/N) where w (i,j) is the derivative of w, i times with respect to z • , and j times with respect to z 0 . Such expressions are con-venient when fitness is exactly a function of z 0 (i.e. when the phenotypes of all neighbors do not need to be distinguished), but they may not be useful otherwise.

Competition for resource
Here we consider a model where an individual is in stronger competition with other individuals which have phenotypes similar to its own phenotype. Many previous models are phenomenological, effectively assuming that each individual exploits only one type of resource according to its phenotype, yet that it interferes more with individuals with similar phenotypes. For a population subdivided in small patches, such a model was formulated in ref. [25] and fully analyzed in the case N = 2. However, its formulation is relatively complex, so we will analyze here a slightly different model, and the final section of the Appendix gives comparable results for the model of ref. [25]. Both models have similar qualitative outcomes, which converge for large patch size.
Here we assume that each individual effectively exploits a range of resource and that interference on fitness of similar phenotypes results only from scramble competition for acquisition of resource (e.g., [26]). We consider a range of resource, such that resource of type y has abundance ρ(y), which is the same in all patches. We assume that resource follows a normal distribution, for some constant σ ρ , and some y m such that resource y m is the most abundant one. σ ρ describes the width of the distribution of resource over different types.
We assume that each individual cannot exploit all resources with equal efficiency. An individual i with phenotype z i exploits resource y with an efficiency given by an efficiency function α(y, z i ), such that the individual gets a fraction of resource y, which is the ratio of its own efficiency at exploiting resource y to the sum of efficiencies at exploiting y among all individuals in the population. We assume that the efficiency function is normal-shaped, for some σ α , identical for all individuals. Thus, individual i is best at exploiting resource z i , and σ α quantifies the diversity of resource that an individual may exploit. An individual strategy is therefore characterized by z i and con- Below we consider residents with strategy z i = z and mutants with strategy z i = z + δ. Hence, in a patch with j mutants and N -j non-mutants, .
The share s j (δ) of total resource that individual i obtains is its share of resource y, times abundance of resource y, integrated over the distribution of y. Hence, in a patch with j mutants and N -j non-mutants, it is We assume that the expected number of juveniles of an individual is proportional to this share s j of total resource. The probability that an individual in the offspring generation is the descendant of anyone of j mutant parents in its patch is then π j (δ) = js j (δ) (1 -d) (22) and the expected number of successful emigrant gametes of j mutant parents is h j (δ) = N djs j (δ). (23) The factor N appears here, as in the dispersal example, because N adults settle in each patch. A general, useful check of expressions for π j and h j is that jw j (0) = j = h j (0) + Nπ j (0) (from eq. 4).
Here = -= (y m -z)/ . Hence is of the sign of y m -z, which means, as expected, that the population will evolve to exploit the most abundant resource, z* = y m . In other words, y m is locally convergence stable. At z*, similar computations yield For given d, F and K become negligible as N increases (they are of the order of 1/N and 1/N 2 , respectively), and in the limit we recover the result for panmictic populations, that disruptive selection occurs when σ α > σ ρ , i.e. when the resource is more broadly distributed than can be efficiently exploited by one individual.
Population structure inhibits branching, a previously noted result [25] for the interference competition model (see Appendix): for N = 2, is always negative and, unexpectedly, is independent of σ ρ (because the term in brackets in eq. 25 is then independent of σ ρ and proportional to 1/ ). For any N, the exact condition for disruptive selection is 1/ > 1/ + 2(F - The latter condition is complex, because but it implies that branching is inhibited by small patch size and low dispersal. This result can be understood as follows [25]. There is disruptive selection when a deviant individual gains fitness from avoiding competition with the resident strategy. However, in a subdivided population, competition is preferentially with genetically related individuals, i.e. with individuals more likely to be deviant than the average individual in the population. Then, deviant individuals do not avoid competition as much as if the neighbors in the patch were not related. Since K is of the order of 1/N 2 , it may seem reasonable to approximate by setting K = 0. This seems to work well only for N large and d large (Figure 1), because K is not negligible when d → 0 even for large N. Ignoring K will be misleading in this case.
The strategy z* = 0 may be locally stable without being globally stable: < 0 does not exclude that mutants with large effects could invade (for examples, see [32,33]). Here this may happen for a narrow range of parameter values in the structured population model, even though it does not happen in a single large patch of infinite size. In a single large patch at the candidate ESS, the fitness of a single mutant is so that large mutations invade if and only if small mutations invade. In structured populations, global stability was investigated by numerical evaluation of R m , focusing  Figure 1. By continuity, if for some mutants R m > 1 in the neighborhood of these threshold values, there must be close values of σ ρ /σ α and d such that R m > 1 for some mutants while < 0, which is the sought phenomenon of global instability despite local stability. It was found to happen for some parameters combinations as shown in Figure 2. Note that R m can be quite large in some cases, e.g. R m > 182 for N = 36, d = 1/100 (implying the threshold value / 3.94), and δ = 0.729, so the selective pressures at work could be efficient on a short time scale.
To some extent, global instability could be sought from higher-order derivatives of R m . In the present case the symmetry of selection on mutants with effect δ and -δ implies that R m is an even function of δ around the candidate ESS and that all its odd-order derivatives are null, so at least the fourth-order derivatives should be considered. In practice, this would be very complex.

Discussion
This paper has shown how local evolutionary stability conditions can be computed in structured populations. The expression for , which determines convergence stability, is consistent with earlier results from inclusive fitness theory. The expression for , which determines evolutionary stability stricto sensu, is new. These are local stability conditions: numerical evaluation of R m is required to check that mutants of large effect could not invade even if mutants of small effect cannot. Here I will comment on the different roles that relatedness plays in Cases of global instability Figure 2 Cases of global instability. The shaded area is the set of parameter values for which some mutants invade despite local non-invasibility of the candidate ESS z*. This figure is obtained from computation of R m for δ up to 2.187.

6 12 24 36
Patch size N measures of convergence and evolutionary stability, and in this perspective I will discuss the interpretation of the different terms which appear in the computation of and as detailed in the Appendix.
Evaluation of involves computation of hB'i, and this leads to computation of j 2 Bi, where j 2 = (1,2 2 ,...,N 2 ). j 2 Bi is the sum over generations of the expected square number of descendants of an immigrant in its patch, and therefore in a neutral model it is proportional to relatedness (see eq. 37). Thus we recover expressions for inclusive fitness in terms of relatedness in a neutral model. Here, relatedness describes the probability that two genes have a common ancestor within the patch they are sampled in; this is the property of relatedness that is relevant in relating inclusive fitness to the fitness of rare alleles.
In contrast, evaluation of second-order fitness effect generally requires evaluation of the effect of selection on relatedness. This has been previously acknowledged [10] but not taken into account because it was not found how to compute these effects. This problem has been solved here. Since the term j 2 Bi is proportional to relatedness, j 2 B'i is proportional to the derivative of relatedness, and this term is included in . The present analysis further shows that it can be expressed as function of the probability that three genes have a common ancestor within the patch, in a neutral model (see eqs. 51-53).
The different terms of can be interpreted by distinguishing effects on the distribution of mutant family size in generation t (the parents of the next generation) and effects on the number of offspring of these parents. The first line of the expression for , represents second-order effects on the number of offspring of these parents, cumulated over all generations. They include effects on the probability that next-generation individuals will be of philopatric origin ( ) and on the production of successful emigrants ( ), which are gathered in the single term . These terms differ from previous expressions for second-order effects: we find a three-gene relationship K [in the terms ( ) K above and w (0,2) K in eq. 18] in place of relatedness in the corresponding term of condition (15b) in [10]. Here K does not appear as measuring an effect of selection on relatedness, but because fitness (w j ) of a mutant focal individual is affected in a nonadditive way by pairs of mutant neighbors (where individuals are considered "with replacement", so that the focal individual and its two neighbors may all be the same individual).
The remainder of , is entirely new. It is of order 1/N, so that it cannot be easily neglected relative to other kin competition effects. It compounds first-order effects on the distribution of family size in generation t with first-order effects on the number of offspring of these individuals, h j and π j . The distribution of family size is given by A t i (eq. 2). Therefore, the effects of selection on this distribution are given by derivatives of A t , and the cumulative effects of selection over generations are given by derivatives of (I -A) -1 ≡ B. Thus, all expected effects of selection on the genetic structure of the parents in each generation are accounted by B' in the 2(h' + jA')B'i term of eq. 56. The latter term determines the whole expression (30) above. It involves effects not only on relatedness but also on the mean size of the family descended from an immigrant.
Our results have relied upon the highest order term in the expansion of being j 2 , and j 3 for . In diploid populations with dominance at the locus under selection, the highest order term for would be j 3 . The evaluation of would involve the three-gene relationship, and would also depend on higher powers of j.
Previous consideration of R m has been based on the principle that we should compute fitness for a rare mutant [12], where rare means here that fitness effects can be computed by considering a single immigrant mutant in a population of residents. But the concept of convergence stability also requires that when ≠ 0, selection has the same direction whatever the mutant frequency, so that allele invasion implies allele replacement. In the infinite island model, the direction and magnitude of first-order effects on fitness are the same whatever the mutant allele frequency. This is a distinct result owing to a distinct concept of relatedness, the so-called regression definition of relatedness [34,35]. This relatedness concept serves to describe the probability that two individuals within a patch share the same allele when the allele is not rare in the population, and implies that selection is not frequency-dependent. Thus two distinct concepts of relatedness are called for in the computation of fitness effects on rare mutants and of convergence stability. However, in the infinite island model, the same measure F quantifies both concepts of relatedness, and is appropriate for computing convergence stability. This holds because whether the mutant allele is rare or not, the first-order fitness effects depend on the probability that two gene lineages sampled in a patch have a common ancestor in this patch, i.e. that they descend from the same immigrant [36]. By contrast, evolutionary stability depends on frequencydependent selection, so no use of the regression definition is called for in the evaluation of the condition for evolutionary stability stricto sensu.

Conclusions
The present paper makes clear the relationship between inclusive fitness concepts and the alternative approach based on the number of successful emigrants. The applications are straightforward. In the dispersal model, branching has never been suggested to occur, and it is shown here that selection is never disruptive at the candidate ESS. This example serves to illustrate the relationship with some techniques from inclusive fitness theory. In the model of competition for resources, population structure inhibits branching. Our results allow a deeper analysis of this example, and we find that there are a few cases of global instability despite local stability. ; from which eq. 31 follows.
Then, one way to obtain the eigenvalues and eigenvectors of A is to note that , where the 's are the Stirling numbers of the second kind [37]. Thus , which relates j k A for any k to lower element-wise powers of j. For k = 1, this yields trivially the first eigenvalue and eigenvector. Together with k = 2, this yields the second eigenvalue and eigenvector, and so on. The eigenvalues of A are and eigenvectors associated to λ 1 and λ 2 are e 1 = j and e 2 = j + [(N -1)π -1]j 2 . It appears that more eigenvectors are not needed.
From eq. 33, where F is given by eq. 7. Then, j 2 Bi = NF/d. This result can be obtained by a more enlightening approach.
can be viewed as the expectation of the sum over generations of the squared size j 2 of the family descended from an initial mutant. In any generation, expected squared family size is N 2 times the probability that two individuals sampled with replacement belong to the same family, i.e. that they descend from the same immigrant. The latter probability is F, as noted in the main text. Then, the sum over a long time T of expected squared family sizes is N 2 TF. It is also the number of immigrants TNd times the expectation of family size This argument is easily generalized to other functions of number of mutants. In particular (38) where K is the probability that three individuals sampled with replacement in a patch descend from the same immigrant. K can be computed as follows: the three individuals sampled with replacement are either three times the same individual (with probability 1/N 2 ), or two of them are the same and differ from a third [with probability (1 -1/N)3/ N], or they are all distinct. In the latter cases we consider their origin in the previous generation t. These different cases give a recursion over successive generations t and t + 1: Then, the stationary value of K is In δ = 0, h = dj, so that, using B' = BA'B (see [38], p. 151) and eq. 34,

Consider
The lth element of jA' is the derivative of the expectation of the binomial distribution (1), hence it is . Then and, using eq. 43, Recall that h j is the expected number of emigrant adults offspring of j mutant adults, and that Nπ j is the number of adult offspring in the patch of j mutant adults. Hence h j + Nπ j is simply the expected number of adult offspring of j mutant adults, denoted as jw j (eq. 4). Then from eqs. 42 and 45, With eqs. 34 and 37, this implies eq. 5.

Evaluation of j 2 A', j 2 A'Bi and j 2 B'i
These computations are preliminary to the computation of . The jth element of j 2 A' is the derivative of the second moment of the binomial distribution (1). Hence, it is Using π j = jπ 1 at δ = 0, this implies Then, using eqs. 37 and 38, Next j 2 B'i = j 2 BA'Bi = NF [(1/d -1)j + j 2 ] A'Bi (51) (from eq. 36) (from eq. 50) from eq. 43.
The first term of eq. 56 is and, arguing as in eq. 44, the third term of eq. 56 is From eq. 4, the and N terms in the last two expressions can be collected into terms, and then summing expressions 59, 60 and 61 yields eq. 9.

Interference competition
Here, following ref. [25], I assume that the relative number of juveniles produced by each mutant in a patch with j mutants may be written where the functions ρ and α are defined as in the scramble competition model of the main text (eqs. 19 and 20). This corresponds to eq. 5 in ref. [25], with some change of notation and β there being set to 0. Likewise, the relative number of juveniles produced by each non-mutant in a patch with j mutants is These numbers are relative to the juvenile production by individuals in other patches, so that the relative total fecundity of a patch [which is jφ j (δ) + (N -j) φ j (0)] depends on the number of mutants in the patch. By contrast, patch fecundity was constant in the scramble competition model because the total amount of resource was constant in each patch. In the jargon of population genetics, the scramble competition model is a soft selection model while the interference competition model is a hard selection model (e.g., [39]). Assuming that juveniles emigrate with probability and that there is a relative survival cost c for emigrants (as in the dispersal model), and These expressions are consistent with eqs. 8-10 in ref. [25], in the case of non-overlapping generations. Note that results will be expressed more simply in terms of the probability of immigration than in terms of the probability of emigration .
From the above expressions, one obtains as expected that the population converges to z* = y m . In the computation of , it turns out that fitness effects depending on changes in relatedness due to selection are null, because all κ coefficients in the second line of eq. 9 are null (consistent with ref. [25] for N = 2). One obtains The numerical analysis of this result is similar to that of the scramble competition model, with again some cases of global instability despite local stability being found (details not shown). The main difference is that threshold values of / such that = 0 are lower than in the scrambling competition model. That is, disruptive selection occurs more easily. This is seen most easily for strong dispersal, d = 1. In this case F = 1/N and K = 1/N 2 , so that disruptive selection occurs when (1 -2/N) > in the scrambling competition model, while it occurs when (1 -1/N) > in the interference competition model (the latter result being consistent with eq. 18 in [25]). This difference arises because the interference competition model gives a higher benefit to deviant individuals when σ ρ > σ α : in a single large patch at the candidate ESS, the fitness of a rare mutant is w j (δ) = exp [(1/ -1/ ) δ 2 /2 (68)  which is higher than the comparable result for the scramble competition model (eq. 28).