Evolution of Coordination in Pairwise and Multi-player Interactions via Prior Commitments

Upon starting a collective endeavour, it is important to understand your partners' preferences and how strongly they commit to a common goal. Establishing a prior commitment or agreement in terms of posterior benefits and consequences from those engaging in it provides an important mechanism for securing cooperation in both pairwise and multiparty cooperation dilemmas. Resorting to methods from Evolutionary Game Theory (EGT), here we analyse how prior commitments can also be adopted as a tool for enhancing coordination when its outcomes exhibit an asymmetric payoff structure, in both pairwise and multiparty interactions. Arguably, coordination is more complex to achieve than cooperation since there might be several desirable collective outcomes in a coordination problem (compared to mutual cooperation, the only desirable collective outcome in cooperation dilemmas), especially when these outcomes entail asymmetric benefits for those involved. Our analysis, both analytically and via numerical simulations, shows that whether prior commitment would be a viable evolutionary mechanism for enhancing coordination and the overall population social welfare strongly depends on the collective benefit and severity of competition, and more importantly, how asymmetric benefits are resolved in a commitment deal. Moreover, in multiparty interactions, prior commitments prove to be crucial when a high level of group diversity is required for optimal coordination. Our results are shown to be robust for different selection intensities. We frame our model within the context of the technology adoption decision making, but the obtained results are applicable to other coordination problems.


Introduction
Achieving a collective endeavour among individuals with their own personal interest is an important social and economic challenge in various societies (Hardin, 1968;Ostrom, 1990;Pitt et al., 2012;Barrett, 2016;Sigmund, 2010). From coordinating individuals in the workplace to maintaining cooperative and trust-based relationship among organisations and nations, it success is often jeopardised by individual self-interest (Barrett et al., 2007;Perc et al., 2017). The study of mechanisms that support the evolution of such collective behaviours has been of great interest in many disciplines, ranging from Evolutionary Biology, Economics, Physics and Computer Science (Nowak, 2006;Sigmund, 2010;West et al., 2007;Han, 2013;Perc et al., 2017;Andras et al., 2018;Kumar et al., 2020). Several mechanisms responsible for the emergence and stability of collective behaviours among such individuals, have been proposed, including kin and group selection, direct and indirect reciprocities, spatial networks, reward and punishment (Nowak, 2006;West et al., 2007;Perc et al., 2017;Okada, 2020;Skyrms, 1996).
Recently, establishing prior commitments has been proposed as an evolutionarily viable strategy inducing cooperative behaviour in the context of pairwise and multi-player cooperation dilemmas (Nesse, 2001;Frank, 1988;Han et al., 2017Han et al., , 2015aSasaki et al., 2015;Arvanitis et al., 2019;Ohtsuki, 2018); namely, the Prisoner's Dilemma (PD) Hasan and Raja, 2013) and the Public Goods Game (PGG) (Han et al., 2015aKurzban et al., 2001). It provides an enhancement to different forms of punishment against inappropriate behaviours and of rewards to stimulate the appropriate ones (Chen et al., 2014;Martinez-Vaquero et al., 2015Sasaki et al., 2015;Powers et al., 2012;Szolnoki and Perc, 2012;Wang et al., 2019), allowing ones to efficiently avoiding free-riders (Han and Lenaerts, 2016;Han et al., 2015b) and resolving the antisocial punishment problem (Han, 2016). These works have primarily focused on modelling prior commitments for improving mutual cooperation among self-interested agents.
In the context of cooperation dilemma games (i.e. PD and PGG), mutual cooperation is the only desirable collective outcome to which all parties are required to commit if an agreement is to be formed. In other contexts such as coordination problems, this is not the case anymore since there might be multiple optimal or desirable collective outcomes and players might have distinct, incompatible preferences regarding which outcome a mutual agreement should aim to achieve (e.g. due to asymmetric benefits). Such coordination problems are abundant in nature, ranging from collective hunting and foraging to international climate change actions and multi-sector coordination (Santos and Pacheco, 2011;Ostrom, 1990;Barrett, 2016;Ohtsuki, 2018;Bianca and Han, 2019;Skyrms, 1996;Santos et al., 2016).
Hence, we explore how arranging a prior agreement or commitment can be used as a mechanism for enhancing coordination and the population social welfare in this type of coordination problems, in both pairwise and multi-player interaction settings. Before individuals embark on a joint venture, a preagreement makes the motives and intentions of all parties involved more transparent, thereby enabling an easier coordination of personal interests (Nesse, 2001;Cohen and Levesque, 1990;Han, 2013;Han et al., 2015b). Although our approach is applicable for a wide range of coordination problems (e.g. single market product investments as described above), we will frame our models within the technology investment strategic decision making problem, allowing us to describe the models clearly. Namely, we describe technology adoption games capturing the competitive market and decision-making process among firms adopting new technologies (Zhu and Weyant, 2003;Bardhan et al., 2004), with a key parameter α representing how competitive the market is (thus describing how important coordination is). Similar to previous commitment models, we will perform theoretical analysis and numerical simulations resorting to stochastic methods from Evolutionary Game Theory (EGT) (Hofbauer and Sigmund, 1998;Sigmund et al., 2010).
We will start by modelling a pairwise technology adoption decision making, where two investment firms (or players) competing within a same product market who need to make strategic decision on which technology to adopt (Zhu and Weyant, 2003;Chevalier-Roignant et al., 2011), a low-benefit (L) or a highbenefit (H) technology. Individually, adopting H would lead to a larger benefit. However, if both firms invest on H they would end up competing with each other leading to a smaller accumulated benefit than if they could coordinate with each other to choose different technologies. However, given the asymmetry in the benefits in such an outcome, clearly no firm would want to commit to the outcome where its option is L, unless some form of compensation from the one selecting H can be ensured.
We then extend and generalize the pairwise model to a multi-player one, capturing the strategic interaction between more than two investment firms. In the multi-player model, a key parameter µ is ascribed to the market demand of high technology, i.e. what is the optimal fraction of the firms in a group to adopt H. We analytically examine how players can be coordinated when there is a market demand for a particular technology. We show that differently from the two-player game, the newly defined parameter µ leads to a new kind of complexity when trying to achieve group coordination. When there is a high level of diversity in demand (i.e. intermediate values of µ), as can be seen in different technologies adoption contexts (Beede and Young, 1998;Schewe and Stuart, 2015), introducing prior commitment can lead to significant improvement in the levels of coordination and population social welfare.
The next section discusses related work, which is followed by a description of our models and details of the EGT methods for analysing them. Results of the analysis and a final discussion will then follow.

Related Work
The problem of explaining the emergence and stability of collective behaviours has been actively addressed in different disciplines (Nowak, 2006;Sigmund, 2010). Among other mechanisms, such as reciprocity and costly punishment, closely related to our present model is the study of cooperative behaviours and precommitment in cooperation dilemmas, for both two-player and multiplayer games Sasaki et al., 2015;Hasan and Raja, 2013). It has shown that to enhance cooperation commitments need to be sufficiently enforced and the cost of setting up the commitments is justified with respect to the benefit derived from the interactions-both by means of theoretical analysis and of behavioural experiments (Ostrom, 1990;Cherry and McEvoy, 2013;Kurzban et al., 2001;Chen and Komorita, 1994;Arvanitis et al., 2019). Our results show that this same observation is seen for coordination problems. However, arranging commitments for enhancing coordination is more complex, exhibiting a larger behavioural space, and furthermore, their outcomes strongly depend on new factors only appearing in coordination problems; namely, a successful commitment deal needs to take into account the fact that multiple desirable collective outcomes exist for which players have incompatible preferences; and thus how benefits can be shared through compensations in order to resolve the issues of asymmetric benefits, is crucially important (Bianca and Han, 2019).
We moved further by expanding our two-player game in the previous work to a multi-player model, the outcome has shown to be more complex as there are more players involved. We yet again investigated how coordination and cooperation can be improved using prior commitment deal when there are multiple players involved and also when there is a particular market demand (Bianca and Han, 2019). Our approach in exploring how implementing prior commitment enhances cooperation dilemma has also been investigated by previous researchers in the past (Chen and Komorita, 1994). A good level of cooperation was seen in a Public Good Game experiment when there was a binding agreement made during the prior communication stage among members of the group. They hypothesized that if members of a group are allowed to make a pledge (a degree of bindings/commitment) before their actual decisions, they will be able to communicate their intentions and it will overall increase cooperation rate in the population. As predicted, their results clearly demonstrates that making a pledge improves cooperation although the degree of commitment required in the pledge deferentially affected the cooperation rate (Chen and Komorita, 1994;Cherry and McEvoy, 2013;Kurzban et al., 2001).
There have been several other works studying the evolution of coordination, using the so-called Stag Hunt game, see e.g. (Skyrms, 2003;Pacheco et al., 2009;Santos et al., 2006;Sigmund, 2010). However, to the best of our knowledge there has been no work studying how prior commitments can be modelled and used for enhancing the outcome of the evolution of coordination. As our results below show, significant enhancement of coordination and population welfare can be achieved via the arrangement of suitable commitment deals.
Furthermore, it is noteworthy that commitments have been studied extensively in Artificial Intelligence and Multi-agent systems literature, see e.g. (Castelfranchi and Falcone, 2010;Chopra and Singh, 2009;Rzadca et al., 2015). Differently from our work, these studies utilise commitments for the purpose of regulating individual and collective behaviours, formalising different aspects of commitments (such as norms and conventions) in multi-agent systems. However, our results and approach provide important new insights into the design of such systems as these require commitments to ensure high levels of efficient collaboration and coordination within a group or team of agents. For example, by providing suitable agreement deals agents can improve the chance that a desirable collective outcome (which is best for the systems as a whole) is reached even when benefits provided by the outcome are different for the parties involved.

Models and Methods
In the following we first describe a two-player technology adoption game then extend it with the option of arranging prior commitments before playing the game. We then present a multi-player version of the model, with and without commitments, too. Then, we describe the methods, which are based on Evolutionary Game Theory for finite populations, which will be used to analyse the resulting models.
3.1 Two-player Tech Adoption (TD) Game

Two-player TD Without Commitments
We consider the scenario that two firms (players) compete for the same product market, and they need to make a (strategic) decision on which technology to invest on, a low-benefit (L) or a high-benefit (H) technology. The outcome of the interaction can be described in terms of costs and benefits of investments by the following payoff matrix (for row player): where c L , c H and b L , b H (b L ≤ b H ) represent the costs and benefits of investing on L and H, respectively; α ∈ (0, 1) indicates the competitive level of the product market: the firms receive a partial benefit if they both choose to invest on the same technology. Collectively, the smaller α is (i.e. the higher the market competitiveness), the more important that the firms coordinate to choose different technologies. For simplicity, the entries of the payoff matrix are denoted by a, b, c, d, as above. We have b > a and c > d. Without loss of generality, we assume that H would generate a greater net benefit, i.e.
Note that although we describe our model in terms of technology adoption decision making, it is generally applicable to many other coordination problems for instance wherever there are strategic investment decisions to make (in competitive markets of any products) (Zhu and Weyant, 2003;Chevalier-Roignant et al., 2011).

Two-player TD in Presence of Commitments
We now extend the model allowing players to have the option to arrange a prior commitment before a TD interaction. A commitment proposal is to ask the co-player to adopt a different technology. That is, a strategist intending to use H (resp., L) would ask the co-player to adopt L (resp., H). We denote these commitment proposing strategies as HP and LP, respectively. Similarly to previous models of commitments (for PD and PGG) (Han et al., , 2015a, to make the commitment deal reliable, a proposer pays an arrangement cost . If the co-player agrees with the deal, then the proposer assumes that the opponent will adopt the agreed choice, yet there is no guarantee that this will actually be the case. Thus whenever a co-player refuses to commit, HP and LP would play H in the game. When the co-player accepts the commitment though later does not honour it, she has to compensate the honouring co-player at a personal cost δ. Differently from previous models on PD and PGG where an agreed outcome leads to the same payoff for all parties in the agreement (mutual cooperation benefit), in the current model such an outcome would lead to different payoffs for those involved. Therefore, as part of the agreement, HP would compensate after the game an amount θ 1 to accepted player that honours the agreement; while LP would request a compensation θ 2 from such an accepted co-player.
Besides HP and LP, we consider a minimal model with the following (basic) strategies in this commitment version: • Non-proposing acceptors, HC and LC, who always commit when being proposed a commitment deal wherein they are willing to adopt any technology proposed (even when it is different from their intended choice), honour the adopted agreement, but do not propose a commitment themselves.
They play their intended choice, i.e. H and L, respectively, when there is no agreement in place; • Non-acceptors, HN and LN, who do not accept commitment, play their intended choice during the game, and do not propose commitments; • Fake committers, HF and LF, who accept a commitment proposal yet play the choice opposite to what has been agreed whenever the game takes place. These players assume that they can exploit the commitment proposing players without suffering the consequences 1 .
Note that similar to the commitment models for the PD game , some possible strategies have been excluded from the analysis since they are dominated by at least one of the strategies in any configuration of the game: they can be omitted without changing the outcome of the analysis. For example, those who propose a commitment (i.e. paying a cost ) but then do not honour (thus have to pay the compensation when facing a honouring acceptors) would be dominated by the corresponding non-proposers.
Together the model consists of eight strategies that define the following payoff matrix, capturing the average payoffs that each strategy will receive upon interaction with one of the other seven strategies Note that when two commitment proposers interact only one of them will need to pay the cost of setting up the commitment. Yet, as either one of them can take this action they pay this cost only half of the time (on average). In addition, the average payoff of HP when interacting with LP is given by We say that an agreement is fair if both parties obtain the same benefit when they honour it (after having taken into account the cost of setting up the agreement). For that, we can show that θ 1 and θ 2 must , and thus, both parties obtain b+c− 2 . Indeed, they can be achieved by comparing the payoffs of HP and HC when they interact, i.e. b − c − θ 1 = c + θ 1 , where solving this With these conditions it also ensures that the payoffs of HP and LP when interacting with each other are equal. Our analysis below will first focus on whether and when the fair agreements can lead to improvement in terms of coordination and the overall social welfare (i.e. average population payoff). We will discuss how different kinds of agreements (varying θ 1 and θ 2 ) affect the outcome, with additional results provided in Appendix.

Multi-player TD Without Commitments
We now describe a N -player (N > 2) version of the TD model. Again, as before, we will introduce the model in the context of technology investment market decision making. In a group (of size N ) with k players of type H (i.e., N − k players of type L), the expected payoffs of playing H and L can be written as follows where α H (k) and α L (k) represent the fraction of the benefit obtained by H and L players, respectively, which depend on the composition of the group, k. For two-player TD, both are equal to α.
The rationale of these definitions is that whenever k ≤ µ, full benefits from adopting H can be obtained, and moreover, if k > µ, the larger k the stronger the competition is among H adopters. Similarly for L adopters. The parameters α 1 and α 2 stand for the intensities of competition for investing in H and in L, respectively. For simplicity we assume in this paper α 1 = α 2 = α. Note that for N = 2 we recover the two-player model given in Equation (1).
The optimal group payoff is achieved when there are exactly µ players adopting H and the rest adopting L, leading to an average payoff for each member given by

Multi-player TD in Presence of Commitments
We can define the N -player game version with prior commitments in a similar fashion as in the two-player game. Commitment proposing strategists (i.e. HP and LP players) will propose before an interaction that the group will play the optimal arrangement (so that every player obtains an average payoff A). For simplicity, we assume that the committed players adopt the fair agreement, i.e. every member will obtain the same payoff after compensation is made to those adopting L. As such, we don't need to consider who will adopt H or L, as all would receive the same payoff at the end. Moreover, whenever a player in the group refuses to commit, commitment proposers will adopt H. Details of payoff calculation will be provided in Results section (cf. Table 1).

Evolutionary Dynamics
In this work, we will perform theoretical analysis and numerical simulations (see next section) using EGT methods for finite populations (Nowak et al., 2004;Imhof et al., 2005). Let Z be the size of the population.
In such a setting, individuals' payoff represents their fitness or social success, and evolutionary dynamics is shaped by social learning (Hofbauer and Sigmund, 1998;Sigmund, 2010), whereby the most successful individuals will tend to be imitated more often by the other individuals. In the current work, social learning is modelled using the so-called pairwise comparison rule (Traulsen et al., 2006), a standard approach in EGT, assuming that an individual A with fitness f A adopts the strategy of another individual B with fitness f B with probability p given by the Fermi function, The parameter β represents the 'imitation strength' or 'intensity of selection', i.e., how strongly the individuals base their decision to imitate on fitness difference between themselves and the opponents. For β = 0, we obtain the limit of neutral drift -the imitation decision is random. For large β, imitation becomes increasingly deterministic.
In the absence of mutations or exploration, the end states of evolution are inevitably monomorphic: once such a state is reached, it cannot be escaped through imitation. We thus further assume that, with a certain mutation probability, an individual switches randomly to a different strategy without imitating another individual. In the limit of small mutation rates, the dynamics will proceed with, at most, two strategies in the population, such that the behavioural dynamics can be conveniently described by a Markov Chain, where each state represents a monomorphic population, whereas the transition probabilities are given by the fixation probability of a single mutant (Imhof et al., 2005;Nowak et al., 2004). The resulting Markov Chain has a stationary distribution, which characterises the average time the population spends in each of these monomorphic end states.
Before describing how to calculate this stationary distribution, we need to show how payoffs are calculated, which differ for two-player and N-player settings, as below.
• Average Payoff for the Two Player Game Let π ij represent the payoff obtained by strategist i in each pairwise interaction with strategist j, as defined in the payoff matrices in Equations (1) and (2). Suppose there are at most two strategies in the population, say, k individuals using i (0 ≤ k ≤ Z) and (Z − k) individuals using j. Thus the average payoff of the individual that uses i or j can be written respectively as follows • Expected Payoff in The Multiplayer Game In the case of N -player interactions, suppose the population includes x individuals of type i and Z − x individuals of type j. The probability to select k individuals of type i and N − k individuals of type j, in N trails, is given by the hypergeometric distribution as follows (Sigmund, 2010;Gokhale and Traulsen, 2010) Hence, in a population of x i-strategists and (Z − x) j strategists, the average payoff of i and j are given by Now, for both two-player and N -player settings, the probability to change the number k of individuals using strategy A by ± one in each time step can be written as (Traulsen et al., 2006) The fixation probability of a single mutant with a strategy i in a population of (N − 1) individuals using j is given by (Traulsen et al., 2006;Nowak et al., 2004) Considering a set {1, ..., q} of different strategies, these fixation probabilities determine a transition matrix ,j =i T ij , of a Markov Chain. The normalised eigenvector associated with the eigenvalue 1 of the transposed of M provides the stationary distribution described above (Imhof et al., 2005), describing the relative time the population spends adopting each of the strategies.
Risk-dominance An important measure to determine the evolutionary dynamic of a given strategy is its risk-dominance against others. For the two strategies i and j, risk-dominance is a criterion which determine which selection direction is more probable: an i mutant is able to fixating in a homogeneous population of agents using j or a j mutant fixating in a homogeneous population of individuals playing i.
In the case, for instance, the first was more probable than the latter then we say that i is risk-dominant against j (Nowak et al., 2004;Sigmund, 2010) which holds for any intensity of selection and in the limit This condition is applicable for both two-player games, N = 2, and when N-player games with N > 2 (Sigmund, 2010;Gokhale and Traulsen, 2010).

Results
We will first describe results for two-player games, then proceeding to provide those for the N -player version.

Analytical Conditions for the Viability of Commitment Proposers
To begin with, using the conditions given in Equation 10, we obtain that if Note that each element in the min expressions above corresponds to the condition for one of the six non-proposing strategies HN, LN, HC, LC, HF, LF, respectively.
In general, these conditions indicate that for commitments to be a viable option for improving coordination, the cost of arrangement must be sufficiently small while the compensation associated with the contract needs to be sufficiently large (see already Figure 2 for numerical validation). Furthermore, for the first condition to hold, it is necessary that b+c > 2 max{a, d}. It means that the total payoff of two players when playing the TD game is always greater when they can coordinate to choose different technologies, than when they both choose the same technology.

Moreover, the conditions in Equation 13
can be expressed in terms of α and the costs and benefits of investment, as follows (see again the payoff matrices in Equation 1) which can be rewritten as This condition indicates under what condition of the market competitiveness and the costs and benefits of investing in available technologies, commitments can be an evolutionarily viable mechanism. Intuitively, for given costs and benefits of investment (i.e. fixing c L , c H , b L , b H ), a larger cost of arranging a (reliable) agreement, , leads to a smaller threshold of α where commitment is viable. Moreover, given a commitment system (i.e. fixing and δ), assuming similar costs of investment for the two technologies, then a larger ratio of the benefits obtained from the two technologies, b H /b L , leads to a smaller upper bound for α for which commitment is viable.
Remarkably, our numerical analysis below (see already Figure 1) shows that the condition in Equation   14 accurately predicts the threshold of α where commitment proposing strategies (i.e. HP and LP) are highly abundant in the population, leading to improvement in terms of the average population payoff compared to when commitment is absent (Figure 3).
On the other hand, when α is sufficiently large, little improvement can be achieved, especially when b H /b L is large (which is in accordance with the analytical results above).

Numerical Results for Pairwise TD game
We calculate the stationary distribution in a population of eight strategies, HP, LP, HN, LN, HC, LC, HF and LF, using methods described above. In Figure 1, we show the frequency of these strategies as a This observation is robust for varying commitment parameters, i.e. the cost of arranging commitment, , and the compensation cost associated with commitment, δ, see Figure 2. Namely, we show the total frequency of commitment strategies (i.e. sum of the frequencies of HP and LP) for varying these parameters and for different values of α. It can be seen that, in general, the commitment strategies dominate the population whenever is sufficiently small and δ is sufficiently large. This observation is in accordance with previous commitment modelling works for the cooperation dilemma games (Han et al., , 2015a. In addition, we observe that in the current coordination problem, that the smaller α is, these commitment strategies dominate the population for wider range of and δ. Our additional results show that these observations are robust with respect to other game configurations. Furthermore, the results show that increasing β only have some effect for large α, where sharp increase in commitment frequency when δ is sufficiently larger. Now, in order to determine whether and when commitments can actually lead to meaningful improvement, in Figure 3, we compare the average population payoff or social welfare when a commitment is present and when it is absent. In general, it can be seen that when α is sufficiently small (below a thresh-  In general, the commitment proposing strategies HP and LP dominate the population when α is small while HN and HC dominate when α is sufficiently large in all cases, which is robust for different values of intensity of selection, β. The HN and HC dominate the population as the market competition decreases (i.e. when α increases). Larger values of β increase the difference between strategies' frequencies but do not change the outcomes in general. Parameters: in all panels cH = 1, cL = 1, bL = 2 (i.e. c = 1), bH = 6 (i.e. b = 5). Other parameters: δ = 6; β = 0.01, 0.1 and 1; population size Z = 100; Fair agreements are used, where θ1 and θ2 are given by θ1 = (b − c − )/2 and θ2 = (b − c + )/2.  Figure 2: Total frequency of commitment strategies (i.e. sum of the frequencies of HP and LP), as a function of and δ, for different values of α and β. Primarily, the commitment proposing strategies dominate the population whenever is sufficiently small and δ is sufficiently large. Furthermore, the smaller α, these commitment strategies dominate for a wider range of and δ, especially when α is smaller. Increasing β only have some effect for large α, where sharp increase in commitment frequency when δ is sufficiently larger. Parameters: in all panels cH = 1, cL = 1, bL = 2 (i.e. c = 1), and bH = 6 (i.e. b = 5). Other parameters: β = 0.01 in the first, β = 0.1 in the second and β = 1 in the third row; population size Z = 100; Fair agreements are used, where θ1 and θ2 are given by θ1 = (b − c − )/2 and θ2 = (b − c + )/2.  Figure 3: Average population payoff as a function of α, when commitment is absent and when it is present, for different values of and β. We observed that when α is small, significant improvement in terms of the average population payoff can be achieved through prior commitment. When α is sufficiently large, commitment leads to on improvement or might even be detrimental for social welfare, especially when β is small. That is, at α = 0.7 in panel a and α = 0.9 in panel d, without commitment will be more beneficial. Parameters: in all panels cH = 1, cL = 1, bL = 2 (i.e. c = 1); in panel a, b and c) bH = 6 (i.e. b = 5) with β = 0.01, 0.1and1 respectively. Also, in panel d,e and f) bH = 3 (i.e. b = 2) with β = 0.01, 0.1and1 respectively; Other parameters: δ = 6; population size Z = 100; Fair agreements are used, where θ1 and θ2 are given by θ1 = (b − c − )/2 and θ2 = (b − c + )/2. old), the smaller it is, the greater improvement of social welfare is achieved through the presence of a commitment deal. Moreover, the smaller the cost of arranging commitments, , the greater improvement is obtained. When α is sufficiently large, commitment leads to on improvement or might even be detrimental for social welfare, especially when b H /b L is large (which is in accordance with the analytical results above). The detriment is further increased when β is small. We can observe that the thresholds for which a notable improvement can be achieved is the same as the one for the viability of HP and LP (i.e. as described in Equation 14).

Payoff Derivation in N -player TD game
As mentioned above, compared to cooperation dilemmas such as PD and PGG, fake strategies make less sense in the context of coordination games since they would not earn the temptation payoff by adopting a different choice from what being agreed. To focus on the group effect and the effect of the newly introduced parameter µ, we will consider a population consisting of HP, LP, HN, LN, HC and LC (i.e. excluding fake strategies). As shown in the two-player game analysis, the fake strategies (i.e. HF and LF) are not viable options in TD games and can be ignored. It is equivalent to consider to the full set of strategies with a sufficiently large δ.
First of all, we derive the payoffs received by each strategy when encountering specific other strategies (see a summary in Table 1). Namely, Π ij (k) and Π ji (k) denote the payoffs of a strategist of type i and j, respectively, in a group consisting of k player of type i and N − k players of type j. The first column of the table lists all possible strategies which can be used by player i (focal player), where as the second column shows strategies of co-players (opponents). The third column shows the payoffs of focal players.

Analytical conditions for the viability of commitment proposers in N-player TD game
We now derive the conditions under which HP is risk-dominant against the rest of strategies. Since we assume fair agreements, the conditions for LP would be equivalent to those for HP in terms of riskdominance.
which can be written as Hence we obtain  where For risk-dominance of HP against HN, which equivalently can be written as which can be rewritten as Further simplification leads to In short, in order for commitment proposers to be risk-dominant against all other strategies, it requires that is sufficiently small, namely, smaller than minimum of the right hand sides of Equations (15) Figure 4: Frequency of the six strategies HP, LP, HN, LN, HC and LC, as a function of in a N-player game with commitment, for different values of µ. In the N-player game, the new parameter µ describes the market demand for a high technology, which was set to 1 in the pairwise game. HP and LP have a high frequency for sufficiently small for µ = 2 in both games, and also when µ = 1 for the first, easy coordinate situation (first row). When µ = 5, i.e. when all players can adopt H without benefit reduction, HC always dominate and commitment strategies are not successful. This means that when there is a need for a diversity of technology adoption, initiating prior commitments to enhance coordination is important. Parameters: in panel a, b and c) bH = 6 (i.e. b = 5) with µ = 1, 2, 5 respectively. Also, in panel d,e and f) bH = 3 (i.e. b = 2) with µ = 1, 2, 5 respectively; Other parameters: N = 5, β = 0.1; α = 0.5; cH = 1, cL = 1, bL = 2 (i.e. c = 1); .

Numerical Results for N-player TD game
We compute stationary distributions in a population of six strategies HP, LP, HN, LN, HC and LC, for the N-player TD game, using the payoffs in Table 1 and the Methods described above. To begin with, in Figure 4 (see also Figure 9 in Appendix), we provide numerical validation for the analytical conditions obtained in the previous section regarding when commitment proposing strategies are evolutionarily viable strategies (being risk-dominant against others). Similar to the pairwise TD game, we observe that there is a threshold for below which it is the case. Moreover, Figure 5 shows that the frequencies of these strategies (HP and LP) decrease for increasing α. They dominate the population whenever is sufficiently small (e.g. = 0.1 and 1). That is, it is more beneficial to engage in a prior commitment deal when the Figure 5: Frequency of the six strategies HP, LP, HN, LN, HC and LC, as a function of α in a multiplayer game with commitment, for different values of and also two different game configurations. In general, the commitment proposing strategies (HP and LP) decrease in frequency for increasing α. They dominate over other strategies for sufficiently small α and . That is, it is more beneficial to engage in a prior commitment deal when the market competition is fierce and the cost of arranging the commitment is very minimal. Parameters: in all panels cH = 1, cL = 1, bL = 2 (i.e. c = 1); in panel a, b and c) bH = 6 (i.e. b = 5) with = 0.1, 1 and 2, respectively. Also, in panel d, e and f: bH = 3 (i.e. b = 2) with = 0.1, 1 and 2 respectively; Other parameters: Figure 6: Total frequency of commitment proposing strategies HP and LP as a function of µ and . In general, the commitment proposing strategies are most successful for intermediate values of µ, especially for a sufficiently small cost of arranging prior commitment . Parameters: in all panels, cH = 1, cL = 1 (i.e. c = 1), bL = 2. In panel a), bH = 6 (i.e. b = 5) and in panel b) bH = 3 (i.e. b = 2). Other parameters: N = 5, β = 0.1; α = 0.5. market competition is harsher (i.e. small α). These results are robust for different intensities of selection (see Figure 10 in Appendix). In general, our results confirm the similar observations regarding the effects of and α on the evolutionary outcomes to obtained in the pairwise game above.
We now focus on understanding the effect of the new parameter in the N-player game, µ, on the evolutionary outcomes. Recall that µ indicates the demand for high technology (H) in the group, describing what is the maximal number of players in the group that can adopt H without reducing their benefit due to competition. Figure 4 shows the effect of different values of µ on the frequency or evolutionary success of all strategies as a function of . When µ is small to intermediate, and the cost of arranging prior commitment is also small, the commitment proposing strategies are dominant. This suggests that arranging prior commitments might be more beneficial in such instances. These results also imply that µ is very essential in determining when commitment should be initiated. Apparently, the greater need for a group mixture or market diversity of technologies, indicating a more difficult coordination situation, the greater need for the utilization of commitment to enhance coordination among players is. This observation is even more evident in Figure 6, where we examine the success of commitment for varying µ and , in regards to two different game configurations. It can be observed that an intermediate value of µ leads to the highest frequency of commitment strategies, especially in the more difficult coordination situation (i.e. the right panel).
We now closely examine the gain in terms of social welfare improvement when using prior commitments.
As shown in Figure 7, whenever µ < N (N = 5), i.e. there is a need to coordinate among the group players  showing when commitment is absent against when it is present. We compare results for different values of β in two game configurations. We observe that whenever µ < 5 (i.e. when there is a need for coordination to avoid competition in the group), arranging a prior commitment is beneficial to the population social welfare. Parameters: in panel a, b and c) bH = 6 (i.e. b = 5), in panel d,e and f) bH = 3 (i.e. b = 2). Other parameters: N = 5, α = 0.5, cH = 1, cL = 1, bL = 2 to avoid competition that induces benefit reduction, prior commitments lead to increase of social welfare.
This increase is more significant in the more difficult coordination situation (i.e. the lower row) and when the cost of arranging commitment is low, which is also slightly more significant for intermediate values of µ and higher values of intensity of selection, β.

Conclusions and Future Work
We have described in this paper novel evolutionary game theory models showing how prior commitments can be adopted as an efficient mechanism for enhancing coordination, in both pairwise and multi-player interactions. For that, we described technology adoption (TD) games where technology investment firms would achieve the best collective outcome if they can coordinate with each other to adopt a mixture of different technologies. To this end, a parameter α was used to capture the competitiveness level of the product market and how beneficial it is to achieve coordination, while another parameter µ to capture the optimal coordination mixture or diversity of technology adopters in a group (in the pairwise case, we assume the optimal mixture is where two firms adopt different technologies to avoid conflict).
In the coordination settings, there are multiple desirable outcomes and players have distinct preferences in terms of which outcome should be agreed upon, thus leading to a larger behavioural space than in the context of cooperation dilemmas (Han et al., , 2015aSasaki et al., 2015;Hasan and Raja, 2013). We have shown that whether commitment is a viable mechanism for promoting the evolution of coordination, strongly depends on α: when α is sufficiently small, prior commitment is highly abundant leading to significant improvement in terms of social welfare (i.e. population avarage payoff), compared to when commitment is absent. Importantly, we have derived the analytical condition for the threshold of α below which the success of commitments is guaranteed, for both pairwise and multi-player TD games.
Furthermore, moving from pairwise to a multi-player setting, it was shown that µ plays an important role for the success of commitment strategies as well. In general, when µ is intermediate, equivalent to a high level of diversity in group choices, arranging prior commitments proved to be highly important. It led to significant improvement in terms of social welfare, especially in a harsher coordination situation.
In the main text, we have considered that a fair agreement is arranged. In the Appendix (Figure 8), we have shown that whenever commitment proposers are allowed to freely choose which deal to propose to their co-players, our results show that, in a highly competitive market (i.e. small α), commitment proposers should be strict (i.e. sharing less benefits), while when the market is less competitive, commitment proposers should be more generous.
In short, our analysis has demonstrated that commitment is a viable tool for promoting the evolution of diverse collective behaviours among self-interested individuals, beyond the context of cooperation dilemmas where there is only one desirable collective outcome (Nesse, 2001;Skyrms, 1996;Barrett et al., 2007). In future work, we will consider how commitments can solve more complex collective problems, e.g. in a technological innovation race , bargaining games (Zisis et al., 2015;Rand et al., 2013), climate change actions (Barrett et al., 2007) and cross-sector coordination (Santos et al., 2016), where there might be a large number of desirable outcomes or equilibriums, especially when the number of players in an interaction increases (Duong and Han, 2015;Gokhale and Traulsen, 2010;Han et al., 2012 In the main text, we assume that a fair agreement is always arranged. We consider here what would happen if HP and LP can personalise the commitment deal they want to propose, i.e. any θ 1 and θ 2 can be proposed (instead of always being fair). Namely, Figure 8 shows the average population payoff varying these parameters, for different values of α. We observe that when α is small, the highest average payoff is achieved when θ 1 is sufficiently small and θ 2 is sufficiently large, while for large α, it is reverse for the two parameters. That is, in a highly competitive market (i.e. small α), commitment proposers should be strict (HP keeps sufficient benefit while LP requests sufficient payment, from their commitment partners), while when the market is less competitive (i.e. large α), commitment proposers should be more generous (HP proposes to give a larger benefit while LP requests a smaller payment, from their commitment partners).
Our results confirm that this observation is robust for different values of , δ and β.   When α is small (panels a and b), the highest average payoff is achieved when θ1 is sufficiently small and θ2 is sufficiently large, while for large α (panel c), it is the case when θ1 is sufficiently large and θ2 is sufficiently small. Figure 4 also shows that for a small value of β, the highest average payoff is achieved when α is very minimal compared to other panels with higher value of β (compare panel a, d and g). Parameters: in all panels cH = 1, cL = 1, bL = 2 (i.e. c = 1), and bH = 6 (i.e. b = 5). Other parameters: δ = 4, = 1; β = 0.01, 0.1 and 1; population size Z = 100. b =5, c =1, µ =2, ? =0.5   Figure 10: Frequency of six strategies HP, LP, HN, LN, HC and LC, as a function of α and for different values of and β. The commitment proposing strategies HP and LP dominate the population when the values of α and are sufficiently small, in all cases of β. Furthermore, as the value of increases, the nonproposing strategies dominate the population. Parameters: in all panels cH = 1, cL = 1, bL = 2 (i.e. c = 1), bH = 6 (i.e. b = 5); Other parameters: = 0.1, 1, 2, β = 0.01, 0.1, 1