Class of games in game theory From Wikipedia, the free encyclopedia
Congestion games (CG) are a class of games in game theory. They represent situations which commonly occur in roads, communication networks, oligopoly markets and natural habitats. There is a set of resources (e.g. roads or communication links); there are several players who need resources (e.g. drivers or network users); each player chooses a subset of these resources (e.g. a path in the network); the delay in each resource is determined by the number of players choosing a subset that contains this resource. The cost of each player is the sum of delays among all resources he chooses. Naturally, each player wants to minimize his own delay; however, each player's choices impose a negative externality on the other players, which may lead to inefficient outcomes.
The research of congestion games was initiated by the American economist Robert W. Rosenthal in 1973.[1] He proved that every congestion game has a Nash equilibrium in pure strategies (aka pure Nash equilibrium, PNE). During the proof, he in fact proved that every congestion game is an exact potential game. Later, Monderer and Shapley[2] proved a converse result: any game with an exact potential function is equivalent to some congestion game. Later research focused on questions such as:
Does the existence of equilibrium, as well as the existence of a potential function, extend to more general models of congestion games?
Consider a traffic net where two players originate at point and need to get to point . Suppose that node is connected to node via two paths: -- and --, where is a little closer than (i.e. is more likely to be chosen by each player), as in the picture at the right.
The roads from both connection points to get easily congested, meaning the more players pass through a point, the greater the delay of each player becomes, so having both players go through the same connection point causes extra delay. Formally, the delay in each of and when players go there is .
A good outcome in this game will be for the two players to "coordinate" and pass through different connection points. Can such an outcome be achieved?
The following matrix expresses the costs of the players in terms of delays depending on their choices:
More information p2p1, OAT ...
Cost Matrix
p2
p1
OAT
OBT
OAT
(5,5)
(2,3)
OBT
(3,2)
(6,6)
Close
The pure Nash equilibria in this game are (OAT,OBT) and (OBT,OAT): any unilateral change by one of the players increases the cost of this player (note that the values in the table are costs, so players prefer them to be smaller). In this example, the Nash equilibrium is efficient - the players choose different lanes and the sum of costs is minimal.
In contrast, suppose the delay in each of and when players go there is . Then the cost matrix is:
More information p2p1, OAT ...
Cost Matrix
p2
p1
OAT
OBT
OAT
(2.6,2.6)
(1.8,2.8)
OBT
(2.8,1.8)
(3.6,3.6)
Close
Now, the only pure Nash equilibrium is (OAT,OAT): any player switching to OBT increases his cost from 2.6 to 2.8. An equilibrium still exists, but it is not effiicent: the sum of costs is 5.2, while the sum of cost in (OAT,OBT) and (OBT,OAT) is 4.6.
Notation
The basic definition of a CG has the following components.
A base set of congestible elements (also called resources or factors). In the above example, is the set of roads (, , and ).
A set of players. In the above example .
A finite set of strategies for each player, where each strategy is a subset of .
In the above example, both players have the same set of strategies: . CGs in which all players have the same set of strategies are called symmetric CGs. In general, different players may have different sets, for example, if each player has a different source and/or a different target. Such CGs are called asymmetricCGs.
In general, a strategy can be any subset of . CGs in which a strategy can only be a path in a given graph (as in the above example) are called network CGs. CGs in which a strategy can only be a single resource are called singleton CGs.
For each element and a vector of strategies , the load is defined as .
For each element there is a delay function (also called latency function or cost function). Given a vector of strategies, the delay on is . Each is assumed to be positive and monotone increasing.
Given a strategy , player experiences delay ; each player wants to minimize his delay.
A Nash equilibrium is a vector of strategies such that, for each player , replacing with a different strategy would not decrease the delay experienced by .
Existence of Nash equilibria
Every CG has a Nash equilibrium in pure strategies. This can be shown by constructing a potential function that assigns a value to each outcome.[1] Moreover, this construction will also show that iterated best response finds a Nash equilibrium. Define . Note that this function is not the social welfare , but rather a discrete integral of sorts. The critical property of a potential function for a congestion game is that if one player switches strategy, the change in his delay is equal to the change in the potential function.
Consider the case when player switches from to . Elements that are in both of the strategies
remain unaffected, elements that the player leaves (i.e. ) decrease the potential by , and the elements the player joins (i.e. ) increase the potential by . This change in potential is precisely the change in delay for player , so is in fact a potential function.
Now observe that any minimum of is a pure Nash equilibrium. Fixing all but one player, any improvement in strategy by that player corresponds to decreasing , which cannot happen at a minimum. Now since there are a finite number of configurations and each is monotone, there exists an equilibrium.
The existence of a potential function has an additional implication, called the finite improvement property (FIP). If we start with any strategy-vector, pick a player arbitrarily, and let him change his strategy to a better strategy for him, and repeat, then the sequence of improvements must be finite (that is, the sequence will not cycle). This is because each such improvement strictly increases the potential.
Extensions
Below we present various extensions and variations on the basic CG model.
A nonatomic (aka continuous) CG is the limit of a standard CG with n players, as . There is a continuum of players, the players are considered "infinitesimally small", and each individual player has a negligible effect on the congestion. Nonatomic CGs were studied by Milchtaich,[3] Friedman[4] and Blonsky.[5][6]
We keep a finite set of congestible elements.
Instead of recognizing players, as in the discrete case, we have types of players, where each type is associated with a number , representing the rate of traffic for that type.
Each agent in type i picks a strategy from the strategy set .
As before, the delay functions are monotone and positive, but we now add the assumption that they are continuous as well.
We allow players in a type to distribute fractionally over their strategy set. That is, for every strategy , let denote the fraction of players in type using strategy . By definition, .
For each element , the load is defined as the sum of fractions of players using e, that is, .
Existence of equilibria in nonatomic CGs
Strategies are now collections of strategy profiles . For a strategy set of size , the collection of all valid profiles is a compact subset of . We now define the potential function as , replacing the discrete integral with the standard one.
As a function of the strategy, is continuous: is continuous by assumption, and is a continuous function of the strategy. Then by the extreme value theorem, attains its global minimum.
The final step is to show that a minimum of is indeed a Nash equilibrium. Assume for contradiction that there exists a collection of that minimize but are not a Nash equilibrium. Then for some type , there exists some improvement over the current choice . That is, . The idea now is to take a small amount of players using strategy and move them to strategy . Now for any , we have increased its load by , so its term in is now . Differentiating the integral, this change is approximately , with error . The equivalent analysis of the change holds when we look at edges in .
Therefore, the change in potential is approximately , which is less than zero. This is a contradiction, as then was not minimized. Therefore, a minimum of must be a Nash equilibrium.
In a splittable CG, as in an atomic CG, there are finitely many players, each of whom has a certain load to transfer. As in nonatomic CGs, each player can split his load into fractional loads going through different paths, like a transportation company choosing a set of paths for mass transportation. In contrast to nonatomic CGs, each player has a non-negligible effect on the congestion.
Splittable CGs were first analyzed by Ariel Orda, Raphael Rom and Nachum Shimkin in 1993, in the context of communication networks.[7][8] They show that, for a simple network with two nodes and multiple parallel links, the Nash equilibrium is unique under reasonable convexity conditions, and has some interesting monotonicity properties. For general network topologies, more complex conditions are required to guarantee the uniqueness of Nash equilibrium.
In a weightedCG, different players may have different effects on the congestion. For example, in a road network, a truck adds congestion much more than a motorcycle. In general, the weight of a player may depend on the resource (resource-specific weights): for every player i and resource e, there is weight , and the load on the resource e is . An important special case is when the weight depends only on the player (resource-independent weights), that is, each player i has a weight , and .
Weighted singleton CGs with resource-independent weights
Milchtaich[9] considered the special case of weighted CGs in which each strategy is a single resource ("singleton CG"), the weights are resource-independent, and all players have the same strategy set. The following is proved:
If all players have the same delay functions, then the game has the finite-improvement property (and thus has a PNE).
If there are only two strategies (and arbitrarily many players with possibly different delay functions), then the game has the finite-improvement property (and thus has a PNE).
If there are only two players (with possibly different delay functions), then the game has the finite-best-response property (and thus has a PNE).
If there are three or more strategies and three or more players with different delay functions, a PNE might not exist.
Weighted network CGs
Milchtaich considered the special case of weighted CGs in which each strategy is a path in a given undirected graph ("network CG"). He proved that every finite game can be represented as a weighted network congestion game, with nondecreasing (but not necessarily negative) cost-functions.[10] This implies that not every such game has a PNE. Concrete examples of weighted CGs without PNE are given by Libman and Orda,[11] as well as Goemans Mirrokni and Vetta.[12] This raises the question of what conditions guarantee the existence of PNE.[13]
In particular, we say that a certain graph G guarantees a certain property if every weighted network CG in which the underlying network is G has that property. Milchtaich[14] characterized networks that guarantee the existence of PNE, as well as the finite-improvement property, with the additional condition that a player with a lower weight has weakly more allowed strategies (formally, implies ). He proved that:
A graph G guarantees the finite-improvement property iff G is homeomorphic to either a parallel network (a graph made of one or more single-edge networks connected in parallel), or to a parallel network connected in series with one or two single-edge networks.:Thm.2
A graph G guarantees the existence of a PNE iff G is homeomorphic to a connection in series of one or more networks from a set of six "allowed networks"; an equivalent condition is that no network from a set of six "forbidden network" is embedded in G.:Thm.3
In the special case in which every player is allowed to use any strategy ("public edges"), there are more networks that guarantee the existence of PNE; a complete characterization of such networks is posed as an open problem.[14]
Mlichtaich[15] analyzes the effect of network topology on the efficiency of PNE:
A graph G guarantees that every PNE is Pareto-efficient, iff three simple "forbidden networks" are not embedded in G.
Milchtaich[16] analyzes the effect of network topology on the uniqueness of the PNE costs:
A graph G guarantees that the PNE costs are unique iff G is a connection in series of one or more networks of several simple kinds.
A graph G does not guarantee that PNE costs are unique iff G contains an embedded network of a particular simple type.
Holzman and Law-Yone[17] also characterize the networks that guarantee that every atomic CG has a strong PNE, a unique PNE, or a Pareto-efficient PNE.
Richman and Shimkin[18] characterize the networks that guarantee that every splittable CG has a unique PNE.
General weighted CGs
We say that a class C of functions guarantees a certain property if every weighted CG in which all delay functions are elements of C has that property.
Fotakis, Kontogiannis and Spirakis[19] prove that the class of linear functions guarantees the existence of an exact potential, and hence the existence of PNE.
Panagopoulou and Spirakis[20] prove that the class of exponential functions guarantees the existence of a weighted potential, and hence the existence of PNE.
Harks, Klimm and Mohring[21] prove that a class of functions guarantees the existence of an exact potential, if and only if it contains only affine functions. This characterization remain valid when restricted to two-player games, three-resource games, singleton games, games with symmetric strategies, or games with integral weights. Moreover, a class of functions guarantees the existence of a weighted potential, if and only if either (1) it contains only affine functions, or (2) it contains only exponential functions of the form , where is the same for all resources. This characterization remain valid when restricted to four-player games, four-resource games, singleton games, games with symmetric strategies, or games with integral weights. For two-player games, a class of functions guarantees the existence of a weighted potential, if and only if all functions in it are of the form , where is a monotone function (the same for all resources).
Harks and Klimm[22] prove a similar result for the existence of PNE: they prove that a class of functions guarantees the existence of PNE if and only if either (1) it contains only affine functions, or (2) it contains only exponential functions of the form , where is the same for all resources. This characterization remain valid when restricted to three-player games. For two-player games, a class of functions guarantees the existence of PNE if and only if all functions in it are of the form , where is a monotone function (the same for all resources).
Other results
There are many other papers about weighted congestion games.[23][24][25]
The basic CG model can be extended by allowing the delay function of each resource to depend on the player. So for each resource e and player i, there is a delay function . Given a strategy , player experiences delay .
Player-specific costs in singleton CGs (crowding games)
Milchtaich[9] introduced and studied CGs with player-specific costs in the following special case:
Each player chooses a single resource (such games are called singleton CGs);
All players have the same set of strategies.
This special case of CG is also called a crowding game.[26][27] It represents a setting in which several people simultaneously choose a place to go to (e.g. a room, a settlement, a restaurant), and their payoff is determined both by the place and by the number of other players choosing the same place.
In a crowding game, given a strategy , player experiences delay . If the player switches to a different strategy , his delay would be ; hence, a strategy vector is a PNE iff for every player i, for all e,f.
In general, CGs with player-specific delays might not admit a potential function. For example, suppose there are three resources x,y,z and two players A and B with the following delay functions:
The following is a cyclic improvement path: . This shows that the finite-improvement property does not hold, so the game cannot have a potential function (not even a generalized-ordinal-potential function). However:
With only two resources, the finite improvement property holds.[9]:Thm.1 Hence, a PNE exists.
With only two players, every finite best-response property holds. Hence, a PNE exists.
When there are three or more players, even best-response paths might be cyclic. However, every CG still has a PNE.[9]:Thm.2 The proof is constructive and shows an algorithm that finds a Nash equilibrium in at most steps. Moreover, every CG is weakly acyclic: for any initial strategy-vector, at least one best-response path starting at this vector has a length of at most , which terminates at an equilibrium.[9]:Thm.3
Every crowding game is sequentially solvable.[26] This means that, for every ordering of the players, the sequential game in which each player in turn picks a strategy has a subgame-perfect equilibrium in which the players' actions are a PNE in the original simultaneous game. Every crowding game has at least one strong PNE;[28] every strong PNE of a crowding game can be attained as a subgame-perfect equilibrium of a sequential version of the game.[26]
In general, a crowding game might have many different PNE. For example, suppose there are n players and n resources, and the negative effect of congestion on the payoff is much higher than the positive value of the resources. Then there are n! different PNEs: every one-to-one matching of players to resources is a PNE, as no player would move to a resource occupied by another player. However, if a crowding game is replicated m times, then the set of PNEs converges to a single point as m goes to infinity. Moreover, in a "large" (nonatomic) crowding game, there is generically a unique PNE. This PNE has an interesting graph-theoretic property. Let G be a bipartite graph with players on one side and resources on the other side, where each player is adjacent to all the resources that his copies choose in the unique PNE. Then G contains no cycles.[27]
Separable cost functions
A special case of the player-specific delay functions is that the delay functions can be separated into a player-specific factor and a general factor. There are two sub-cases:
Multiplicatively-separablecost functions: , where is a constant that represents the base cost of resource e to player i, and d is a general delay function (the same for all resources).
Additively-separablecost functions:[29], where is a constant that represents the fixed cost of resource e to player i, and d is a general delay function (the same for all resources).
When only pure-strategies are considered, these two notions are equivalent, since the logarithm of a product is a sum. Moreover, when players may have resource-specific weights, the setting with resource-specific delay functions can be reduced to the setting with a universal delay function. Games with separable cost functions occur in load-balancing,[30]M/M/1 queueing,[31] and habitat selection.[32] The following is known about weighted singleton CGs with separable costs:[33]
If the base costs are player-independent ( for every player i), then the CG has the FIP, hence it has a PNE. The same holds if the base costs are resource-independent ( for every resource e).[30][34] The proof is based on a vector-valued potential function. For each state of the game, the potential is a vector of size n containing the costs of all players, sorted from large to small. Whenever a player deviates to a resource with a smaller cost for him, the vector of costs becomes smaller in the leximin order.
If the weights are player-independent (equivalently: the CG is unweighted and the delay-functions are resource-specific), then it has the FIP, hence it has a PNE.[35][29] If the cost-functions are additively-separable, then the game even has an exact potential function. The result holds even if the cost functions are not monotonically-increasing with the load. If the cost-functions are not additively-separable, then FIP may not hold, and there may be no potential function, but a PNE still exists.[9]:Thm.2
If the weights are resource-independent, then a PNE exists in the following cases:
When there are at most three players, a PNE exists,[36]:Cor.3 though the best-response improvement property might not hold. In contrast, there is a CG with separable costs and resource-independent weights with eight players in which no PNE exists.[33]:Thm.3
When cost functions are additively-separable with linear variable-cost functions, the CG has a weighted potential, hence it has the FIP, hence it has a PNE.[36]:Thm.6
When cost functions are additively-separable with logarithmic variable-cost function, and there are at most three players, the CG has the best-response improvement property, hence it has a PNE. However, it might not have the finite-improvement property.[37] For more than three players, the existence of PNE is open.
Every weighted singleton CG with separable player-specific preferences is isomorphic to a weighted network CG with player-independent preference.[33][2]
Network CGs with player-specific costs
Milchtaich considered the special case of CGs with player-specific costs, in which each strategy is a path in a given graph ("network CG"). He proved that every finite game can be represented as an (unweighted) network congestion game with player-specific costs, with nondecreasing (but not necessarily negative) cost-functions.[10] A complete characterization of networks that guarantee the existence of PNE in such CGs is posed as an open problem.[14]
Computing an equilibrium in unweighted CGs
The proof of existence of PNE is constructive: it shows a finite algorithm (an improvement path) that always finds a PNE. This raises the question of how many steps are required to find this PNE? Fabrikant, Papadimitriou and Talwar[38] proved:
If all strategies are paths in a network ("network CG"), and all players have the same set of strategies ("symmetric CG"), then a PNE can be computed in polynomial time by maximizing the potential, through reduction to min-cost flow. The algorithm can be adapted to nonatomic CGs: under certain smoothness assumptions, a Nash equilibirum in such a game can be approximated in strongly-polynomial time.
If the strategies can be general subsets, or the players may have different sets of strategies ("asymmetric CG"), then computing a PNE is PLS-complete. This implies that there are examples with exponentially-long improvement paths. It also implies that finding a Nash equilibrium reachable from a specified state is PSPACE-complete.
Every problem in the class PLS can be presented as a game whose pure equilibria are guaraneed to exist by a potential-function argument.
Even-Dar, Kesselman and Mansour[30] analyze the number of steps required for convergence to equilibrium in a load-balancing setting.
Caragiannis, Fanelli, Gravin and Skopalik[39] present an algorithm that computes a constant-factor approximation PNE. In particular:
With linear delay functions, the approximation ratio is 2+ε, and the runtime is polynomial in the number of playres, the number of resources, and 1/ε.
When delay functions are degree-d polynomials, the approximation ratio is dO(d).
Their algorithm identifies a short sequence of best-response moves, that leads to an approximate equilibrium. They also show that, for more general CGs, attaining any polynomial approximation of PNE is PLS-complete.
Computing an equilibrium in weighted network CGs
Fotakis, Kontogiannis and Spirakis[19] present an algorithm that, in any weighted network CG with linear delay functions, finds a PNE in pseudo-polynomial time (polynomial in the number of players n and the sum of players' weights W). Their algorithm is a greedybest-response algorithm: players enter the game in descending order of their weight, and choose a best-response to existing players' strategies.
Panagopoulou and Spirakis[20] show empirical evidence that the algorithm of Fotakis, Kontogiannis and Spirakis in fact runs in time polynomial in n and log W. They also propose an initial strategy-vector that dramatically speeds this algorithm.
In general, a weighted network CG may not have a PNE. Milchtaich[14] proves that deciding whether a given weighted network CG has a PNE is NP-hard even in the following cases:
There are two players; all players are allowed to use all paths; all cost-functions are nonnegative.
There are two players; the CG is unweighted; the costs are player-specific and nonnegative.
The proof is by reduction from the directed edge-disjoint paths problem.[40]
Caragiannis, Fanelli, Gravin and Skopalik[41] present an algorithm that computes a constant-factor approximation PNE in weighted CGs. In particular:
With linear delay functions, the approximation ratio is , and the runtime is polynomial in the number of playres, the number of resources, and 1/ε.
When delay functions are degree-d polynomials, the approximation ratio is .
To prove their results, they show that, although weighted CGs may not have a potential function, every weighted CG can be approximated by a certain potential game. This lets them show that every weighted CG has a (d!)-approximate PNE. Their algorithm identifies a short sequence of best-response moves, that leads to such an approximate PNE.
In summary, CGs can be classified according to various parameters:
Number and splittability of players: atomicCG, splittable CG or nonatomic CG;
Weight of players: unweightedCG or weightedCG (with resource-independent weights or resource-specific weights);
Cost functions for different players using the same resource: identical or player-specific (with separable or nonseparable cost-functions).
Possible strategies: one resource (singletonCG) or path in a network (network CG) or any subset (general CG).
Strategy sets of different players: different (asymmetric CG) or identical (symmetric CG).
Since every CG has a Nash equilibrium, the next natural topic is to analyze their quality. This is done using the concept of Price of anarchy in congestion games.
ּResource allocation games[42][31] are somewhat related to congestion games.
Coalitions: Fotakis, Kontogiannis and Spirakis[43] study CGs in which players participate in coalitions.
Congestion games in nature: Milinsky[44] describes an experiment in which a natural CG converges into a Nash equilibrium. In his experiment, he fed six sticklebacks from two ends of a tank. The fish distribution between the two ends was, on average, similar to the ratio of the food supply rates, so that no individual fish could increase his feeding rate by moving to the other side. Mlichtaich[3] presents a more general treatment of CGs in interspecific competition.
Rosenthal, Robert W. (1973), "A class of games possessing pure-strategy Nash equilibria", International Journal of Game Theory, 2: 65–67, doi:10.1007/BF01737559, MR0319584, S2CID121904640.
Even-Dar, Eyal; Kesselman, Alex; Mansour, Yishay (2003). "Convergence Time to Nash Equilibria". In Baeten, Jos C. M.; Lenstra, Jan Karel; Parrow, Joachim; Woeginger, Gerhard J. (eds.). Automata, Languages and Programming. Lecture Notes in Computer Science. Vol.2719. Berlin, Heidelberg: Springer. pp.502–513. doi:10.1007/3-540-45061-0_41. ISBN978-3-540-45061-0.
Kukushkin, N. S.; Men'Shikov, I. S.; Men'Shikova, O. R.; Morozov, V. V. (1990). "Resource allocation games". Computational Mathematics and Modeling. 1 (4): 433. doi:10.1007/BF01128293. S2CID120639586.