五黄煞是什么意思| 尿黄尿味大难闻是什么原因| bp是什么职位| 2031年是什么年| 什么大什么小| 1月25日什么星座| 95属什么生肖| 中国最长的河是什么河| 我们为什么会笑| 背疼是什么原因引起的女人| 簇新是什么意思| 为什么都说头胎很重要| 遇人不淑是什么意思| 为什么不能空腹喝牛奶| 检查颈椎挂什么科| 杀阴虱用什么药最好| 三白眼是什么意思| 什么的流| 鱼子酱是什么鱼| 什么电视剧好看| 尿道口红肿用什么药| 胰岛素抵抗有什么症状| 石榴花是什么季节开的| 掉头发是什么原因女性| 攻心翻是什么病| 什么是爱豆| 眉毛里有痣代表什么| mds医学上是什么意思| 81年五行属什么| 吃什么最减肥| 嗅觉失灵是什么原因| 腿走路没劲发软是什么原因| 阿斯伯格综合征是什么| 吉人自有天相什么意思| 白袜子是什么意思| 中国的国酒是什么| 宫颈锥切后需要注意什么| 梦见黑色的蛇是什么意思| 肾功能不全是指什么| 先锋霉素又叫什么| lamer是什么牌子| 大肠湿热吃什么中成药| 腔隙性脑梗吃什么药| 剪什么样的发型好看| 阴茎不硬吃什么药| 对应是什么意思| 连续做噩梦是什么原因| 梅长苏结局是什么| 衣原体是什么| 心脏传导阻滞是什么意思| 有且仅有什么意思| 胆结石可以吃什么水果| negative是什么意思| vos是什么意思| 患难见真情是什么意思| 去医院看嘴唇挂什么科| 疱疹性咽峡炎吃什么药| 火气重喝什么茶| 收到是什么意思| 为什么微信附近的人看不到我| 电导率是什么意思| 唐宋元明清前面是什么| 下聘是什么意思| preparing是什么意思| 什么行业赚钱| 多囊为什么要跳绳而不是跑步| 卒中中心是干什么的| 为什么瘦不下来| 虎牙长什么样子| 临期是什么意思| 阴囊炎用什么药治疗| 风对什么| 纤支镜检查是用来查什么的| 一什么场面| 和珅属什么生肖| 高血糖吃什么比较好| 餐后胆囊是什么意思| 男性雄激素低吃什么药| 股骨头坏死挂什么科| 太阳穴痛什么原因| 什么叫玄学| 疏肝理气喝什么茶| 08是什么生肖| 3.1415926是什么意思| 手指麻木是什么原因| 10月13号是什么星座| 大耗是什么意思| 一个山一个空念什么| 梦见涨洪水是什么兆头| 拿什么让你幸福| funfair是什么意思| 梦到点火是什么意思| 诺贝尔为什么没有数学奖| 八月十四是什么星座| 什么时候测血压最准确| 左腹下方隐痛什么原因| 炸酱面用什么酱| rst是什么意思| 月食是什么意思| 身份证复印件是什么| 石女是什么| 大爱是什么意思| 反清复明是什么意思| 人为什么要死| 中水是什么水| 粟米是什么米| 煮牛肉放什么容易烂| 小五行属性是什么| 梦见蛇蛋是什么意思啊| 2005年属鸡是什么命| 有偿服务是什么意思| 麦冬长什么样| 为什么现在不吃糖丸了| 牙齿变黑是什么原因| 无可奈何的笑是什么笑| 身上长小红点是什么原因| 莱特兄弟发明了什么| 瘴气是什么意思| 什么是集体户口| 宝宝嘴巴臭臭的是什么原因| 血压偏高是什么原因| 什么东西养胃| 无回声结节是什么意思| hcc是什么意思| 三七粉什么颜色| 感冒是什么症状| edp是什么意思| 焖子是什么做的| 剂量是什么意思| 多囊性改变是什么意思| dragon是什么意思| 木克什么| 微信头像 用什么好| 怀孕喝什么牛奶好| pr是什么| 什么病不能吃空心菜| 红红的枫叶像什么| 气血亏虚吃什么中成药| 梦见大领导有什么预兆| 社保缴费基数和工资有什么关系| 惊风是什么意思| 葛根和粉葛有什么区别| 12月21是什么星座| 为什么不建议小孩打流感疫苗| 梦到打死蛇是什么意思| 什么水果含铁量最高| 什么是数位| 痛风打什么针| 什么叫牙齿根管治疗| 阿莫西林有什么副作用| 长方形纸可以折什么| 髋关节弹响是什么原因| 甲状腺结节有什么症状表现| 自由基是什么意思| 咳嗽是什么原因| 为什么会有颈纹| bj是什么意思| 风土人情是什么意思| 夫妻分床睡意味着什么| 木薯淀粉可以做什么| 喝红茶对身体有什么好处| 出现的反义词是什么| 喝荷叶茶有什么好处和坏处| 五月23是什么星座| tdp是什么| 香砂是什么| 右眉毛跳是什么预兆| 流汗太多对身体有什么危害| 乳房是什么| 纳豆激酶有什么作用| 鲱鱼罐头为什么这么臭| 皮炎用什么药膏| 手发抖是什么病| 4朵玫瑰代表什么意思| 茯苓有什么功效和作用| 手发麻什么原因| 心率高是什么原因| 伤口发炎用什么药| 分诊是什么意思| 芈月是秦始皇的什么人| 指甲竖条纹是什么原因| 法益是什么意思| 拉肚子什么原因| 嫂夫人什么意思| 梦见鬼是什么预兆| 面色发黄是什么原因| 拍身份证照片穿什么衣服| 黄加黑变成什么颜色| 验血肝功能看什么指标| winner是什么意思| 肾低密度灶是什么意思| 孕妇梦见血是什么预兆| 什么的东西| 天气一热身上就痒是什么原因| 判处死刑缓期二年执行是什么意思| 诺贝尔奖是什么意思| 手容易出汗是什么原因| 杭州都有什么区| 中出什么意思| 煲什么汤含蛋白质高| 什么病会引起牙疼| 西瓜可以做成什么美食| 祛痣后应注意什么| 阴茎不硬吃什么药| 想入非非什么意思| 数不胜数是什么生肖| 学五行属什么| 黄瓜敷脸有什么效果| 的意思是什么| 家去掉一点念什么| 头发为什么会变白| 小螃蟹吃什么食物| 眼睛红肿吃什么消炎药| 投行是做什么的| 工作单位是什么意思| 花椒有什么功效与作用| 卖淫什么意思| 梦见小白兔是什么意思| 芊芊学子是什么意思| 做梦房子倒塌什么预兆| grader是什么意思| 9527是什么梗| 红色连衣裙配什么鞋子好看| 靶向药是什么药| 地包天是什么意思| 总胆红素偏高吃什么药| 左脸颊有痣代表什么| 女流之辈是什么意思| 面首是什么| 医保定点是什么意思| 阴毛有什么作用| 军绿色是什么颜色| 西洋参吃了有什么好处| 什么是试管婴儿| 不含而立是什么意思| 什么货币最值钱| 甲鱼补什么| 生姜泡水喝有什么好处| 嘴唇有黑斑是什么原因| 检查心脏挂什么科| 看阴茎挂什么科| 煮方便面什么时候放鸡蛋| 二便是什么意思| 生灵涂炭是什么意思| 八七年属什么的| 安吉白茶属于什么茶| 眉毛上长痣代表什么| 闭口是什么样子| 什么是肾炎| 哀鸿遍野什么意思| 战略纵深是什么意思| 黑发晶五行属什么| 头疼头胀是什么原因| 今年三十属什么| 什么牌子奶粉最好| 车厘子什么季节成熟| 鹿参膏有什么作用和功效| 小腿怕冷是什么原因| 解解乏是什么意思| cock什么意思| 吃什么可以增强记忆力| 蚂蚁代表什么风水| 吃了小龙虾不能吃什么| u型压迹是什么意思| 百度Jump to content

宪法必须体现党和人民事业的历史进步

From Wikipedia, the free encyclopedia
(Redirected from Mixed strategy)
百度 之所以作此说明,建议将授权决定期限延长两年,是因为2015年12月27日全国人大常委会通过的《关于授权国务院在实施股票发行注册制改革中调整适用有关规定的决定》将于2018年2月28日到期。

In game theory, a move, action, or play is any one of the options which a player can choose in a setting where the optimal outcome depends not only on their own actions but on the actions of others.[1] The discipline mainly concerns the action of a player in a game affecting the behavior or actions of other players. Some examples of "games" include chess, bridge, poker, monopoly, diplomacy or battleship.[2]

The term strategy is typically used to mean a complete algorithm for playing a game, telling a player what to do for every possible situation. A player's strategy determines the action the player will take at any stage of the game. However, the idea of a strategy is often confused or conflated with that of a move or action, because of the correspondence between moves and pure strategies in most games: for any move X, "always play move X" is an example of a valid strategy, and as a result every move can also be considered to be a strategy. Other authors treat strategies as being a different type of thing from actions, and therefore distinct.

It is helpful to think about a "strategy" as a list of directions, and a "move" as a single turn on the list of directions itself. This strategy is based on the payoff or outcome of each action. The goal of each agent is to consider their payoff based on a competitors action. For example, competitor A can assume competitor B enters the market. From there, Competitor A compares the payoffs they receive by entering and not entering. The next step is to assume Competitor B does not enter and then consider which payoff is better based on if Competitor A chooses to enter or not enter. This technique can identify dominant strategies where a player can identify an action that they can take no matter what the competitor does to try to maximize the payoff.

A strategy profile (sometimes called a strategy combination) is a set of strategies for all players which fully specifies all actions in a game. A strategy profile must include one and only one strategy for every player.

Strategy set

[edit]

A player's strategy set defines what strategies are available for them to play.

A player has a finite strategy set if they have a number of discrete strategies available to them. For instance, a game of rock paper scissors comprises a single move by each player—and each player's move is made without knowledge of the other's, not as a response—so each player has the finite strategy set {rock paper scissors}.

A strategy set is infinite otherwise. For instance the cake cutting game has a bounded continuum of strategies in the strategy set {Cut anywhere between zero percent and 100 percent of the cake}.

In a dynamic game, games that are played over a series of time, the strategy set consists of the possible rules a player could give to a robot or agent on how to play the game. For instance, in the ultimatum game, the strategy set for the second player would consist of every possible rule for which offers to accept and which to reject.

In a Bayesian game, or games in which players have incomplete information about one another, the strategy set is similar to that in a dynamic game. It consists of rules for what action to take for any possible private information.

Choosing a strategy set

[edit]

In applied game theory, the definition of the strategy sets is an important part of the art of making a game simultaneously solvable and meaningful. The game theorist can use knowledge of the overall problem, that is the friction between two or more players, to limit the strategy spaces, and ease the solution.

For instance, strictly speaking in the Ultimatum game a player can have strategies such as: Reject offers of ($1, $3, $5, ..., $19), accept offers of ($0, $2, $4, ..., $20). Including all such strategies makes for a very large strategy space and a somewhat difficult problem. A game theorist might instead believe they can limit the strategy set to: {Reject any offer ≤ x, accept any offer > x; for x in ($0, $1, $2, ..., $20)}.

Pure and mixed strategies

[edit]

A pure strategy provides a complete and deterministic plan for how a player will act in every possible situation in a game. It specifies exactly what action the player will take at each decision point, given any information they may have. A player's strategy set consists of all the pure strategies available to them.

A mixed strategy is a probability distribution over the set of pure strategies. Rather than committing to a single course of action, the player randomizes among pure strategies according to specified probabilities. Mixed strategies are particularly useful in games where no pure strategy constitutes a best response, allowing players to avoid being predictable. Since the outcomes depend on probabilities, we refer to the resulting payoffs as expected payoffs.

A pure strategy can be viewed as a special case of a mixed strategy—one in which a single pure strategy is chosen with probability 1, and all others with probability 0.

A totally mixed strategy is a mixed strategy in which every pure strategy in the player's strategy set is assigned a strictly positive probability—that is, no pure strategy is excluded or played with zero probability. This means the player randomizes across all of their options, never fully ruling any one out. Totally mixed strategies are important in some advanced game theory concepts like trembling hand perfect equilibrium, where the idea is to model players as occasionally making small mistakes. In that context, assigning positive probability to every strategy—even suboptimal ones—helps capture how players might still end up choosing them due to small "trembles" in decision-making.

Mixed strategy

[edit]

Illustration

[edit]

In a soccer penalty kick, the kicker must choose whether to kick to the right or left side of the goal, and simultaneously the goalie must decide which way to block it. Also, the kicker has a direction they are best at shooting, which is left if they are right-footed. The matrix for the soccer game illustrates this situation, a simplified form of the game studied by Chiappori, Levitt, and Groseclose (2002).[3] It assumes that if the goalie guesses correctly, the kick is blocked, which is set to the base payoff of 0 for both players. If the goalie guesses wrong, the kick is more likely to go in if it is to the left (payoffs of +2 for the kicker and -2 for the goalie) than if it is to the right (the lower payoff of +1 to kicker and -1 to goalie).

Goalie
Lean Left Lean Right
Kicker Kick Left  0,  0 +2, -2
Kick Right +1, -1  0,  0
 Payoff for the Soccer Game (Kicker, Goalie)

This game has no pure-strategy equilibrium, because one player or the other would deviate from any profile of strategies—for example, (Left, Left) is not an equilibrium because the Kicker would deviate to Right and increase his payoff from 0 to 1.

The kicker's mixed-strategy equilibrium is found from the fact that they will deviate from randomizing unless their payoffs from Left Kick and Right Kick are exactly equal. If the goalie leans left with probability g, the kicker's expected payoff from Kick Left is g(0) + (1-g)(2), and from Kick Right is g(1) + (1-g)(0). Equating these yields g= 2/3. Similarly, the goalie is willing to randomize only if the kicker chooses mixed strategy probability k such that Lean Left's payoff of k(0) + (1-k)(-1) equals Lean Right's payoff of k(-2) + (1-k)(0), so k = 1/3. Thus, the mixed-strategy equilibrium is (Prob(Kick Left) = 1/3, Prob(Lean Left) = 2/3).

In equilibrium, the kicker kicks to their best side only 1/3 of the time. That is because the goalie is guarding that side more. Also, in equilibrium, the kicker is indifferent which way they kick, but for it to be an equilibrium they must choose exactly 1/3 probability.

Chiappori, Levitt, and Groseclose try to measure how important it is for the kicker to kick to their favored side, add center kicks, etc., and look at how professional players actually behave. They find that they do randomize, and that kickers kick to their favored side 45% of the time and goalies lean to that side 57% of the time. Their article is well-known as an example of how people in real life use mixed strategies.

Significance

[edit]

In his famous paper, John Forbes Nash proved that there is an equilibrium for every finite game. One can divide Nash equilibria into two types. Pure strategy Nash equilibria are Nash equilibria where all players are playing pure strategies. Mixed strategy Nash equilibria are equilibria where at least one player is playing a mixed strategy. While Nash proved that every finite game has a Nash equilibrium, not all have pure strategy Nash equilibria. For an example of a game that does not have a Nash equilibrium in pure strategies, see Matching pennies. However, many games do have pure strategy Nash equilibria (e.g. the Coordination game, the Prisoner's dilemma, the Stag hunt). Further, games can have both pure strategy and mixed strategy equilibria. An easy example is the pure coordination game, where in addition to the pure strategies (A,A) and (B,B) a mixed equilibrium exists in which both players play either strategy with probability 1/2.

Interpretations of mixed strategies

[edit]

During the 1980s, the concept of mixed strategies came under heavy fire for being "intuitively problematic", since they are weak Nash equilibria, and a player is indifferent about whether to follow their equilibrium strategy probability or deviate to some other probability.[4] [5] Game theorist Ariel Rubinstein describes alternative ways of understanding the concept. The first, due to Harsanyi (1973),[6] is called purification, and supposes that the mixed strategies interpretation merely reflects our lack of knowledge of the players' information and decision-making process. Apparently random choices are then seen as consequences of non-specified, payoff-irrelevant exogenous factors.[5] A second interpretation imagines the game players standing for a large population of agents. Each of the agents chooses a pure strategy, and the payoff depends on the fraction of agents choosing each strategy. The mixed strategy hence represents the distribution of pure strategies chosen by each population. However, this does not provide any justification for the case when players are individual agents.

Later, Aumann and Brandenburger (1995),[7] re-interpreted Nash equilibrium as an equilibrium in beliefs, rather than actions. For instance, in rock paper scissors an equilibrium in beliefs would have each player believing the other was equally likely to play each strategy. This interpretation weakens the descriptive power of Nash equilibrium, however, since it is possible in such an equilibrium for each player to actually play a pure strategy of Rock in each play of the game, even though over time the probabilities are those of the mixed strategy.

Behavior strategy

[edit]

While a mixed strategy assigns a probability distribution over pure strategies, a behavior strategy (or behavioral strategy) assigns at each information set a probability distribution over the set of possible actions. While the two concepts are very closely related in the context of normal form games, they have very different implications for extensive form games. Roughly, a mixed strategy randomly chooses a deterministic path through the game tree, while a behavior strategy can be seen as a stochastic path. The relationship between mixed and behavior strategies is the subject of Kuhn's theorem, a behavioral outlook on traditional game-theoretic hypotheses. The result establishes that in any finite extensive-form game with perfect recall, for any player and any mixed strategy, there exists a behavior strategy that, against all profiles of strategies (of other players), induces the same distribution over terminal nodes as the mixed strategy does. The converse is also true.

A famous example of why perfect recall is required for the equivalence is given by Piccione and Rubinstein (1997)[full citation needed] with their Absent-Minded Driver game.

Outcome equivalence

[edit]

Outcome equivalence combines the mixed and behavioral strategy of Player i in relation to the pure strategy of Player i’s opponent. Outcome equivalence is defined as the situation in which, for any mixed and behavioral strategy that Player i takes, in response to any pure strategy that Player I’s opponent plays, the outcome distribution of the mixed and behavioral strategy must be equal. This equivalence can be described by the following formula: (Q^(U(i), S(-i)))(z) = (Q^(β(i), S(-i)))(z), where U(i) describes Player i's mixed strategy, β(i) describes Player i's behavioral strategy, and S(-i) is the opponent's strategy.[8]

Strategy with perfect recall

[edit]

Perfect recall is defined as the ability of every player in game to remember and recall all past actions within the game. Perfect recall is required for equivalence as, in finite games with imperfect recall, there will be existing mixed strategies of Player I in which there is no equivalent behavior strategy. This is fully described in the Absent-Minded Driver game formulated by Piccione and Rubinstein. In short, this game is based on the decision-making of a driver with imperfect recall, who needs to take the second exit off the highway to reach home but does not remember which intersection they are at when they reach it. Figure [2] describes this game.

Without perfect information (i.e. imperfect information), players make a choice at each decision node without knowledge of the decisions that have preceded it. Therefore, a player’s mixed strategy can produce outcomes that their behavioral strategy cannot, and vice versa. This is demonstrated in the Absent-minded Driver game. With perfect recall and information, the driver has a single pure strategy, which is [continue, exit], as the driver is aware of what intersection (or decision node) they are at when they arrive to it. On the other hand, looking at the planning-optimal stage only, the maximum payoff is achieved by continuing at both intersections, maximized at p=2/3 (reference). This simple one player game demonstrates the importance of perfect recall for outcome equivalence, and its impact on normal and extended form games.[9]

See also

[edit]

References

[edit]
  1. ^ Ben Polak Game Theory: Lecture 1 Transcript ECON 159, 5 September 2007, Open Yale Courses.
  2. ^ Aumann, R. (22 March 2017). Game Theory. In: Palgrave Macmillan. London: Palgrave Macmillan. ISBN 978-1-349-95121-5.
  3. ^ Chiappori, P. -A.; Levitt, S.; Groseclose, T. (2002). "Testing Mixed-Strategy Equilibria when Players Are Heterogeneous: The Case of Penalty Kicks in Soccer" (PDF). American Economic Review. 92 (4): 1138. CiteSeerX 10.1.1.178.1646. doi:10.1257/00028280260344678.
  4. ^ Aumann, R. (1985). "What is Game Theory Trying to accomplish?" (PDF). In Arrow, K.; Honkapohja, S. (eds.). Frontiers of Economics. Oxford: Basil Blackwell. pp. 909–924.
  5. ^ a b Rubinstein, A. (1991). "Comments on the interpretation of Game Theory". Econometrica. 59 (4): 909–924. doi:10.2307/2938166. JSTOR 2938166.
  6. ^ Harsanyi, John (1973). "Games with randomly disturbed payoffs: a new rationale for mixed-strategy equilibrium points". Int. J. Game Theory. 2: 1–23. doi:10.1007/BF01737554. S2CID 154484458.
  7. ^ Aumann, Robert; Brandenburger, Adam (1995). "Epistemic Conditions for Nash Equilibrium". Econometrica. 63 (5): 1161–1180. CiteSeerX 10.1.1.122.5816. doi:10.2307/2171725. JSTOR 2171725.
  8. ^ Shimoji, Makoto (2025-08-14). "Outcome-equivalence of self-confirming equilibrium and Nash equilibrium". Games and Economic Behavior. 75 (1): 441–447. doi:10.1016/j.geb.2011.09.010. ISSN 0899-8256.
  9. ^ Kak, Subhash (2017). "The Absent-Minded Driver Problem Redux". arXiv:1702.05778 [cs.AI].
感冒了不能吃什么食物 什么病不能吃空心菜 ct胸部平扫检查出什么 大使是什么行政级别 阴虱什么症状
发热门诊属于什么科 唇珠在面相中代表什么 石榴红是什么颜色 图图是什么意思 福禄寿是什么意思
什么水果降火效果最好 流光是什么意思 怀孕小肚子疼是什么原因 怀孕期间不能吃什么 附件囊肿吃什么药可以消除
今年66岁属什么生肖的 蝙蝠是什么类动物 登徒子什么意思 清热去湿热颗粒有什么功效 赵匡胤为什么不传位给儿子
甲亢是一种什么病hcv9jop2ns1r.cn 复方乙酰水杨酸片是什么药onlinewuye.com 刘禅属什么生肖hcv9jop1ns1r.cn 黑匣子是什么hcv8jop4ns5r.cn 世界上最多笔画的字是什么hcv9jop3ns3r.cn
金开什么字hcv9jop6ns7r.cn 张顺的绰号是什么hcv9jop4ns6r.cn 九牛一毛是什么意思hcv7jop9ns3r.cn 考号是什么hcv8jop2ns3r.cn 甲胎蛋白是什么指标hcv9jop2ns3r.cn
牛腩烧什么好吃tiangongnft.com 牙周炎吃什么药最好hcv8jop0ns3r.cn 雪燕适合什么人吃hcv8jop9ns0r.cn 亦什么意思hcv9jop4ns0r.cn 心慌是什么感觉hcv9jop7ns0r.cn
女人下巴长痘痘是什么原因hcv8jop7ns9r.cn 气球是什么意思xinjiangjialails.com 羡煞旁人是什么意思hcv7jop9ns7r.cn 什么水晶招财旺事业hcv9jop1ns0r.cn 什么是三观hcv8jop1ns1r.cn
百度