Abstract
This paper analyzes a two-player game of strategic experimentation with three-armed exponential bandits in continuous time. Players face replica bandits, with one arm that is safe in that it generates a known payoff, whereas the likelihood of the risky arms’ yielding a positive payoff is initially unknown. It is common knowledge that the types of the two risky arms are perfectly negatively correlated. I show that the efficient policy is incentive-compatible if, and only if, the stakes are high enough. Moreover, learning will be complete in any Markov perfect equilibrium with continuous value functions if, and only if, the stakes exceed a certain threshold.
Dokumententyp: | Paper |
---|---|
Keywords: | Strategic Experimentation, Three-Armed Bandit, Exponential Distribution, Poisson Process, Bayesian Learning, Markov Perfect Equilibrium |
Fakultät: | Sonderforschungsbereiche > Discussion Paper Series of SFB/TR 15 Governance and the Efficiency of Economic Systems
Sonderforschungsbereiche > Discussion Paper Series of SFB/TR 15 Governance and the Efficiency of Economic Systems > A8 - Strategische Erzeugung und Weitergabe von Informationen |
Themengebiete: | 300 Sozialwissenschaften > 330 Wirtschaft |
JEL Classification: | C73, D83, O32 |
URN: | urn:nbn:de:bvb:19-epub-13221-7 |
Sprache: | Englisch |
Dokumenten ID: | 13221 |
Datum der Veröffentlichung auf Open Access LMU: | 10. Jul. 2012, 13:06 |
Letzte Änderungen: | 04. Nov. 2020, 12:53 |