March 9th, 2012, 11:59 am
I guess I need to clarify a few things. This is a repeated game, and each round consists of the following sub-game.

sub-game:
-----------------------------------------
1) Host puts the prize behind door 1, 2 or 3.
2) Player pays $1 to guess a door number.
3) If the number is correct, the player wins the prize.
4) If the number is not correct, the host always tells the player whether the guessed door number is too high or too low. The host always provides true information.
5) Player pays $1 to guess another door number.

The sub-game continues until the player eventually wins.

We assume that both player and host are intelligent and rational. The player minimizes the payment and the host maximizes the payment. The host makes only one decision, at the beginning of the sub-game: where to hide the prize. After that, he follows a mechanical process (answering "too high" or "too low" truthfully). But in each new sub-game, he may change the prize door based on what he has learned from the full history of previous plays.

Here are a few examples of the sub-game:

example 1:
Host hides the prize behind door 1. Player pays $1 and guesses door 2. Host tells the player that the number he guessed is too high. Player pays $1, guesses door 1 and wins the prize. Note that the host knows the player is rational and intelligent, so he knows the player will definitely guess door 1 once told that the first guess is too high.

example 2:
Host hides the prize behind door 2. Player pays $1 and guesses door 1. Host tells the player that the number he guessed is too low. Player pays $1 to guess door 2 or 3; which one to pick is part of the player's strategy. If the player picks 2, he wins. If the player picks 3, the host tells him that his number is too high, and he pays another $1 to pick 2 and win.
-------------------------------------
end of description of sub-game

The sub-game is repeated again and again. Initially, the player will guess door 2 first, as this guarantees he wins the sub-game at a cost of at most $2.

But if the host learns that the player always guesses door 2 first, he might change the prize location and hide the prize more often behind door 1 or 3. Once the player finds that the host hides the prize more often behind door 1 or 3, it may be better for him to guess 1 or 3 first. After a while, the host notices that the player keeps guessing door 1 first, for example, and changes his behavior to hide the prize behind some other door. If we keep playing this sub-game forever, what is the expected cost per sub-game?
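For anyone who wants to experiment with this, here is a rough Python sketch of one way to model the question as a zero-sum matrix game. Everything in it beyond the rules above is my own assumption: the player is restricted to the five plans in `PLANS` (he never guesses a door the hints have already ruled out), the back-and-forth learning is modeled as fictitious play (each side best-responds to the other's full history), and the mixes inside `exact_check` are a candidate that the code checks rather than something given in the problem. All names are made up for illustration.

```python
from fractions import Fraction

DOORS = (1, 2, 3)

# Each plan is a preference order: the player always guesses the first door
# in the order that is still consistent with the host's hints.
PLANS = ((2, 1, 3), (1, 2, 3), (1, 3, 2), (3, 2, 1), (3, 1, 2))


def cost(plan, prize):
    """Dollars paid in one sub-game: $1 per guess until the prize is found."""
    lo, hi = 1, 3  # range of doors not yet ruled out
    paid = 0
    while True:
        guess = next(d for d in plan if lo <= d <= hi)
        paid += 1
        if guess == prize:
            return paid
        if guess > prize:
            hi = guess - 1  # host says "too high"
        else:
            lo = guess + 1  # host says "too low"


# COST[i][j]: cost of PLANS[i] when the prize is behind DOORS[j].
COST = [[cost(plan, door) for door in DOORS] for plan in PLANS]


def fictitious_play(rounds):
    """Repeat the sub-game; each side best-responds to the other's history.

    Returns (lower, upper) bounds on the long-run cost per sub-game.
    Requires rounds >= 1.
    """
    plan_counts = [0] * len(PLANS)
    door_counts = [0] * len(DOORS)
    plan_i, door_j = 0, 0  # assumed opening: guess door 2 first, prize at door 1
    for _ in range(rounds):
        plan_counts[plan_i] += 1
        door_counts[door_j] += 1
        # Total cost each door would have produced against the player's history.
        door_totals = [sum(plan_counts[i] * COST[i][j] for i in range(len(PLANS)))
                       for j in range(len(DOORS))]
        # Total cost each plan would have produced against the host's history.
        plan_totals = [sum(door_counts[j] * COST[i][j] for j in range(len(DOORS)))
                       for i in range(len(PLANS))]
        door_j = door_totals.index(max(door_totals))  # host maximizes
        plan_i = plan_totals.index(min(plan_totals))  # player minimizes
    lower = Fraction(min(plan_totals), rounds)  # host's empirical mix guarantees this
    upper = Fraction(max(door_totals), rounds)  # player's empirical mix concedes this
    return lower, upper


def exact_check():
    """Check a candidate pair of mixes (an assumption, not from the problem)."""
    host_mix = (Fraction(2, 5), Fraction(1, 5), Fraction(2, 5))  # doors 1, 2, 3
    player_mix = (Fraction(3, 5), 0, Fraction(1, 5), 0, Fraction(1, 5))  # over PLANS
    host_floor = min(sum(q * c for q, c in zip(host_mix, row)) for row in COST)
    player_ceiling = max(sum(p * COST[i][j] for i, p in enumerate(player_mix))
                         for j in range(len(DOORS)))
    return host_floor, player_ceiling


if __name__ == "__main__":
    print(exact_check())
    print(fictitious_play(2000))
```

Under these assumptions, `exact_check()` returns 9/5 for both the host's floor and the player's ceiling, which would put the long-run cost at $1.80 per sub-game: the host hides the prize behind doors 1, 2, 3 with probabilities 2/5, 1/5, 2/5, and the player opens with door 2 three times out of five. The bounds from `fictitious_play` always bracket that number, but how fast they tighten depends on the number of rounds and the tie-breaking, so treat them as a sanity check only.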
Last edited by ebenezer on March 8th, 2012, 11:00 pm, edited 1 time in total.