dc.contributor.author	RIVAS, Javier
dc.date.accessioned	2008-02-13T13:21:14Z
dc.date.available	2008-02-13T13:21:14Z
dc.date.issued	2008
dc.identifier.issn	1725-6704
dc.identifier.uri	http://hdl.handle.net/1814/8084
dc.description.abstract	We investigate learning in a setting where each period a population has to choose between two actions and the payoff of each action is unknown by the players. The population learns according to reinforcement and the environment is non-stationary, meaning that there is correlation between the payoff of each action today and the payoff of each action in the past. We show that when players observe realized and foregone payoffs, a suboptimal mixed strategy is selected. On the other hand, when players only observe realized payoffs, a unique action, which is optimal if actions perform different enough, is selected in the long run. When looking for efficient reinforcement learning rules, we find that it is optimal to disregard the information from foregone payoffs and to learn as if only realized payoffs were observed.	en
dc.format.mimetype	application/pdf
dc.language.iso	en	en
dc.publisher	European University Institute
dc.relation.ispartofseries	EUI ECO	en
dc.relation.ispartofseries	2008/13	en
dc.rights	info:eu-repo/semantics/openAccess
dc.subject	C73	en
dc.subject	Adaptive Learning	en
dc.subject	Markov Chains	en
dc.subject	Non-stationarity	en
dc.subject	Reinforcement Learning	en
dc.title	Learning within a Markovian Environment	en
dc.type	Working Paper	en
dc.neeo.contributor	RIVAS|Javier|aut|
eui.subscribe.skip	true