Key points are not available for this paper at this time.
OPTIMAL LEARNING BY EXPERIMENTATIONThis paper analyses the dynamic decision problem of an agent who is initially uncertain as to the true shape of his payoff function, but who obtains information aboutit over time by observing the outcome of his past decisions.In the long run, the action is a short run optimum given the beliefs, but may not be an optimum for the true payoff function.We derive conditions under which the limit action is optimal for the true payoff function and establish the robustness of the results.Finally we study the adjustment process in an example where such complete learning does not achieve in the long run.
Aghion et al. (Sat,) studied this question.