Arbeitspapier

Nonparametric learning rules from bandit experiments: The eyes have it!

How do people learn? We assess, in a distribution-free manner, subjects' learning and choice rules in dynamic two-armed bandit (probabilistic reversal learning) experiments. To aid in identification and estimation, we use auxiliary measures of subjects' beliefs, in the form of their eye-movements during the experiment. Our estimated choice probabilities and learning rules have some distinctive features; notably that subjects tend to update in a non-smooth manner following choices made in accordance with current beliefs. Moreover, the beliefs implied by our nonparametric learning rules are closer to those from a (non-Bayesian) reinforcement learning model, than a Bayesian learning model.

Language
Englisch

Bibliographic citation
Series: Working Paper ; No. 560

Classification
Wirtschaft
Search; Learning; Information and Knowledge; Communication; Belief; Unawareness
Design of Experiments: Laboratory, Individual
Semiparametric and Nonparametric Methods: General
Subject
Learning
experiments
eye-tracking
Bayesian vs. non-Bayesian learning
nonparametric estimation
Schätztheorie
Lernen
Nichtparametrisches Verfahren
Simulation
Dynamisches Modell

Event
Geistige Schöpfung
(who)
Hu, Yingyao
Kayaba, Yutaka
Shum, Matt
Event
Veröffentlichung
(who)
The Johns Hopkins University, Department of Economics
(where)
Baltimore, MD
(when)
2010

Handle
Last update
10.03.2025, 11:42 AM CET

Data provider

This object is provided by:
ZBW - Deutsche Zentralbibliothek für Wirtschaftswissenschaften - Leibniz-Informationszentrum Wirtschaft. If you have any questions about the object, please contact the data provider.

Object type

  • Arbeitspapier

Associated

  • Hu, Yingyao
  • Kayaba, Yutaka
  • Shum, Matt
  • The Johns Hopkins University, Department of Economics

Time of origin

  • 2010

Other Objects (12)