Show simple item record

dc.contributor.advisorSimchi-Levi, David
dc.contributor.authorAi, Rui
dc.date.accessioned2025-11-05T19:33:20Z
dc.date.available2025-11-05T19:33:20Z
dc.date.issued2025-05
dc.date.submitted2025-07-16T16:02:27.376Z
dc.identifier.urihttps://hdl.handle.net/1721.1/163541
dc.description.abstractThe independence axiom (IA) proposed by Von Neumann and Morgenstern [50] is the cornerstone of the expected utility theory. However, some empirical experiments show that the IA is often violated in the real world. We propose a new kind of multi-armed bandit problem where the expectation of outcomes may influence the agent’s utility which we call expectation-dependent multi-armed bandits and rationalize the choice of agents in Machina’s paradox lacking the IA. We design provably efficient algorithms with low minimax regrets and show their consistency of time horizon T with corresponding regret lower bounds, revealing statistical optimality. Furthermore, as we first consider bandits whose corresponding utility depends on both reality and expectation, it provides a bridge between machine learning and economic behavior theory, shedding light on how to interpret some counterintuitive economic scenarios, like bounded rationality explored by Zhang et al. [54].
dc.publisherMassachusetts Institute of Technology
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0)
dc.rightsCopyright retained by author(s)
dc.rights.urihttps://creativecommons.org/licenses/by-nc-nd/4.0/
dc.titleProblem-Independent Regrets on Expectation-Dependent Multi-Armed Bandits
dc.typeThesis
dc.description.degreeS.M.
dc.contributor.departmentMassachusetts Institute of Technology. Institute for Data, Systems, and Society
dc.identifier.orcidhttps://orcid.org/0009-0005-9262-0630
mit.thesis.degreeMaster
thesis.degree.nameMaster of Science in Social and Engineering Systems


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record