The Perils of Misspecified Priors and Optional Stopping in Multi-Armed Bandits

Loecher, Markus (2021) The Perils of Misspecified Priors and Optional Stopping in Multi-Armed Bandits. Frontiers in Artificial Intelligence, 4. ISSN 2624-8212

Abstract

The connection between optimal stopping times for American options and multi-armed bandits is a subject of active research. This article investigates the effects of optional stopping in a particular class of multi-armed bandit experiments, one that randomly allocates observations to arms in proportion to the Bayesian posterior probability that each arm is optimal (Thompson sampling). We examine the interplay between optional stopping and prior mismatch, and propose a novel partitioning of regret into peri-testing and post-testing components. We further show that the parameters of interest depend strongly on the assumed prior probability density.
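
To make the setting in the abstract concrete, the following is a minimal sketch of Thompson sampling with an optional stopping rule for Bernoulli arms. It is not taken from the article: the Beta prior, the stopping threshold, the checking interval, and the names `prob_optimal` and `thompson_run` are all illustrative assumptions, and the split into regret accumulated during the test and regret per step after the test is only one plausible reading of "peri/post testing".

```python
import random


def prob_optimal(alpha, beta, rng, draws=500):
    """Monte Carlo estimate of the posterior probability that each arm is best."""
    wins = [0] * len(alpha)
    for _ in range(draws):
        samples = [rng.betavariate(a, b) for a, b in zip(alpha, beta)]
        wins[samples.index(max(samples))] += 1
    return [w / draws for w in wins]


def thompson_run(true_rates, prior=(1.0, 1.0), threshold=0.95,
                 max_steps=2000, check_every=50, seed=0):
    """Run one Bernoulli bandit experiment (hypothetical helper, not the paper's code).

    prior       -- assumed Beta(a, b) prior shared by all arms; changing it
                   lets you explore the effect of prior mismatch
    threshold   -- assumed stopping rule: stop once some arm's posterior
                   probability of being optimal reaches this value
    """
    rng = random.Random(seed)
    k = len(true_rates)
    alpha = [prior[0]] * k
    beta = [prior[1]] * k
    best = max(true_rates)
    peri_regret = 0.0  # expected regret accumulated while the test is running
    steps = 0
    for steps in range(1, max_steps + 1):
        # Thompson sampling: draw once from each arm's posterior and play the
        # argmax, so each arm is chosen with its posterior probability of being best.
        samples = [rng.betavariate(a, b) for a, b in zip(alpha, beta)]
        arm = samples.index(max(samples))
        reward = 1 if rng.random() < true_rates[arm] else 0
        alpha[arm] += reward
        beta[arm] += 1 - reward
        peri_regret += best - true_rates[arm]
        # Optional stopping: peek at the posterior at regular intervals.
        if steps % check_every == 0:
            if max(prob_optimal(alpha, beta, rng)) >= threshold:
                break
    probs = prob_optimal(alpha, beta, rng)
    chosen = probs.index(max(probs))
    return {
        "steps": steps,
        "chosen_arm": chosen,
        "peri_regret": peri_regret,
        # Regret per step incurred afterwards if the chosen arm is deployed.
        "post_regret_per_step": best - true_rates[chosen],
    }


if __name__ == "__main__":
    # Compare a flat prior with a strongly optimistic, mismatched prior.
    for prior in [(1.0, 1.0), (20.0, 5.0)]:
        print(prior, thompson_run([0.10, 0.12], prior=prior))
```

Running the sketch with different `prior` values on the same `true_rates` gives a rough feel for how the stopping time and both regret components can shift with the assumed prior; the specific numbers depend on the seed and the assumed stopping rule.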

Item Type: Article
Subjects: European Scholar > Multidisciplinary
Depositing User: Managing Editor
Date Deposited: 13 Mar 2023 06:25
Last Modified: 05 Jul 2024 09:17
URI: http://article.publish4promo.com/id/eprint/898
