X -ARMED BANDITS: OPTIMIZING QUANTILES, AND OTHER RISKS - INSA Toulouse - Institut National des Sciences Appliquées de Toulouse Accéder directement au contenu
Pré-Publication, Document De Travail Année : 2019

X -ARMED BANDITS: OPTIMIZING QUANTILES, AND OTHER RISKS

Résumé

We propose and analyze StoROO, an algorithm for risk optimization on stochastic black-box functions derived from StoOO. Motivated by risk-averse decision making fields like agriculture, medicine, biology or finance, we do not focus on the mean payoff but on generic functionals of the return distribution, like for example quantiles. We provide a generic regret analysis of StoROO. Inspired by the bandit literature and black-box mean optimizers, StoROO relies on the possibility to construct confidence intervals for the targeted functional based on random-size samples. We explain in detail how to construct them for quantiles, providing tight bounds based on Kullback-Leibler divergence. The interest of these tight bounds is highlighted by numerical experiments that show a dramatic improvement over standard approaches.
Fichier principal
Vignette du fichier
quantile.pdf (709.87 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-02101647 , version 1 (17-04-2019)
hal-02101647 , version 2 (30-10-2019)
hal-02101647 , version 3 (04-03-2020)

Identifiants

  • HAL Id : hal-02101647 , version 1

Citer

Léonard Torossian, Aurélien Garivier, Victor Picheny. X -ARMED BANDITS: OPTIMIZING QUANTILES, AND OTHER RISKS. 2019. ⟨hal-02101647v1⟩
159 Consultations
73 Téléchargements

Partager

Gmail Facebook X LinkedIn More