User Tools

Site Tools


ml:theory:multi-armed_bandit

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
ml:theory:multi-armed_bandit [2023/06/15 07:36] – external edit 127.0.0.1ml:theory:multi-armed_bandit [2024/03/06 21:46] (current) – [Surveys] jmflanig
Line 4: Line 4:
 ===== Surveys ===== ===== Surveys =====
   * [[http://homes.di.unimi.it/~cesabian/Pubblicazioni/banditSurvey.pdf|Bubeck & Cesa-Bianchi 2012 - Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems]] Very good survey   * [[http://homes.di.unimi.it/~cesabian/Pubblicazioni/banditSurvey.pdf|Bubeck & Cesa-Bianchi 2012 - Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems]] Very good survey
 +
 +===== Theory =====
 +  * [[https://proceedings.mlr.press/v202/mei23a/mei23a.pdf|Mei et al 2023 - Stochastic Gradient Succeeds for Bandits]]
 +
  
 ===== Related Pages ===== ===== Related Pages =====
ml/theory/multi-armed_bandit.1686814574.txt.gz · Last modified: 2023/06/15 07:36 by 127.0.0.1

Donate Powered by PHP Valid HTML5 Valid CSS Driven by DokuWiki