ml:theory:multi-armed_bandit

Differences

This shows you the differences between two versions of the page.

ml:theory:multi-armed_bandit [2022/05/16 08:02] – [Multi-Armed Bandits] jmflanig
ml:theory:multi-armed_bandit [2024/03/06 21:46] (current) – [Surveys] jmflanig
Line 4:
 ===== Surveys =====
   * [[http://homes.di.unimi.it/~cesabian/Pubblicazioni/banditSurvey.pdf|Bubeck & Cesa-Bianchi 2012 - Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems]] Very good survey
 +
 +===== Theory =====
 +  * [[https://proceedings.mlr.press/v202/mei23a/mei23a.pdf|Mei et al 2023 - Stochastic Gradient Succeeds for Bandits]]
 +
  
 ===== Related Pages =====
ml/theory/multi-armed_bandit.1652688159.txt.gz · Last modified: 2023/06/15 07:36 (external edit)
