ml:theory:multi-armed_bandit

Differences

This shows you the differences between two versions of the page.

ml:theory:multi-armed_bandit [2022/05/09 08:51] – [Mulit-Armed Bandits] jmflanig
ml:theory:multi-armed_bandit [2024/03/06 21:46] (current) – [Surveys] jmflanig
Line 1:
 ====== Multi-Armed Bandits ======
-See [[https://en.wikipedia.org/wiki/Multi-armed_bandit|Wikipedia - Multi-amred Bandit]].
+See [[https://en.wikipedia.org/wiki/Multi-armed_bandit|Wikipedia - Multi-armed Bandit]].
 
 ===== Surveys =====
-  * [[http://homes.di.unimi.it/~cesabian/Pubblicazioni/banditSurvey.pdf|Bubeck & Cesa-Bianchi 2012 - Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems]]
+  * [[http://homes.di.unimi.it/~cesabian/Pubblicazioni/banditSurvey.pdf|Bubeck & Cesa-Bianchi 2012 - Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems]] Very good survey
+
+===== Theory =====
+  * [[https://proceedings.mlr.press/v202/mei23a/mei23a.pdf|Mei et al 2023 - Stochastic Gradient Succeeds for Bandits]]
 
 ===== Related Pages =====
ml/theory/multi-armed_bandit.1652086309.txt.gz · Last modified: 2023/06/15 07:36 (external edit)
