ml:theory:regret_bounds

Differences

This shows you the differences between two versions of the page.

ml:theory:regret_bounds [2022/05/09 08:31] jmflanig
ml:theory:regret_bounds [2023/06/15 07:36] (current) – external edit 127.0.0.1
Line 4: Line 4:
 ==== Surveys and Theses ====
   * [[http://citeseerx.ist.psu.edu/viewdoc/download;jsessionid=9AF556097D53F9170E8DC85C381F6971?doi=10.1.1.161.9973&rep=rep1&type=pdf|Shalev-Shwartz 2007 - Online Learning: Theory, Algorithms, and Applications]]  See section 2.4 (page 27 in pdf) for historical references
 +  * [[https://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.419.9&rep=rep1&type=pdf|Bottou - Online Learning and Stochastic Approximations]]
  
 ==== Key Papers =====
-  * [[Zinkevich 2003 - Online Convex Programming and Generalized Infinitesimal Gradient Ascent]]
+  * [[https://www.aaai.org/Papers/ICML/2003/ICML03-120.pdf|Zinkevich 2003 - Online Convex Programming and Generalized Infinitesimal Gradient Ascent]] See also the [[https://www.cs.cmu.edu/~maz/publications/techconvex.pdf|CMU tech report]]
  
 ===== Regret Bounds =====
Line 18: Line 19:
  
 ===== Related Pages =====
 +  * [[ml:theory:Multi-Armed Bandit]]
   * [[ml:Online Learning]]
 +  * [[ml:Reinforcement Learning#Theory|Reinforcement Learning - Theory]]
  
ml/theory/regret_bounds.1652085112.txt.gz · Last modified: 2023/06/15 07:36 (external edit)
