====== Self-Play and AI Feedback ======
Self play is where a model interacts with itself to improve - for example self-play against itself to get better at a game. 

===== Papers =====
  * AlphaGo: [[https://www.nature.com/articles/nature16961|Silver et al 2016 - Mastering The Game of Go with Deep Neural Networks and Tree Search]]

===== NLP Papers =====
  * [[https://arxiv.org/pdf/2305.10142.pdf|Fu et al 2023 - Improving Language Model Negotiation with Self-Play and In-Context Learning from AI Feedback]]

===== Related Pages =====
  * [[GANs]]
  * [[nlp:Human-in-the-Loop]]
  * [[Reinforcement Learning]]