Table of Contents
Bayesian Methods
Bayesian methods are a sub-field of statistics (bayesian statistics), and are often used in machine learning. Bayesian methods assume a prior over the space of models (the prior belief), and after observing the data, update the belief over the space of models (the posterior belief). The posterior belief can be used to make predictions, etc. (This is in contrast to frequentist methods that are “designed to create procedures with certain frequency gaurantees (consistency, coverage, minimaxity etc)” (Nonparametric Bayesian Methods, Chapter 8)). Bayesian methods often have very good frequentist properties, and often perform well in practice.
There is recent interest in combining bayesian methods with deep learning. For examples, see the Bayesian Deep Learning Workshop at NeurIPS. Bayesian methods such as bayesian optimization are also used in hyperparameter tuning.
Bayesian Papers in Machine Learning and NLP
Bayesian Neural Networks
Overviews
- Blog post: Intro to Bayesian Deep Learning
- Charnock et al 2020 - Bayesian Neural Networks (Draft book chapter)
Papers
Variational Bayes
Overviews
Papers
Bayesian Nonparametrics
Bayesian nonparametric methods are a sub-area of bayesian statistics, and are also commonly used in machine learning. They were very popular in NLP before the deep learning era started in 2014.
Overviews
- Wasserman - Nonparametric Bayesian Methods Notes from a statistics course
Resources
- Talks (with videos)
- Talk by Michael Jordan: Dirichlet Processes, Chinese Restaurant Processes, and All That
Papers
- Goldwater et al 2005 - Interpolating Between Types and Tokens by Estimating Power-Law Generators “We show that taking a particular stochastic process – the Pitman-Yor process – as an adaptor justifies the appearance of type frequencies in formal analyses of natural language.”
- Shindo et al 2012 - Bayesian Symbol-Refined Tree Substitution Grammars for Syntactic Parsing 92.4 on PTB. Was SOTA until Dyer 2016 surpassed it.
- Chahuneau et al 2013 - Knowledge-Rich Morphological Priors for Bayesian Language Models Combines a finite-state guesser with Bayesian non-parametrics
Papers Combining Deep Learning and Bayesian Nonparametrics
People
Related Pages
- Graphical Models Bayesian methods often use techniques from graphical models, such as MCMC and variational inference, as well as representing likelihood and prior as a graphical model