^ Year ^ Title ^ Notes ^ | 2002 | [[paper:Discriminative Training Methods for Hidden Markov Models - Theory and Experiments with Perceptron Algorithms]] | Structured Perceptron | | 2018 | [[paper:Language Models are Unsupervised Multitask Learners]] | GPT-2 | | 2020 | [[paper:Language Models are Few-Shot Learners]] | GPT-3 |