<sortable>

Year Title Notes
2002 Discriminative Training Methods for Hidden Markov Models - Theory and Experiments with Perceptron Algorithms Structured Perceptron
2018 Language Models are Unsupervised Multitask Learners GPT-2
2020 Language Models are Few-Shot Learners GPT-3

</sortable>