ml:curriculum_learning
Table of Contents
Curriculum Learning
Curriculum learning (CL) is where a neural network is trained on easier examples before training on harder examples. The method for deciding which examples to train on at different times is called the curriculum, and uses a measure of difficulty for the data points. As an example, one can train on shorter sentences before adding longer sentences to the training. CL can help the model learn features that generalize better, and help the model learn faster. For an overview, see Bengio 2009.
Overviews
Strategies for the Curriculum
- In NLP
- Length, word rarity
Papers
- Hacohen & Weinshall 2019 - On The Power of Curriculum Learning in Training Deep Networks Has some theoretical results
- Press et al 2020 - Shortformer: Better Language Modeling using Shorter Inputs Points out that BERT also uses curriculum learning.
Theory
- Weinshall et al 2018 - Curriculum Learning by Transfer Learning: Theory and Experiments with Deep Networks Theory result: Shows, that for SGD on a convex problem (linear regression), “the rate of convergence of an ideal curriculum learning method is monotonically increasing with the difficulty of the examples.”
Self-Paced Learning
Computer Vision
Curriculum learning is widely used in computer vision.
Papers using Curriculum Learning
Related Pages
ml/curriculum_learning.txt · Last modified: 2024/06/28 03:27 by jmflanig