ml:nn_tricks
Differences
This shows you the differences between two versions of the page.
| Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
| ml:nn_tricks [2022/05/11 19:29] – [Neural Network Tricks] jmflanig | ml:nn_tricks [2023/10/11 22:19] (current) – jmflanig | ||
|---|---|---|---|
| Line 1: | Line 1: | ||
| ====== Neural Network Tricks ====== | ====== Neural Network Tricks ====== | ||
| + | |||
| + | ===== Overviews ===== | ||
| + | * NLP 202 lecture: [[https:// | ||
| * Training Tricks (see [[NN Training]]) | * Training Tricks (see [[NN Training]]) | ||
| Line 9: | Line 12: | ||
| * [[Curriculum Learning]] | * [[Curriculum Learning]] | ||
| * Overcoming [[Catastrophic Forgetting]] | * Overcoming [[Catastrophic Forgetting]] | ||
| - | * Adjust the batch size, or use gradient accumulation to simulate larger batch sizes | + | * Adjust the batch size, or use gradient accumulation |
| - | * Try a different [[optimizers# | + | * Try a different [[optimizers# |
| * Adjust [[https:// | * Adjust [[https:// | ||
| + | * Fine-tuning Specific Tricks | ||
| + | * [[https:// | ||
| * Regularization Tricks (see [[Regularization]]) | * Regularization Tricks (see [[Regularization]]) | ||
| * [[Regularization# | * [[Regularization# | ||
ml/nn_tricks.1652297345.txt.gz · Last modified: 2023/06/15 07:36 (external edit)