User Tools

Site Tools


ml:nn_tricks

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
ml:nn_tricks [2023/01/30 16:46] – [Overviews] brendanml:nn_tricks [2023/10/11 22:19] (current) jmflanig
Line 12: Line 12:
     * [[Curriculum Learning]]     * [[Curriculum Learning]]
     * Overcoming [[Catastrophic Forgetting]]     * Overcoming [[Catastrophic Forgetting]]
-    * Adjust the batch size, or use gradient accumulation to simulate larger batch sizes+    * Adjust the batch size, or use gradient accumulation (see [[https://kozodoi.me/blog/20210219/gradient-accumulation|this blog]], for example) to simulate larger batch sizes
     * Try a different [[optimizers#modern_deep_learning_optimizers|optimizer]], such as [[ https://arxiv.org/pdf/1908.03265.pdf|RAdam]]     * Try a different [[optimizers#modern_deep_learning_optimizers|optimizer]], such as [[ https://arxiv.org/pdf/1908.03265.pdf|RAdam]]
     * Adjust [[https://arxiv.org/pdf/2011.02150.pdf|epsilon]] in Adam     * Adjust [[https://arxiv.org/pdf/2011.02150.pdf|epsilon]] in Adam
ml/nn_tricks.1675097210.txt.gz · Last modified: 2023/06/15 07:36 (external edit)

Donate Powered by PHP Valid HTML5 Valid CSS Driven by DokuWiki