ml:nn_initialization
Differences
This shows you the differences between two versions of the page.
| Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
| ml:nn_initialization [2021/06/08 08:44] – [Papers] jmflanig | ml:nn_initialization [2023/06/15 07:36] (current) – external edit 127.0.0.1 | ||
|---|---|---|---|
| Line 5: | Line 5: | ||
| * Section 8.4 in [[https:// | * Section 8.4 in [[https:// | ||
| * Initialization section in [[https:// | * Initialization section in [[https:// | ||
| + | * [[https:// | ||
| ===== Papers ===== | ===== Papers ===== | ||
| Line 13: | Line 14: | ||
| * ADMIN: [[https:// | * ADMIN: [[https:// | ||
| * [[https:// | * [[https:// | ||
| - | * SkipInit: [[https:// | + | * SkipInit: [[https:// |
| ===== Software Defaults ===== | ===== Software Defaults ===== | ||
| * PyTorch 1.0 uses He initialization for most layers such as Linear, RNN, Conv2d, etc (see [[https:// | * PyTorch 1.0 uses He initialization for most layers such as Linear, RNN, Conv2d, etc (see [[https:// | ||
| Line 21: | Line 22: | ||
| * [[https:// | * [[https:// | ||
| * Blog post about Glorot and He: [[https:// | * Blog post about Glorot and He: [[https:// | ||
| + | |||
| + | ===== Related Pages ===== | ||
| + | * [[NN Training]] | ||
ml/nn_initialization.1623141843.txt.gz · Last modified: 2023/06/15 07:36 (external edit)