Differences
This shows you the differences between two versions of the page.
| ml:nn_initialization [2021/02/17 19:40] – jmflanig | ml:nn_initialization [2023/06/15 07:36] (current) – external edit 127.0.0.1 | ||
|---|---|---|---|
| Line 1: | Line 1: | ||
| ====== Neural Network Initialization ====== | ====== Neural Network Initialization ====== | ||
| + | |||
| + | ===== Overviews ===== | ||
| + | * Blog post: [[https:// | ||
| + | * Section 8.4 in [[https:// | ||
| + | * Initialization section in [[https:// | ||
| + | * [[https:// | ||
| + | |||
| + | ===== Papers ===== | ||
| + | |||
| * Glorot (Xavier) initialization: | * Glorot (Xavier) initialization: | ||
| + | * Intuition: if you initialize the network randomly, then for a given neuron, if there are a lot of incoming connections, | ||
| * He initialization: | * He initialization: | ||
| - | * ADMIN: [[https:// | + | * ADMIN: |
| + | * [[https:// | ||
| + | * SkipInit: [[https:// | ||
| + | ===== Software Defaults ===== | ||
| + | * PyTorch 1.0 uses He initialization for most layers such as Linear, RNN, Conv2d, etc. (see [[https:// | ||
| ===== Resources ===== | ===== Resources ===== | ||
| * [[https:// | * [[https:// | ||
| * [[https:// | * [[https:// | ||
| + | * Blog post about Glorot and He: [[https:// | ||
| + | |||
| + | ===== Related Pages ===== | ||
| + | * [[NN Training]] | ||
| + | |||
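
The intuition behind the Glorot and He entries in the Papers list can be checked numerically. The following is a minimal sketch, not taken from either paper: it assumes unit-variance Gaussian inputs and Gaussian weights, and the helper names (`glorot_std`, `he_std`, `preactivation_variance`) are hypothetical. It uses the standard normal-variant formulas, std = sqrt(2 / (fan_in + fan_out)) for Glorot and std = sqrt(2 / fan_in) for He, and estimates the variance of a single neuron's pre-activation.

```python
import math
import random


def glorot_std(fan_in, fan_out):
    """Std of Glorot (Xavier) normal init: sqrt(2 / (fan_in + fan_out))."""
    return math.sqrt(2.0 / (fan_in + fan_out))


def he_std(fan_in):
    """Std of He normal init: sqrt(2 / fan_in)."""
    return math.sqrt(2.0 / fan_in)


def preactivation_variance(fan_in, weight_std, trials=2000, seed=0):
    """Empirical variance of z = sum_i w_i * x_i for one neuron.

    Assumption for this sketch: x_i ~ N(0, 1) and w_i ~ N(0, weight_std^2),
    all independent, so the theoretical variance is fan_in * weight_std^2.
    """
    rng = random.Random(seed)
    total = 0.0
    total_sq = 0.0
    for _ in range(trials):
        z = sum(
            rng.gauss(0.0, weight_std) * rng.gauss(0.0, 1.0)
            for _ in range(fan_in)
        )
        total += z
        total_sq += z * z
    mean = total / trials
    return total_sq / trials - mean * mean


if __name__ == "__main__":
    fan_in = fan_out = 100
    for name, std in [
        ("unit-variance weights", 1.0),
        ("Glorot", glorot_std(fan_in, fan_out)),
        ("He", he_std(fan_in)),
    ]:
        var = preactivation_variance(fan_in, std)
        print(f"{name}: weight std = {std:.4f}, pre-activation variance = {var:.3f}")
```

With unscaled weights the pre-activation variance grows roughly in proportion to `fan_in`, while both schemes keep it near a constant. Under these assumptions He initialization targets a variance of about 2 rather than 1, which compensates for a ReLU zeroing out roughly half of the pre-activations.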
ml/nn_initialization.1613590816.txt.gz · Last modified: 2023/06/15 07:36 (external edit)