ml:normalization
Differences
This shows you the differences between two versions of the page.
| Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
| ml:normalization [2021/03/04 06:13] – jmflanig | ml:normalization [2023/06/15 07:36] (current) – external edit 127.0.0.1 | ||
|---|---|---|---|
| Line 1: | Line 1: | ||
| ====== Normalization ====== | ====== Normalization ====== | ||
| + | Normalization can improve the optimizer' | ||
| + | |||
| + | ===== Overviews ===== | ||
| + | * Blog post: [[https:// | ||
| + | |||
| + | ===== Activation Normalization Schemes ===== | ||
| ==== Batch Normalization ==== | ==== Batch Normalization ==== | ||
| - | Batch normalization is popular in computer vision, but not usually used in NLP because it doesn' | + | Batch normalization is popular in computer vision, but not usually used in NLP because it doesn' |
| + | * [[https:// | ||
| * Issues with RNNs | * Issues with RNNs | ||
| * [[https:// | * [[https:// | ||
| * [[https:// | * [[https:// | ||
| - | | + | |
| + | * [[https:// | ||
| ==== Layer Normalization ==== | ==== Layer Normalization ==== | ||
| * [[https:// | * [[https:// | ||
| + | * [[https:// | ||
| - | ==== Other Normalization Schemes ==== | + | ===== Weight |
| - | | + | |
| - | * [[https:// | + | ==== Weight Normalization |
| - | * [[https:// | + | * Weight |
| + | |||
| + | ===== Other or Uncategorized Schemes ===== | ||
| + | |||
| + | |||
| + | ===== Related Pages ===== | ||
| + | * [[NN Training# | ||
ml/normalization.1614838438.txt.gz · Last modified: 2023/06/15 07:36 (external edit)