ml:normalization
Differences
This shows you the differences between two versions of the page.
| Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
| ml:normalization [2021/03/04 06:09] – jmflanig | ml:normalization [2023/06/15 07:36] (current) – external edit 127.0.0.1 | ||
|---|---|---|---|
| Line 1: | Line 1: | ||
| ====== Normalization ====== | ====== Normalization ====== | ||
| - | * **Batch | + | Normalization can improve the optimizer' |
| - | * Issues with RNNs | + | |
| - | * [[https:// | + | ===== Overviews ===== |
| - | | + | * Blog post: [[https:// |
| - | * [[https:// | + | |
| - | * [[https:// | + | ===== Activation Normalization Schemes ===== |
| - | * **Weight Normalization**, | + | |
| - | * [[https:// | + | ==== Batch Normalization ==== |
| + | Batch normalization is popular in computer vision, but not usually used in NLP because it doesn' | ||
| + | * [[https:// | ||
| + | | ||
| + | | ||
| + | * [[https:// | ||
| + | * [[https:// | ||
| * [[https:// | * [[https:// | ||
| + | |||
| + | ==== Layer Normalization ==== | ||
| + | * [[https:// | ||
| + | * [[https:// | ||
| + | |||
| + | ===== Weight Normalization Schemes ==== | ||
| + | |||
| + | ==== Weight Normalization ==== | ||
| + | * Weight normalization is billed as an alternative to batch normalization. [[https:// | ||
| + | |||
| + | ===== Other or Uncategorized Schemes ===== | ||
| + | |||
| + | |||
| + | ===== Related Pages ===== | ||
| + | * [[NN Training# | ||
ml/normalization.1614838196.txt.gz · Last modified: 2023/06/15 07:36 (external edit)