ml:scaling_laws
Differences
This shows you the differences between two versions of the page.
| ml:scaling_laws [2024/07/12 03:35] – jmflanig | ml:scaling_laws [2025/06/01 23:09] (current) – [Related Pages] jmflanig | ||
|---|---|---|---|
| Line 7: | Line 7: | ||
| * **[[https:// | * **[[https:// | ||
| * [[https:// | * [[https:// | ||
| + | * [[https:// | ||
| ==== Training LLMs ==== | ==== Training LLMs ==== | ||
| Line 13: | Line 14: | ||
| ==== Emergent Abilities ==== | ==== Emergent Abilities ==== | ||
| + | See also [[nlp: | ||
| + | |||
| + | * GPT-3: [[https:// | ||
| + | * [[https:// | ||
| * [[https:// | * [[https:// | ||
| * **[[https:// | * **[[https:// | ||
| Line 20: | Line 25: | ||
| * [[Hyperparameter Tuning]] | * [[Hyperparameter Tuning]] | ||
| * [[nlp: | * [[nlp: | ||
| + | * [[nlp: | ||
| * [[nlp: | * [[nlp: | ||
ml/scaling_laws.1720755341.txt.gz · Last modified: 2024/07/12 03:35 by jmflanig