User Tools

Site Tools


ml:scaling_laws

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
ml:scaling_laws [2025/06/01 23:09] – [Emergent Abilities] jmflanigml:scaling_laws [2025/06/01 23:09] (current) – [Related Pages] jmflanig
Line 14: Line 14:
  
 ==== Emergent Abilities ==== ==== Emergent Abilities ====
-See also [[Language Model#Origin of Capabilities|Language Model - Origin of Capabilities]].+See also [[nlp:Language Model#Origin of Capabilities|Language Model - Origin of Capabilities]].
  
   * GPT-3: [[https://arxiv.org/pdf/2005.14165.pdf|Brown et al 2021 - Language Models are Few-Shot Learners]] GPT-3 showed emergent abilities.  See for example Fig 3.10.   * GPT-3: [[https://arxiv.org/pdf/2005.14165.pdf|Brown et al 2021 - Language Models are Few-Shot Learners]] GPT-3 showed emergent abilities.  See for example Fig 3.10.
Line 25: Line 25:
   * [[Hyperparameter Tuning]]   * [[Hyperparameter Tuning]]
   * [[nlp:Language Model]]   * [[nlp:Language Model]]
 +  * [[nlp:Language Model#Origin of Capabilities|Language Model - Origin of Capabilities]]
   * [[nlp:pretraining#Pretraining Methodology]]   * [[nlp:pretraining#Pretraining Methodology]]
ml/scaling_laws.1748819345.txt.gz · Last modified: 2025/06/01 23:09 by jmflanig

Donate Powered by PHP Valid HTML5 Valid CSS Driven by DokuWiki