nlp:pretraining
Differences
This shows you the differences between two versions of the page.
| Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
| nlp:pretraining [2024/04/19 23:04] – [Amount, Selection and Cleaning of Pretraining Data] jmflanig | nlp:pretraining [2026/02/20 06:35] (current) – [Key and Early Papers] jmflanig | ||
|---|---|---|---|
| Line 2: | Line 2: | ||
| ===== Overviews ===== | ===== Overviews ===== | ||
| + | See also [[language_model# | ||
| * **[[https:// | * **[[https:// | ||
| * [[https:// | * [[https:// | ||
| Line 8: | Line 9: | ||
| ===== Key and Early Papers ===== | ===== Key and Early Papers ===== | ||
| For a history, see section 2.4 of [[https:// | For a history, see section 2.4 of [[https:// | ||
| + | * [[https:// | ||
| + | * [[https:// | ||
| * [[https:// | * [[https:// | ||
| + | * [[https:// | ||
| * [[https:// | * [[https:// | ||
| * [[https:// | * [[https:// | ||
| Line 21: | Line 25: | ||
| Papers sorted chronologically. | Papers sorted chronologically. | ||
| * CoVe: [[https:// | * CoVe: [[https:// | ||
| + | * ULMFiT: [[https:// | ||
| * ELMO: [[https:// | * ELMO: [[https:// | ||
| * GPT: [[https:// | * GPT: [[https:// | ||
| Line 103: | Line 108: | ||
| * [[https:// | * [[https:// | ||
| * [[https:// | * [[https:// | ||
| + | * [[https:// | ||
| + | * [[https:// | ||
| * **System Descriptions** | * **System Descriptions** | ||
| * The following papers contain very useful descriptions of LLM pretraining methods and issues | * The following papers contain very useful descriptions of LLM pretraining methods and issues | ||
| * [[https:// | * [[https:// | ||
| * [[https:// | * [[https:// | ||
| + | * [[https:// | ||
| * [[https:// | * [[https:// | ||
| + | * [[https:// | ||
| + | * [[https:// | ||
| * [[https:// | * [[https:// | ||
| * [[https:// | * [[https:// | ||
| + | * [[https:// | ||
| + | * [[https:// | ||
| + | * [[https:// | ||
| ===== Amount, Selection and Cleaning of Pretraining Data ===== | ===== Amount, Selection and Cleaning of Pretraining Data ===== | ||
| Line 122: | Line 135: | ||
| * [[https:// | * [[https:// | ||
| * [[https:// | * [[https:// | ||
| - | * [[https:// | + | |
| + | * [[https:// | ||
| + | * **[[https:// | ||
| + | * [[https:// | ||
| ===== Pretraining On An Academic Budget ===== | ===== Pretraining On An Academic Budget ===== | ||
| Line 132: | Line 149: | ||
| * [[https:// | * [[https:// | ||
| * [[https:// | * [[https:// | ||
| + | * [[https:// | ||
| * [[https:// | * [[https:// | ||
nlp/pretraining.1713567897.txt.gz · Last modified: 2024/04/19 23:04 by jmflanig