nlp:transformers
Differences
This shows you the differences between two versions of the page.
| Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
| nlp:transformers [2025/04/04 23:27] – [Papers] jmflanig | nlp:transformers [2025/10/17 20:09] (current) – [Analysis and Interpretation] jmflanig | ||
|---|---|---|---|
| Line 18: | Line 18: | ||
| See also the group [[https:// | See also the group [[https:// | ||
| + | * **Overviews** | ||
| + | * [[https:// | ||
| * **[[https:// | * **[[https:// | ||
| * [[https:// | * [[https:// | ||
| Line 31: | Line 33: | ||
| * [[https:// | * [[https:// | ||
| * [[https:// | * [[https:// | ||
| + | * [[https:// | ||
| + | * Follow up work: | ||
| + | * [[https:// | ||
| + | * [[https:// | ||
| + | * [[https:// | ||
| + | * **[[https:// | ||
| ===== Analysis and Interpretation ===== | ===== Analysis and Interpretation ===== | ||
| Line 39: | Line 46: | ||
| * [[https:// | * [[https:// | ||
| * [[https:// | * [[https:// | ||
| + | * [[https:// | ||
| * [[https:// | * [[https:// | ||
| * **For decoders/ | * **For decoders/ | ||
| Line 73: | Line 81: | ||
| * [[https:// | * [[https:// | ||
| * [[https:// | * [[https:// | ||
| + | * [[https:// | ||
| ===== Ablation Experiments on the Transformer ===== | ===== Ablation Experiments on the Transformer ===== | ||
| Line 109: | Line 118: | ||
| * [[https:// | * [[https:// | ||
| * [[https:// | * [[https:// | ||
| + | * Stabilization of Training | ||
| + | * [[https:// | ||
| * Miscellaneous topics | * Miscellaneous topics | ||
| * [[https:// | * [[https:// | ||
| Line 121: | Line 132: | ||
| * [[https:// | * [[https:// | ||
| * [[https:// | * [[https:// | ||
| + | * [[https:// | ||
| * [[https:// | * [[https:// | ||
| * [[https:// | * [[https:// | ||
| Line 139: | Line 151: | ||
| ===== Long-Context Transformers ===== | ===== Long-Context Transformers ===== | ||
| + | |||
| + | ==== Survey Papers ==== | ||
| + | * [[https:// | ||
| + | * [[https:// | ||
| ==== Papers ==== | ==== Papers ==== | ||
| Line 149: | Line 165: | ||
| * [[https:// | * [[https:// | ||
| * **[[https:// | * **[[https:// | ||
| + | * [[https:// | ||
| + | 1, | ||
| * [[https:// | * [[https:// | ||
| * [[https:// | * [[https:// | ||
| Line 155: | Line 173: | ||
| * **[[https:// | * **[[https:// | ||
| * [[https:// | * [[https:// | ||
| + | * [[https:// | ||
| * [[https:// | * [[https:// | ||
| + | * [[https:// | ||
| ===== Position Embeddings ===== | ===== Position Embeddings ===== | ||
nlp/transformers.1743809231.txt.gz · Last modified: 2025/04/04 23:27 by jmflanig