Differences
This shows you the differences between two versions of the page.
| Next revision | Previous revision | ||
| nlp:datasets [2021/02/11 00:21] – created jmflanig | nlp:datasets [2023/11/29 21:14] (current) – jmflanig | ||
|---|---|---|---|
| Line 1: | Line 1: | ||
| ====== NLP Datasets ====== | ====== NLP Datasets ====== | ||
| - | See also [[http:// | + | See also [[http:// |
| + | |||
| + | ===== Language Modeling Corpora ===== | ||
| + | * BNC corpus | ||
| + | * Gigaword | ||
| + | * Common Crawl | ||
| + | * [[https:// | ||
| + | |||
| + | ===== General Benchmarks or Multi-Task Benchmarks ===== | ||
| + | * [[https:// | ||
| + | * [[https:// | ||
| + | * [[https:// | ||
| + | * [[https:// | ||
| + | |||
| + | ===== Multilingual ===== | ||
| + | |||
| + | * Survey on Multilingual NLP Datasets: [[https:// | ||
| ===== Dialog ===== | ===== Dialog ===== | ||
| Line 20: | Line 36: | ||
| ===== Compositional Generalization ===== | ===== Compositional Generalization ===== | ||
| + | |||
| + | ===== Commonsense Reasoning ===== | ||
| + | |||
| + | ===== Paraphrase ===== | ||
| + | * [[https:// | ||
nlp/datasets.1613002861.txt.gz · Last modified: 2023/06/15 07:36 (external edit)