nlp:generation
Table of Contents
Natural Language Generation
Overviews
- Introduction: Eisenstein p. 457
- Gatt et al 2017 - Survey of the State of the Art in Natural Language Generation: Core tasks, applications and evaluation Now outdated. Goes over traditional non-neural methods.
Data-to-Text
Data-to-text generation is generation where the input is formatted data such as tables of numbers. A typical example is generating human-readable weather reports from numbers predicted from weather simulations. An example dataset is RotoWire (paper).
- Puduppully et al 2018 - Data-to-Text Generation with Content Selection and Planning A good baseline, used as baseline in Workshop in NLG and Translation
Meaning-to-Text
This is generation from a meaning representation, such as AMR or slots and values - the inverse of semantic parsing. See also AMR - Generation.
Controllable Text Generation
- Overviews
Evaluation
Survey: 2020 - Evaluation of Text Generation: A Survey
- Gehrmann et al 2021 - The GEM Benchmark: Natural Language Generation, its Evaluation and Metrics (Website)
- Freitag et al 2021 - Experts, Errors, and Context: A Large-Scale Study of Human Evaluation for Machine Translation Uses Multidimensional Quality Metrics (MQM) framework. MT paper, used in WMT
Historical Papers
Papers from a while ago.
Datasets
See also NLP progress - Generation and ACL Wiki - Datasets for NLG
- AMR dataset
- MultiWOZ Data to text generation datset
- Rotowire
- WikiBio
- WebNLG
- ViGGO Dataset, created a UCSC (paper)
Shared Tasks
People
Related Pages
nlp/generation.txt · Last modified: 2024/08/16 00:57 by jmflanig