Statistical Machine Translation
Papers and software for statistical machine translation, mostly for historical reference. See also Wikipedia - Statistical Machine Translation.
Overviews
- Lopez 2008 - Statistical Machine Translation or Lopez 2007 - A Survey of Statistical Machine Translation (older, but nicer formatting)
IBM Models and Alignment
- IBM Models
- Brown et al 1988 - A Statistical Approach to Language Translation. Overview paper of the approach.
- Brown et al 1993 - The Mathematics of Statistical Machine Translation: Parameter Estimation. Mathematical details of IBM alignment models 1-5.
- Concavity of IBM model 1: Gimpel & Smith 2012
- Zenkel et al 2020 - End-to-End Neural Word Alignment Outperforms GIZA++. Not conclusive, since the authors did not run an extrinsic evaluation in an SMT system. Improvements in alignment error rate (AER) often do not translate into better SMT models.
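As a rough illustration of what the alignment papers above estimate and how AER scores the result, here is a minimal sketch of EM training for an IBM Model 1 style lexical table, followed by an AER computation. It rests on several assumptions: the function names (`train_ibm1`, `viterbi_align`, `aer`) and the three-sentence toy corpus are hypothetical and not taken from any of the cited papers, the NULL source word that Model 1 normally includes is omitted for brevity, and AER follows the usual definition with sure and possible gold links.

```python
from collections import defaultdict

def train_ibm1(corpus, iterations=20):
    """EM estimate of t(target_word | source_word).

    Simplified sketch: no NULL source word, uniform initialisation.
    corpus is a list of (source_tokens, target_tokens) pairs.
    """
    target_vocab = {w for _, tgt in corpus for w in tgt}
    # Same starting value for every co-occurring word pair.
    t = {(w, s): 1.0 / len(target_vocab)
         for src, tgt in corpus for s in src for w in tgt}
    for _ in range(iterations):
        count = defaultdict(float)   # expected count of (target, source) links
        total = defaultdict(float)   # expected count of links per source word
        for src, tgt in corpus:
            for w in tgt:
                z = sum(t[(w, s)] for s in src)   # normaliser for this target word
                for s in src:
                    p = t[(w, s)] / z             # posterior that s generated w
                    count[(w, s)] += p
                    total[s] += p
        # M-step: renormalise expected counts per source word.
        t = {(w, s): c / total[s] for (w, s), c in count.items()}
    return t

def viterbi_align(t, src, tgt):
    """Link each target position to its most probable source position."""
    return {(max(range(len(src)), key=lambda i: t.get((tgt[j], src[i]), 0.0)), j)
            for j in range(len(tgt))}

def aer(predicted, sure, possible):
    """Alignment error rate; links are (source_index, target_index) pairs."""
    possible = possible | sure   # sure links count as possible links too
    hits = len(predicted & sure) + len(predicted & possible)
    return 1.0 - hits / (len(predicted) + len(sure))

# Hypothetical toy corpus, for illustration only.
corpus = [
    (["das", "Haus"], ["the", "house"]),
    (["das", "Buch"], ["the", "book"]),
    (["ein", "Buch"], ["a", "book"]),
]
t = train_ibm1(corpus)
predicted = viterbi_align(t, *corpus[0])
gold_sure = {(0, 0), (1, 1)}
print(sorted(predicted), aer(predicted, gold_sure, set()))
```

AER only measures agreement with hand-labelled links, which is why a lower AER need not carry over to translation quality once the alignments are used inside a full SMT system.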
Phrase-Based Machine Translation (PBMT)
Syntax-Based Methods
- Hiero
- SAMT
Training
- Och 2003 - Minimum Error Rate Training in Statistical Machine Translation. The standard method for training (tuning the feature weights of) SMT systems since 2003.
Evaluation
- Metrics
- Methodology
Software
- ReWrite Decoder. Open source implementation of word-based SMT.
Related Pages
nlp/statistical_machine_translation.1618907232.txt.gz · Last modified: 2023/06/15 07:36 (external edit)