Statistical Machine Translation
Papers and software for statistical machine translation, mostly for historical reference. See also Wikipedia - Statistical Machine Translation.
Overviews
- Lopez 2008 - Statistical Machine Translation or Lopez 2007 - A Survey of Statistical Machine Translation (older, but nicer formatting)
IBM Models and Alignment
- IBM Models
- Brown et al 1988 - A Statistical Approach to Language Translation. Overview paper of the approach.
- Brown et al 1993 - The Mathematics of Statistical Machine Translation: Parameter Estimation. Mathematical details of IBM alignment models 1-5.
- ReWrite decoder: Germann et al 2001 - Fast Decoding and Optimal Decoding for Machine Translation. A* search, hill climbing, and ILP decoding for word-based models.
- Riedel & Clark 2009 - Revisiting Optimal Decoding for Machine Translation IBM Model 4. Uses an ILP for IBM Model 4.
- Concavity of IBM model 1: Gimpel & Smith 2012
- Zenkel et al 2020 - End-to-End Neural Word Alignment Outperforms GIZA++. Not conclusive, since the paper does not include an extrinsic evaluation in an SMT system. Improvements in alignment error rate (AER) often do not translate into better SMT models.
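For readers unfamiliar with the alignment error rate mentioned above, here is a minimal sketch of the usual definition, AER = 1 - (|A ∩ S| + |A ∩ P|) / (|A| + |S|), where A is the predicted link set, S the "sure" gold links, and P the "possible" gold links (with S treated as a subset of P). The function name, argument names, and toy data are illustrative assumptions and do not come from any of the papers listed here.

```python
def alignment_error_rate(predicted, sure, possible):
    """Compute AER from sets of (source_index, target_index) links.

    Assumption: sure links are also counted as possible links,
    so P is taken to be the union of `possible` and `sure`.
    """
    a = set(predicted)
    s = set(sure)
    p = set(possible) | s
    if not a and not s:
        # No links predicted and none required: treat as zero error.
        return 0.0
    return 1.0 - (len(a & s) + len(a & p)) / (len(a) + len(s))


# Toy example (hypothetical data): two sure links, one extra possible link.
sure_links = {(0, 0), (1, 1)}
possible_links = {(2, 1)}
predicted_links = {(0, 0), (2, 1), (2, 2)}

aer = alignment_error_rate(predicted_links, sure_links, possible_links)
print(f"AER = {aer:.2f}")
```

Lower is better. Note that AER is an intrinsic measure: it only compares links against a gold alignment and says nothing directly about downstream translation quality, which is the caveat raised in the entry above.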
Phrase-Based Machine Translation (PBMT)
Syntax-Based Methods
- Melamed 2004 - Statistical Machine Translation by Parsing. Introduces SCFGs for SMT.
- Hiero
- SAMT
Training
- MERT: Och 2003 - Minimum Error Rate Training in Statistical Machine Translation. The standard method for training SMT systems since 2003.
Evaluation
- Metrics
- Methodology
Software
- ReWrite Decoder: Open source implementation of word-based SMT.
Related Pages
Last modified: 2023/06/15 07:36