Dual Learning For Machine Translation
TLDR; The authors cast machine translation as a game of a primal task and a dual task, which allows two agents to teach each other in a reinforcement learning fashion (i.e., policy gradient method). With this implementation, the agent only requires monolingual English ↔ French text data (with 10% bilingual warmup) to achieve a comparable performance to NMT trained with pure bilingual data.