Get our free extension to see links to code for papers anywhere online!

Chrome logo Add to Chrome

Firefox logo Add to Firefox


Neural Machine Translation with Gumbel-Greedy Decoding

Jun 22, 2017
Jiatao Gu, Daniel Jiwoong Im, Victor O. K. Li



Previous neural machine translation models used some heuristic search algorithms (e.g., beam search) in order to avoid solving the maximum a posteriori problem over translation sentences at test time. In this paper, we propose the Gumbel-Greedy Decoding which trains a generative network to predict translation under a trained model. We solve such a problem using the Gumbel-Softmax reparameterization, which makes our generative network differentiable and trainable through standard stochastic gradient methods. We empirically demonstrate that our proposed model is effective for generating sequences of discrete words.



Share this with someone who'll enjoy it:

   Access Paper Source



Share this with someone who'll enjoy it: