Get our free extension to see links to code for papers anywhere online!

 Add to Chrome

 Add to Firefox

CatalyzeX Code Finder - Browser extension linking code for ML papers across the web! | Product Hunt Embed
Beyond English-Centric Multilingual Machine Translation

Oct 21, 2020
Angela Fan, Shruti Bhosale, Holger Schwenk, Zhiyi Ma, Ahmed El-Kishky, Siddharth Goyal, Mandeep Baines, Onur Celebi, Guillaume Wenzek, Vishrav Chaudhary, Naman Goyal, Tom Birch, Vitaliy Liptchinsky, Sergey Edunov, Edouard Grave, Michael Auli, Armand Joulin


  Access Paper or Ask Questions

MLQE-PE: A Multilingual Quality Estimation and Post-Editing Dataset

Oct 09, 2020
Marina Fomicheva, Shuo Sun, Erick Fonseca, Frédéric Blain, Vishrav Chaudhary, Francisco Guzmán, Nina Lopatina, Lucia Specia, André F. T. Martins


  Access Paper or Ask Questions

Self-training Improves Pre-training for Natural Language Understanding

Oct 05, 2020
Jingfei Du, Edouard Grave, Beliz Gunel, Vishrav Chaudhary, Onur Celebi, Michael Auli, Ves Stoyanov, Alexis Conneau

* 8 pages 

  Access Paper or Ask Questions

Multilingual Translation with Extensible Multilingual Pretraining and Finetuning

Aug 02, 2020
Yuqing Tang, Chau Tran, Xian Li, Peng-Jen Chen, Naman Goyal, Vishrav Chaudhary, Jiatao Gu, Angela Fan

* 10 pages (main) + 5 pages (appendices). 9 tables and 2 figures 

  Access Paper or Ask Questions

Unsupervised Quality Estimation for Neural Machine Translation

May 21, 2020
Marina Fomicheva, Shuo Sun, Lisa Yankovskaya, Frédéric Blain, Francisco Guzmán, Mark Fishel, Nikolaos Aletras, Vishrav Chaudhary, Lucia Specia

* Accepted for publication in TACL 

  Access Paper or Ask Questions

CCNet: Extracting High Quality Monolingual Datasets from Web Crawl Data

Nov 15, 2019
Guillaume Wenzek, Marie-Anne Lachaux, Alexis Conneau, Vishrav Chaudhary, Francisco Guzmán, Armand Joulin, Edouard Grave


  Access Paper or Ask Questions

A Massive Collection of Cross-Lingual Web-Document Pairs

Nov 10, 2019
Ahmed El-Kishky, Vishrav Chaudhary, Francisco Guzman, Philipp Koehn


  Access Paper or Ask Questions

Unsupervised Cross-lingual Representation Learning at Scale

Nov 05, 2019
Alexis Conneau, Kartikay Khandelwal, Naman Goyal, Vishrav Chaudhary, Guillaume Wenzek, Francisco Guzmán, Edouard Grave, Myle Ott, Luke Zettlemoyer, Veselin Stoyanov

* 12 pages, 7 figures 

  Access Paper or Ask Questions

Facebook AI's WAT19 Myanmar-English Translation Task Submission

Oct 15, 2019
Peng-Jen Chen, Jiajun Shen, Matt Le, Vishrav Chaudhary, Ahmed El-Kishky, Guillaume Wenzek, Myle Ott, Marc'Aurelio Ranzato

* The 6th Workshop on Asian Translation 

  Access Paper or Ask Questions

WikiMatrix: Mining 135M Parallel Sentences in 1620 Language Pairs from Wikipedia

Jul 16, 2019
Holger Schwenk, Vishrav Chaudhary, Shuo Sun, Hongyu Gong, Francisco Guzmán

* 13 pages, 3 figures, 6 tables 

  Access Paper or Ask Questions

Low-Resource Corpus Filtering using Multilingual Sentence Embeddings

Jun 20, 2019
Vishrav Chaudhary, Yuqing Tang, Francisco Guzmán, Holger Schwenk, Philipp Koehn

* Conference on Machine Translation (WMT) 2019 
* 6 pages, WMT 2019 

  Access Paper or Ask Questions

Two New Evaluation Datasets for Low-Resource Machine Translation: Nepali-English and Sinhala-English

Feb 04, 2019
Francisco Guzmán, Peng-Jen Chen, Myle Ott, Juan Pino, Guillaume Lample, Philipp Koehn, Vishrav Chaudhary, Marc'Aurelio Ranzato


  Access Paper or Ask Questions