Get our free extension to see links to code for papers anywhere online!

Chrome logo Add to Chrome

Firefox logo Add to Firefox

Picture for Vishrav Chaudhary

LAWDR: Language-Agnostic Weighted Document Representations from Pre-trained Models


Jun 07, 2021
Hongyu Gong, Vishrav Chaudhary, Yuqing Tang, Francisco Guzmán


  Access Paper or Ask Questions

The FLORES-101 Evaluation Benchmark for Low-Resource and Multilingual Machine Translation


Jun 06, 2021
Naman Goyal, Cynthia Gao, Vishrav Chaudhary, Peng-Jen Chen, Guillaume Wenzek, Da Ju, Sanjana Krishnan, Marc'Aurelio Ranzato, Francisco Guzman, Angela Fan


  Access Paper or Ask Questions

Adapting High-resource NMT Models to Translate Low-resource Related Languages without Parallel Data


Jun 02, 2021
Wei-Jen Ko, Ahmed El-Kishky, Adithya Renduchintala, Vishrav Chaudhary, Naman Goyal, Francisco Guzmán, Pascale Fung, Philipp Koehn, Mona Diab

* ACL 2021 

  Access Paper or Ask Questions

AmericasNLI: Evaluating Zero-shot Natural Language Understanding of Pretrained Multilingual Models in Truly Low-resource Languages


Apr 18, 2021
Abteen Ebrahimi, Manuel Mager, Arturo Oncevay, Vishrav Chaudhary, Luis Chiruzzo, Angela Fan, John Ortega, Ricardo Ramos, Annette Rios, Ivan Vladimir, Gustavo A. Gim├ęnez-Lugo, Elisabeth Mager, Graham Neubig, Alexis Palmer, Rolando A. Coto Solano, Ngoc Thang Vu, Katharina Kann


  Access Paper or Ask Questions

Quality Estimation without Human-labeled Data


Feb 08, 2021
Yi-Lin Tuan, Ahmed El-Kishky, Adithya Renduchintala, Vishrav Chaudhary, Francisco Guzmán, Lucia Specia

* Accepted by EACL2021 

  Access Paper or Ask Questions

Beyond English-Centric Multilingual Machine Translation


Oct 21, 2020
Angela Fan, Shruti Bhosale, Holger Schwenk, Zhiyi Ma, Ahmed El-Kishky, Siddharth Goyal, Mandeep Baines, Onur Celebi, Guillaume Wenzek, Vishrav Chaudhary, Naman Goyal, Tom Birch, Vitaliy Liptchinsky, Sergey Edunov, Edouard Grave, Michael Auli, Armand Joulin


  Access Paper or Ask Questions

MLQE-PE: A Multilingual Quality Estimation and Post-Editing Dataset


Oct 09, 2020
Marina Fomicheva, Shuo Sun, Erick Fonseca, Fr├ęd├ęric Blain, Vishrav Chaudhary, Francisco Guzm├ín, Nina Lopatina, Lucia Specia, Andr├ę F. T. Martins


  Access Paper or Ask Questions

Self-training Improves Pre-training for Natural Language Understanding


Oct 05, 2020
Jingfei Du, Edouard Grave, Beliz Gunel, Vishrav Chaudhary, Onur Celebi, Michael Auli, Ves Stoyanov, Alexis Conneau

* 8 pages 

  Access Paper or Ask Questions

Multilingual Translation with Extensible Multilingual Pretraining and Finetuning


Aug 02, 2020
Yuqing Tang, Chau Tran, Xian Li, Peng-Jen Chen, Naman Goyal, Vishrav Chaudhary, Jiatao Gu, Angela Fan

* 10 pages (main) + 5 pages (appendices). 9 tables and 2 figures 

  Access Paper or Ask Questions

Unsupervised Quality Estimation for Neural Machine Translation


May 21, 2020
Marina Fomicheva, Shuo Sun, Lisa Yankovskaya, Fr├ęd├ęric Blain, Francisco Guzm├ín, Mark Fishel, Nikolaos Aletras, Vishrav Chaudhary, Lucia Specia

* Accepted for publication in TACL 

  Access Paper or Ask Questions

CCNet: Extracting High Quality Monolingual Datasets from Web Crawl Data


Nov 15, 2019
Guillaume Wenzek, Marie-Anne Lachaux, Alexis Conneau, Vishrav Chaudhary, Francisco Guzmán, Armand Joulin, Edouard Grave


  Access Paper or Ask Questions

A Massive Collection of Cross-Lingual Web-Document Pairs


Nov 10, 2019
Ahmed El-Kishky, Vishrav Chaudhary, Francisco Guzman, Philipp Koehn


  Access Paper or Ask Questions

Unsupervised Cross-lingual Representation Learning at Scale


Nov 05, 2019
Alexis Conneau, Kartikay Khandelwal, Naman Goyal, Vishrav Chaudhary, Guillaume Wenzek, Francisco Guzmán, Edouard Grave, Myle Ott, Luke Zettlemoyer, Veselin Stoyanov

* 12 pages, 7 figures 

  Access Paper or Ask Questions

Facebook AI's WAT19 Myanmar-English Translation Task Submission


Oct 15, 2019
Peng-Jen Chen, Jiajun Shen, Matt Le, Vishrav Chaudhary, Ahmed El-Kishky, Guillaume Wenzek, Myle Ott, Marc'Aurelio Ranzato

* The 6th Workshop on Asian Translation 

  Access Paper or Ask Questions

WikiMatrix: Mining 135M Parallel Sentences in 1620 Language Pairs from Wikipedia


Jul 16, 2019
Holger Schwenk, Vishrav Chaudhary, Shuo Sun, Hongyu Gong, Francisco Guzmán

* 13 pages, 3 figures, 6 tables 

  Access Paper or Ask Questions

Low-Resource Corpus Filtering using Multilingual Sentence Embeddings


Jun 20, 2019
Vishrav Chaudhary, Yuqing Tang, Francisco Guzmán, Holger Schwenk, Philipp Koehn

* Conference on Machine Translation (WMT) 2019 
* 6 pages, WMT 2019 

  Access Paper or Ask Questions

Two New Evaluation Datasets for Low-Resource Machine Translation: Nepali-English and Sinhala-English


Feb 04, 2019
Francisco Guzmán, Peng-Jen Chen, Myle Ott, Juan Pino, Guillaume Lample, Philipp Koehn, Vishrav Chaudhary, Marc'Aurelio Ranzato


  Access Paper or Ask Questions