Get our free extension to see links to code for papers anywhere online!

Chrome logo Add to Chrome

Firefox logo Add to Firefox

Picture for Vishrav Chaudhary

Few-shot Learning with Multilingual Language Models


Dec 20, 2021
Xi Victoria Lin, Todor Mihaylov, Mikel Artetxe, Tianlu Wang, Shuohui Chen, Daniel Simig, Myle Ott, Naman Goyal, Shruti Bhosale, Jingfei Du, Ramakanth Pasunuru, Sam Shleifer, Punit Singh Koura, Vishrav Chaudhary, Brian O'Horo, Jeff Wang, Luke Zettlemoyer, Zornitsa Kozareva, Mona Diab, Veselin Stoyanov, Xian Li

* 36 pages 

  Access Paper or Ask Questions

Alternative Input Signals Ease Transfer in Multilingual Machine Translation


Oct 15, 2021
Simeng Sun, Angela Fan, James Cross, Vishrav Chaudhary, Chau Tran, Philipp Koehn, Francisco Guzman


  Access Paper or Ask Questions

Classification-based Quality Estimation: Small and Efficient Models for Real-world Applications


Sep 17, 2021
Shuo Sun, Ahmed El-Kishky, Vishrav Chaudhary, James Cross, Francisco Guzmán, Lucia Specia

* EMNLP 2021 

  Access Paper or Ask Questions

LAWDR: Language-Agnostic Weighted Document Representations from Pre-trained Models


Jun 07, 2021
Hongyu Gong, Vishrav Chaudhary, Yuqing Tang, Francisco Guzmán


  Access Paper or Ask Questions

The FLORES-101 Evaluation Benchmark for Low-Resource and Multilingual Machine Translation


Jun 06, 2021
Naman Goyal, Cynthia Gao, Vishrav Chaudhary, Peng-Jen Chen, Guillaume Wenzek, Da Ju, Sanjana Krishnan, Marc'Aurelio Ranzato, Francisco Guzman, Angela Fan


  Access Paper or Ask Questions

Adapting High-resource NMT Models to Translate Low-resource Related Languages without Parallel Data


Jun 02, 2021
Wei-Jen Ko, Ahmed El-Kishky, Adithya Renduchintala, Vishrav Chaudhary, Naman Goyal, Francisco Guzmán, Pascale Fung, Philipp Koehn, Mona Diab

* ACL 2021 

  Access Paper or Ask Questions

AmericasNLI: Evaluating Zero-shot Natural Language Understanding of Pretrained Multilingual Models in Truly Low-resource Languages


Apr 18, 2021
Abteen Ebrahimi, Manuel Mager, Arturo Oncevay, Vishrav Chaudhary, Luis Chiruzzo, Angela Fan, John Ortega, Ricardo Ramos, Annette Rios, Ivan Vladimir, Gustavo A. Gim├ęnez-Lugo, Elisabeth Mager, Graham Neubig, Alexis Palmer, Rolando A. Coto Solano, Ngoc Thang Vu, Katharina Kann


  Access Paper or Ask Questions

Quality Estimation without Human-labeled Data


Feb 08, 2021
Yi-Lin Tuan, Ahmed El-Kishky, Adithya Renduchintala, Vishrav Chaudhary, Francisco Guzmán, Lucia Specia

* Accepted by EACL2021 

  Access Paper or Ask Questions

Beyond English-Centric Multilingual Machine Translation


Oct 21, 2020
Angela Fan, Shruti Bhosale, Holger Schwenk, Zhiyi Ma, Ahmed El-Kishky, Siddharth Goyal, Mandeep Baines, Onur Celebi, Guillaume Wenzek, Vishrav Chaudhary, Naman Goyal, Tom Birch, Vitaliy Liptchinsky, Sergey Edunov, Edouard Grave, Michael Auli, Armand Joulin


  Access Paper or Ask Questions

MLQE-PE: A Multilingual Quality Estimation and Post-Editing Dataset


Oct 09, 2020
Marina Fomicheva, Shuo Sun, Erick Fonseca, Fr├ęd├ęric Blain, Vishrav Chaudhary, Francisco Guzm├ín, Nina Lopatina, Lucia Specia, Andr├ę F. T. Martins


  Access Paper or Ask Questions

Self-training Improves Pre-training for Natural Language Understanding


Oct 05, 2020
Jingfei Du, Edouard Grave, Beliz Gunel, Vishrav Chaudhary, Onur Celebi, Michael Auli, Ves Stoyanov, Alexis Conneau

* 8 pages 

  Access Paper or Ask Questions

Multilingual Translation with Extensible Multilingual Pretraining and Finetuning


Aug 02, 2020
Yuqing Tang, Chau Tran, Xian Li, Peng-Jen Chen, Naman Goyal, Vishrav Chaudhary, Jiatao Gu, Angela Fan

* 10 pages (main) + 5 pages (appendices). 9 tables and 2 figures 

  Access Paper or Ask Questions

Unsupervised Quality Estimation for Neural Machine Translation


May 21, 2020
Marina Fomicheva, Shuo Sun, Lisa Yankovskaya, Fr├ęd├ęric Blain, Francisco Guzm├ín, Mark Fishel, Nikolaos Aletras, Vishrav Chaudhary, Lucia Specia

* Accepted for publication in TACL 

  Access Paper or Ask Questions

CCNet: Extracting High Quality Monolingual Datasets from Web Crawl Data


Nov 15, 2019
Guillaume Wenzek, Marie-Anne Lachaux, Alexis Conneau, Vishrav Chaudhary, Francisco Guzmán, Armand Joulin, Edouard Grave


  Access Paper or Ask Questions

A Massive Collection of Cross-Lingual Web-Document Pairs


Nov 10, 2019
Ahmed El-Kishky, Vishrav Chaudhary, Francisco Guzman, Philipp Koehn


  Access Paper or Ask Questions

Unsupervised Cross-lingual Representation Learning at Scale


Nov 05, 2019
Alexis Conneau, Kartikay Khandelwal, Naman Goyal, Vishrav Chaudhary, Guillaume Wenzek, Francisco Guzmán, Edouard Grave, Myle Ott, Luke Zettlemoyer, Veselin Stoyanov

* 12 pages, 7 figures 

  Access Paper or Ask Questions

Facebook AI's WAT19 Myanmar-English Translation Task Submission


Oct 15, 2019
Peng-Jen Chen, Jiajun Shen, Matt Le, Vishrav Chaudhary, Ahmed El-Kishky, Guillaume Wenzek, Myle Ott, Marc'Aurelio Ranzato

* The 6th Workshop on Asian Translation 

  Access Paper or Ask Questions

WikiMatrix: Mining 135M Parallel Sentences in 1620 Language Pairs from Wikipedia


Jul 16, 2019
Holger Schwenk, Vishrav Chaudhary, Shuo Sun, Hongyu Gong, Francisco Guzmán

* 13 pages, 3 figures, 6 tables 

  Access Paper or Ask Questions

Low-Resource Corpus Filtering using Multilingual Sentence Embeddings


Jun 20, 2019
Vishrav Chaudhary, Yuqing Tang, Francisco Guzmán, Holger Schwenk, Philipp Koehn

* Conference on Machine Translation (WMT) 2019 
* 6 pages, WMT 2019 

  Access Paper or Ask Questions

Two New Evaluation Datasets for Low-Resource Machine Translation: Nepali-English and Sinhala-English


Feb 04, 2019
Francisco Guzmán, Peng-Jen Chen, Myle Ott, Juan Pino, Guillaume Lample, Philipp Koehn, Vishrav Chaudhary, Marc'Aurelio Ranzato


  Access Paper or Ask Questions