Get our free extension to see links to code for papers anywhere online!

 Add to Chrome

 Add to Firefox

CatalyzeX Code Finder - Browser extension linking code for ML papers across the web! | Product Hunt Embed
Beyond English-Centric Multilingual Machine Translation

Oct 21, 2020
Angela Fan, Shruti Bhosale, Holger Schwenk, Zhiyi Ma, Ahmed El-Kishky, Siddharth Goyal, Mandeep Baines, Onur Celebi, Guillaume Wenzek, Vishrav Chaudhary, Naman Goyal, Tom Birch, Vitaliy Liptchinsky, Sergey Edunov, Edouard Grave, Michael Auli, Armand Joulin


  Access Paper or Ask Questions

CCMatrix: Mining Billions of High-Quality Parallel Sentences on the WEB

Nov 10, 2019
Holger Schwenk, Guillaume Wenzek, Sergey Edunov, Edouard Grave, Armand Joulin

* 13 pages, 4 figures. arXiv admin note: text overlap with arXiv:1907.05791 

  Access Paper or Ask Questions

MLQA: Evaluating Cross-lingual Extractive Question Answering

Nov 07, 2019
Patrick Lewis, Barlas Oğuz, Ruty Rinott, Sebastian Riedel, Holger Schwenk


  Access Paper or Ask Questions

WikiMatrix: Mining 135M Parallel Sentences in 1620 Language Pairs from Wikipedia

Jul 16, 2019
Holger Schwenk, Vishrav Chaudhary, Shuo Sun, Hongyu Gong, Francisco Guzmán

* 13 pages, 3 figures, 6 tables 

  Access Paper or Ask Questions

Low-Resource Corpus Filtering using Multilingual Sentence Embeddings

Jun 20, 2019
Vishrav Chaudhary, Yuqing Tang, Francisco Guzmán, Holger Schwenk, Philipp Koehn

* Conference on Machine Translation (WMT) 2019 
* 6 pages, WMT 2019 

  Access Paper or Ask Questions

Massively Multilingual Sentence Embeddings for Zero-Shot Cross-Lingual Transfer and Beyond

Dec 26, 2018
Mikel Artetxe, Holger Schwenk


  Access Paper or Ask Questions

Margin-based Parallel Corpus Mining with Multilingual Sentence Embeddings

Nov 03, 2018
Mikel Artetxe, Holger Schwenk


  Access Paper or Ask Questions

XNLI: Evaluating Cross-lingual Sentence Representations

Sep 13, 2018
Alexis Conneau, Guillaume Lample, Ruty Rinott, Adina Williams, Samuel R. Bowman, Holger Schwenk, Veselin Stoyanov

* EMNLP 2018 

  Access Paper or Ask Questions

Supervised Learning of Universal Sentence Representations from Natural Language Inference Data

Jul 08, 2018
Alexis Conneau, Douwe Kiela, Holger Schwenk, Loic Barrault, Antoine Bordes

* EMNLP 2017 

  Access Paper or Ask Questions

Filtering and Mining Parallel Data in a Joint Multilingual Space

May 24, 2018
Holger Schwenk

* ACL, July 2018, Melbourne 
* 8 pages 

  Access Paper or Ask Questions

A Corpus for Multilingual Document Classification in Eight Languages

May 24, 2018
Holger Schwenk, Xian Li

* LREC, May 2018, Miyazaki, Japan 
* 4 pages 

  Access Paper or Ask Questions

Learning Joint Multilingual Sentence Representations with Neural Machine Translation

Aug 08, 2017
Holger Schwenk, Matthijs Douze

* 11 pages, 2 figures, published at ACL workshop RepL4NLP 

  Access Paper or Ask Questions

Very Deep Convolutional Networks for Text Classification

Jan 27, 2017
Alexis Conneau, Holger Schwenk, Loïc Barrault, Yann Lecun

* 10 pages, EACL 2017, camera-ready 

  Access Paper or Ask Questions

Incremental Adaptation Strategies for Neural Network Language Models

Jul 07, 2015
Aram Ter-Sarkisov, Holger Schwenk, Loic Barrault, Fethi Bougares

* accepted as workshop paper at ACL-IJCNLP 2015 

  Access Paper or Ask Questions

On Using Monolingual Corpora in Neural Machine Translation

Jun 12, 2015
Caglar Gulcehre, Orhan Firat, Kelvin Xu, Kyunghyun Cho, Loic Barrault, Huei-Chi Lin, Fethi Bougares, Holger Schwenk, Yoshua Bengio

* 9 pages, 2 figures 

  Access Paper or Ask Questions

Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation

Sep 03, 2014
Kyunghyun Cho, Bart van Merrienboer, Caglar Gulcehre, Dzmitry Bahdanau, Fethi Bougares, Holger Schwenk, Yoshua Bengio

* EMNLP 2014 

  Access Paper or Ask Questions