Building Machine Translation Systems for the Next Thousand Languages

Ankur Bapna, Isaac Caswell, Julia Kreutzer, Orhan Firat, Daan van Esch, Aditya Siddhant, Mengmeng Niu, Pallavi Baljekar, Xavier Garcia, Wolfgang Macherey, Theresa Breiner, Vera Axelrod, Jason Riesa, Yuan Cao, Mia Xu Chen, Klaus Macherey, Maxim Krikun, Pidong Wang, Alexander Gutkin, Apurva Shah, Yanping Huang, Zhifeng Chen, Yonghui Wu, Macduff Hughes

* V2: updated with some details from 24-language Google Translate launch in May 2022 

Google Crowdsourced Speech Corpora and Related Open-Source Resources for Low-Resource Languages and Dialects: An Overview

Alena Butryna, Shan-Hui Cathy Chu, Isin Demirsahin, Alexander Gutkin, Linne Ha, Fei He, Martin Jansche, Cibu Johny, Anna Katanova, Oddur Kjartansson, Chenfang Li, Tatiana Merkulova, Yin May Oo, Knot Pipatsrisawat, Clara Rivera, Supheakmungkol Sarin, Pasindu de Silva, Keshan Sodimana, Richard Sproat, Theeraphol Wattanavekin, Jaka Aris Eko Wibawa

* Appeared in the 2019 UNESCO International Conference Language Technologies for All (LT4All): Enabling Linguistic Diversity and Multilingualism Worldwide, 4-6 December, Paris, France

NEMO: Frequentist Inference Approach to Constrained Linguistic Typology Feature Prediction in SIGTYP 2020 Shared Task

Alexander Gutkin, Richard Sproat

* To appear in the Second Workshop on Computational Research in Linguistic Typology (SIGTYP 2020) at the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)

Towards Induction of Structured Phoneme Inventories

Alexander Gutkin, Martin Jansche, Lucy Skidmore

* To appear in the Second Workshop on Computational Research in Linguistic Typology (SIGTYP 2020) at EMNLP 2020 

Linguistic Typology Features from Text: Inferring the Sparse Features of World Atlas of Language Structures

Alexander Gutkin, Tatiana Merkulova, Martin Jansche

* Originally prepared as a conference submission to EMNLP 2018 

Sampling from Stochastic Finite Automata with Applications to CTC Decoding

Martin Jansche, Alexander Gutkin
