Alert button
Picture for Toan Q. Nguyen

Toan Q. Nguyen

Alert button

Data Augmentation by Concatenation for Low-Resource Translation: A Mystery and a Solution

May 04, 2021
Toan Q. Nguyen, Kenton Murray, David Chiang

Figure 1 for Data Augmentation by Concatenation for Low-Resource Translation: A Mystery and a Solution
Figure 2 for Data Augmentation by Concatenation for Low-Resource Translation: A Mystery and a Solution
Figure 3 for Data Augmentation by Concatenation for Low-Resource Translation: A Mystery and a Solution
Figure 4 for Data Augmentation by Concatenation for Low-Resource Translation: A Mystery and a Solution
Viaarxiv icon

Quality at a Glance: An Audit of Web-Crawled Multilingual Datasets

Mar 22, 2021
Isaac Caswell, Julia Kreutzer, Lisa Wang, Ahsan Wahab, Daan van Esch, Nasanbayar Ulzii-Orshikh, Allahsera Tapo, Nishant Subramani, Artem Sokolov, Claytone Sikasote, Monang Setyawan, Supheakmungkol Sarin, Sokhar Samb, Benoît Sagot, Clara Rivera, Annette Rios, Isabel Papadimitriou, Salomey Osei, Pedro Javier Ortiz Suárez, Iroro Orife, Kelechi Ogueji, Rubungo Andre Niyongabo, Toan Q. Nguyen, Mathias Müller, André Müller, Shamsuddeen Hassan Muhammad, Nanda Muhammad, Ayanda Mnyakeni, Jamshidbek Mirzakhalov, Tapiwanashe Matangira, Colin Leong, Nze Lawson, Sneha Kudugunta, Yacine Jernite, Mathias Jenny, Orhan Firat, Bonaventure F. P. Dossou, Sakhile Dlamini, Nisansa de Silva, Sakine Çabuk Ballı, Stella Biderman, Alessia Battisti, Ahmed Baruwa, Ankur Bapna, Pallavi Baljekar, Israel Abebe Azime, Ayodele Awokoya, Duygu Ataman, Orevaoghene Ahia, Oghenefego Ahia, Sweta Agrawal, Mofetoluwa Adeyemi

Figure 1 for Quality at a Glance: An Audit of Web-Crawled Multilingual Datasets
Figure 2 for Quality at a Glance: An Audit of Web-Crawled Multilingual Datasets
Figure 3 for Quality at a Glance: An Audit of Web-Crawled Multilingual Datasets
Figure 4 for Quality at a Glance: An Audit of Web-Crawled Multilingual Datasets
Viaarxiv icon

Pseudolikelihood Reranking with Masked Language Models

Oct 31, 2019
Julian Salazar, Davis Liang, Toan Q. Nguyen, Katrin Kirchhoff

Figure 1 for Pseudolikelihood Reranking with Masked Language Models
Figure 2 for Pseudolikelihood Reranking with Masked Language Models
Figure 3 for Pseudolikelihood Reranking with Masked Language Models
Figure 4 for Pseudolikelihood Reranking with Masked Language Models
Viaarxiv icon

Transformers without Tears: Improving the Normalization of Self-Attention

Oct 14, 2019
Toan Q. Nguyen, Julian Salazar

Figure 1 for Transformers without Tears: Improving the Normalization of Self-Attention
Figure 2 for Transformers without Tears: Improving the Normalization of Self-Attention
Figure 3 for Transformers without Tears: Improving the Normalization of Self-Attention
Figure 4 for Transformers without Tears: Improving the Normalization of Self-Attention
Viaarxiv icon

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Oct 01, 2019
Kenton Murray, Jeffery Kinnison, Toan Q. Nguyen, Walter Scheirer, David Chiang

Figure 1 for Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation
Figure 2 for Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation
Figure 3 for Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation
Figure 4 for Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation
Viaarxiv icon

Improving Lexical Choice in Neural Machine Translation

Apr 17, 2018
Toan Q. Nguyen, David Chiang

Figure 1 for Improving Lexical Choice in Neural Machine Translation
Figure 2 for Improving Lexical Choice in Neural Machine Translation
Figure 3 for Improving Lexical Choice in Neural Machine Translation
Figure 4 for Improving Lexical Choice in Neural Machine Translation
Viaarxiv icon