Alert button
Picture for Antoine Bruguier

Antoine Bruguier

Alert button

Partial Rewriting for Multi-Stage ASR

Add code
Bookmark button
Alert button
Dec 08, 2023
Antoine Bruguier, David Qiu, Yanzhang He

Viaarxiv icon

Sharing Low Rank Conformer Weights for Tiny Always-On Ambient Speech Recognition Models

Add code
Bookmark button
Alert button
Mar 15, 2023
Steven M. Hernandez, Ding Zhao, Shaojin Ding, Antoine Bruguier, Rohit Prabhavalkar, Tara N. Sainath, Yanzhang He, Ian McGraw

Figure 1 for Sharing Low Rank Conformer Weights for Tiny Always-On Ambient Speech Recognition Models
Figure 2 for Sharing Low Rank Conformer Weights for Tiny Always-On Ambient Speech Recognition Models
Figure 3 for Sharing Low Rank Conformer Weights for Tiny Always-On Ambient Speech Recognition Models
Figure 4 for Sharing Low Rank Conformer Weights for Tiny Always-On Ambient Speech Recognition Models
Viaarxiv icon

Neural-FST Class Language Model for End-to-End Speech Recognition

Add code
Bookmark button
Alert button
Jan 31, 2022
Antoine Bruguier, Duc Le, Rohit Prabhavalkar, Dangna Li, Zhe Liu, Bo Wang, Eun Chang, Fuchun Peng, Ozlem Kalinli, Michael L. Seltzer

Figure 1 for Neural-FST Class Language Model for End-to-End Speech Recognition
Figure 2 for Neural-FST Class Language Model for End-to-End Speech Recognition
Figure 3 for Neural-FST Class Language Model for End-to-End Speech Recognition
Viaarxiv icon

Improved Neural Language Model Fusion for Streaming Recurrent Neural Network Transducer

Add code
Bookmark button
Alert button
Oct 26, 2020
Suyoun Kim, Yuan Shangguan, Jay Mahadeokar, Antoine Bruguier, Christian Fuegen, Michael L. Seltzer, Duc Le

Figure 1 for Improved Neural Language Model Fusion for Streaming Recurrent Neural Network Transducer
Figure 2 for Improved Neural Language Model Fusion for Streaming Recurrent Neural Network Transducer
Figure 3 for Improved Neural Language Model Fusion for Streaming Recurrent Neural Network Transducer
Figure 4 for Improved Neural Language Model Fusion for Streaming Recurrent Neural Network Transducer
Viaarxiv icon

A Streaming On-Device End-to-End Model Surpassing Server-Side Conventional Model Quality and Latency

Add code
Bookmark button
Alert button
Mar 28, 2020
Tara N. Sainath, Yanzhang He, Bo Li, Arun Narayanan, Ruoming Pang, Antoine Bruguier, Shuo-yiin Chang, Wei Li, Raziel Alvarez, Zhifeng Chen, Chung-Cheng Chiu, David Garcia, Alex Gruenstein, Ke Hu, Minho Jin, Anjuli Kannan, Qiao Liang, Ian McGraw, Cal Peyser, Rohit Prabhavalkar, Golan Pundak, David Rybach, Yuan Shangguan, Yash Sheth, Trevor Strohman, Mirko Visontai, Yonghui Wu, Yu Zhang, Ding Zhao

Figure 1 for A Streaming On-Device End-to-End Model Surpassing Server-Side Conventional Model Quality and Latency
Figure 2 for A Streaming On-Device End-to-End Model Surpassing Server-Side Conventional Model Quality and Latency
Figure 3 for A Streaming On-Device End-to-End Model Surpassing Server-Side Conventional Model Quality and Latency
Figure 4 for A Streaming On-Device End-to-End Model Surpassing Server-Side Conventional Model Quality and Latency
Viaarxiv icon

Phoneme-Based Contextualization for Cross-Lingual Speech Recognition in End-to-End Models

Add code
Bookmark button
Alert button
Jul 22, 2019
Ke Hu, Antoine Bruguier, Tara N. Sainath, Rohit Prabhavalkar, Golan Pundak

Figure 1 for Phoneme-Based Contextualization for Cross-Lingual Speech Recognition in End-to-End Models
Figure 2 for Phoneme-Based Contextualization for Cross-Lingual Speech Recognition in End-to-End Models
Figure 3 for Phoneme-Based Contextualization for Cross-Lingual Speech Recognition in End-to-End Models
Figure 4 for Phoneme-Based Contextualization for Cross-Lingual Speech Recognition in End-to-End Models
Viaarxiv icon

Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling

Add code
Bookmark button
Alert button
Feb 21, 2019
Jonathan Shen, Patrick Nguyen, Yonghui Wu, Zhifeng Chen, Mia X. Chen, Ye Jia, Anjuli Kannan, Tara Sainath, Yuan Cao, Chung-Cheng Chiu, Yanzhang He, Jan Chorowski, Smit Hinsu, Stella Laurenzo, James Qin, Orhan Firat, Wolfgang Macherey, Suyog Gupta, Ankur Bapna, Shuyuan Zhang, Ruoming Pang, Ron J. Weiss, Rohit Prabhavalkar, Qiao Liang, Benoit Jacob, Bowen Liang, HyoukJoong Lee, Ciprian Chelba, Sébastien Jean, Bo Li, Melvin Johnson, Rohan Anil, Rajat Tibrewal, Xiaobing Liu, Akiko Eriguchi, Navdeep Jaitly, Naveen Ari, Colin Cherry, Parisa Haghani, Otavio Good, Youlong Cheng, Raziel Alvarez, Isaac Caswell, Wei-Ning Hsu, Zongheng Yang, Kuan-Chieh Wang, Ekaterina Gonina, Katrin Tomanek, Ben Vanik, Zelin Wu, Llion Jones, Mike Schuster, Yanping Huang, Dehao Chen, Kazuki Irie, George Foster, John Richardson, Klaus Macherey, Antoine Bruguier, Heiga Zen, Colin Raffel, Shankar Kumar, Kanishka Rao, David Rybach, Matthew Murray, Vijayaditya Peddinti, Maxim Krikun, Michiel A. U. Bacchiani, Thomas B. Jablin, Rob Suderman, Ian Williams, Benjamin Lee, Deepti Bhatia, Justin Carlson, Semih Yavuz, Yu Zhang, Ian McGraw, Max Galkin, Qi Ge, Golan Pundak, Chad Whipkey, Todd Wang, Uri Alon, Dmitry Lepikhin, Ye Tian, Sara Sabour, William Chan, Shubham Toshniwal, Baohua Liao, Michael Nirschl, Pat Rondon

Figure 1 for Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling
Figure 2 for Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling
Figure 3 for Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling
Viaarxiv icon

Model Unit Exploration for Sequence-to-Sequence Speech Recognition

Add code
Bookmark button
Alert button
Feb 05, 2019
Kazuki Irie, Rohit Prabhavalkar, Anjuli Kannan, Antoine Bruguier, David Rybach, Patrick Nguyen

Figure 1 for Model Unit Exploration for Sequence-to-Sequence Speech Recognition
Figure 2 for Model Unit Exploration for Sequence-to-Sequence Speech Recognition
Figure 3 for Model Unit Exploration for Sequence-to-Sequence Speech Recognition
Figure 4 for Model Unit Exploration for Sequence-to-Sequence Speech Recognition
Viaarxiv icon

NN-grams: Unifying neural network and n-gram language models for Speech Recognition

Add code
Bookmark button
Alert button
Jun 23, 2016
Babak Damavandi, Shankar Kumar, Noam Shazeer, Antoine Bruguier

Figure 1 for NN-grams: Unifying neural network and n-gram language models for Speech Recognition
Figure 2 for NN-grams: Unifying neural network and n-gram language models for Speech Recognition
Figure 3 for NN-grams: Unifying neural network and n-gram language models for Speech Recognition
Figure 4 for NN-grams: Unifying neural network and n-gram language models for Speech Recognition
Viaarxiv icon

On the Compression of Recurrent Neural Networks with an Application to LVCSR acoustic modeling for Embedded Speech Recognition

Add code
Bookmark button
Alert button
May 02, 2016
Rohit Prabhavalkar, Ouais Alsharif, Antoine Bruguier, Ian McGraw

Figure 1 for On the Compression of Recurrent Neural Networks with an Application to LVCSR acoustic modeling for Embedded Speech Recognition
Figure 2 for On the Compression of Recurrent Neural Networks with an Application to LVCSR acoustic modeling for Embedded Speech Recognition
Viaarxiv icon