Alert button
Picture for Yanzhang He

Yanzhang He

Alert button

Extreme Encoder Output Frame Rate Reduction: Improving Computational Latencies of Large End-to-End Models

Add code
Bookmark button
Alert button
Feb 27, 2024
Rohit Prabhavalkar, Zhong Meng, Weiran Wang, Adam Stooke, Xingyu Cai, Yanzhang He, Arun Narayanan, Dongseong Hwang, Tara N. Sainath, Pedro J. Moreno

Viaarxiv icon

USM-Lite: Quantization and Sparsity Aware Fine-tuning for Speech Recognition with Universal Speech Models

Add code
Bookmark button
Alert button
Jan 03, 2024
Shaojin Ding, David Qiu, David Rim, Yanzhang He, Oleg Rybakov, Bo Li, Rohit Prabhavalkar, Weiran Wang, Tara N. Sainath, Shivani Agrawal, Zhonglin Han, Jian Li, Amir Yazdanbakhsh

Figure 1 for USM-Lite: Quantization and Sparsity Aware Fine-tuning for Speech Recognition with Universal Speech Models
Figure 2 for USM-Lite: Quantization and Sparsity Aware Fine-tuning for Speech Recognition with Universal Speech Models
Figure 3 for USM-Lite: Quantization and Sparsity Aware Fine-tuning for Speech Recognition with Universal Speech Models
Figure 4 for USM-Lite: Quantization and Sparsity Aware Fine-tuning for Speech Recognition with Universal Speech Models
Viaarxiv icon

Partial Rewriting for Multi-Stage ASR

Add code
Bookmark button
Alert button
Dec 08, 2023
Antoine Bruguier, David Qiu, Yanzhang He

Viaarxiv icon

Massive End-to-end Models for Short Search Queries

Add code
Bookmark button
Alert button
Sep 22, 2023
Weiran Wang, Rohit Prabhavalkar, Dongseong Hwang, Qiujia Li, Khe Chai Sim, Bo Li, James Qin, Xingyu Cai, Adam Stooke, Zhong Meng, CJ Zheng, Yanzhang He, Tara Sainath, Pedro Moreno Mengibar

Figure 1 for Massive End-to-end Models for Short Search Queries
Figure 2 for Massive End-to-end Models for Short Search Queries
Figure 3 for Massive End-to-end Models for Short Search Queries
Figure 4 for Massive End-to-end Models for Short Search Queries
Viaarxiv icon

2-bit Conformer quantization for automatic speech recognition

Add code
Bookmark button
Alert button
May 26, 2023
Oleg Rybakov, Phoenix Meadowlark, Shaojin Ding, David Qiu, Jian Li, David Rim, Yanzhang He

Figure 1 for 2-bit Conformer quantization for automatic speech recognition
Figure 2 for 2-bit Conformer quantization for automatic speech recognition
Figure 3 for 2-bit Conformer quantization for automatic speech recognition
Figure 4 for 2-bit Conformer quantization for automatic speech recognition
Viaarxiv icon

RAND: Robustness Aware Norm Decay For Quantized Seq2seq Models

Add code
Bookmark button
Alert button
May 24, 2023
David Qiu, David Rim, Shaojin Ding, Oleg Rybakov, Yanzhang He

Figure 1 for RAND: Robustness Aware Norm Decay For Quantized Seq2seq Models
Figure 2 for RAND: Robustness Aware Norm Decay For Quantized Seq2seq Models
Figure 3 for RAND: Robustness Aware Norm Decay For Quantized Seq2seq Models
Figure 4 for RAND: Robustness Aware Norm Decay For Quantized Seq2seq Models
Viaarxiv icon

Sharing Low Rank Conformer Weights for Tiny Always-On Ambient Speech Recognition Models

Add code
Bookmark button
Alert button
Mar 15, 2023
Steven M. Hernandez, Ding Zhao, Shaojin Ding, Antoine Bruguier, Rohit Prabhavalkar, Tara N. Sainath, Yanzhang He, Ian McGraw

Figure 1 for Sharing Low Rank Conformer Weights for Tiny Always-On Ambient Speech Recognition Models
Figure 2 for Sharing Low Rank Conformer Weights for Tiny Always-On Ambient Speech Recognition Models
Figure 3 for Sharing Low Rank Conformer Weights for Tiny Always-On Ambient Speech Recognition Models
Figure 4 for Sharing Low Rank Conformer Weights for Tiny Always-On Ambient Speech Recognition Models
Viaarxiv icon

E2E Segmentation in a Two-Pass Cascaded Encoder ASR Model

Add code
Bookmark button
Alert button
Nov 28, 2022
W. Ronny Huang, Shuo-Yiin Chang, Tara N. Sainath, Yanzhang He, David Rybach, Robert David, Rohit Prabhavalkar, Cyril Allauzen, Cal Peyser, Trevor D. Strohman

Figure 1 for E2E Segmentation in a Two-Pass Cascaded Encoder ASR Model
Figure 2 for E2E Segmentation in a Two-Pass Cascaded Encoder ASR Model
Figure 3 for E2E Segmentation in a Two-Pass Cascaded Encoder ASR Model
Figure 4 for E2E Segmentation in a Two-Pass Cascaded Encoder ASR Model
Viaarxiv icon

Unified End-to-End Speech Recognition and Endpointing for Fast and Efficient Speech Systems

Add code
Bookmark button
Alert button
Nov 01, 2022
Shaan Bijwadia, Shuo-yiin Chang, Bo Li, Tara Sainath, Chao Zhang, Yanzhang He

Figure 1 for Unified End-to-End Speech Recognition and Endpointing for Fast and Efficient Speech Systems
Figure 2 for Unified End-to-End Speech Recognition and Endpointing for Fast and Efficient Speech Systems
Figure 3 for Unified End-to-End Speech Recognition and Endpointing for Fast and Efficient Speech Systems
Figure 4 for Unified End-to-End Speech Recognition and Endpointing for Fast and Efficient Speech Systems
Viaarxiv icon