Anshuman Tripathi

Contrastive Siamese Network for Semi-supervised Speech Recognition

May 27, 2022
Soheil Khorram, Jaeyoung Kim, Anshuman Tripathi, Han Lu, Qian Zhang, Hasim Sak

Turn-to-Diarize: Online Speaker Diarization Constrained by Transformer Transducer Speaker Turn Detection

Oct 05, 2021
Wei Xia, Han Lu, Quan Wang, Anshuman Tripathi, Yiling Huang, Ignacio Lopez Moreno, Hasim Sak

Reducing Streaming ASR Model Delay with Self Alignment

May 06, 2021
Jaeyoung Kim, Han Lu, Anshuman Tripathi, Qian Zhang, Hasim Sak

Transformer Transducer: One Model Unifying Streaming and Non-streaming Speech Recognition

Oct 07, 2020
Anshuman Tripathi, Jaeyoung Kim, Qian Zhang, Han Lu, Hasim Sak

Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss

Feb 14, 2020
Qian Zhang, Han Lu, Hasim Sak, Anshuman Tripathi, Erik McDermott, Stephen Koo, Shankar Kumar

UAV Control in Close Proximities - Ceiling Effect on Battery Lifetime

Dec 31, 2018
Basaran Bahadir Kocer, Volkan Kumtepeli, Tegoeh Tjahjowidodo, Mahardhika Pratama, Anshuman Tripathi, Gerald Seet Gim Lee, Youyi Wang

Toward domain-invariant speech recognition via large scale training

Aug 16, 2018
Arun Narayanan, Ananya Misra, Khe Chai Sim, Golan Pundak, Anshuman Tripathi, Mohamed Elfeky, Parisa Haghani, Trevor Strohman, Michiel Bacchiani

Speech recognition for medical conversations

Jun 20, 2018
Chung-Cheng Chiu, Anshuman Tripathi, Katherine Chou, Chris Co, Navdeep Jaitly, Diana Jaunzeikare, Anjuli Kannan, Patrick Nguyen, Hasim Sak, Ananth Sankar, Justin Tansuwan, Nathan Wan, Yonghui Wu, Xuedong Zhang
