Alert button
Picture for Kshitiz Kumar

Kshitiz Kumar

Alert button

Bilingual Streaming ASR with Grapheme units and Auxiliary Monolingual Loss

Add code
Bookmark button
Alert button
Aug 11, 2023
Mohammad Soleymanpour, Mahmoud Al Ismail, Fahimeh Bahmaninezhad, Kshitiz Kumar, Jian Wu

Figure 1 for Bilingual Streaming ASR with Grapheme units and Auxiliary Monolingual Loss
Figure 2 for Bilingual Streaming ASR with Grapheme units and Auxiliary Monolingual Loss
Figure 3 for Bilingual Streaming ASR with Grapheme units and Auxiliary Monolingual Loss
Figure 4 for Bilingual Streaming ASR with Grapheme units and Auxiliary Monolingual Loss
Viaarxiv icon

Maximizing Audio Event Detection Model Performance on Small Datasets Through Knowledge Transfer, Data Augmentation, And Pretraining: An Ablation Study

Add code
Bookmark button
Alert button
Feb 07, 2022
Daniel Tompkins, Kshitiz Kumar, Jian Wu

Figure 1 for Maximizing Audio Event Detection Model Performance on Small Datasets Through Knowledge Transfer, Data Augmentation, And Pretraining: An Ablation Study
Figure 2 for Maximizing Audio Event Detection Model Performance on Small Datasets Through Knowledge Transfer, Data Augmentation, And Pretraining: An Ablation Study
Figure 3 for Maximizing Audio Event Detection Model Performance on Small Datasets Through Knowledge Transfer, Data Augmentation, And Pretraining: An Ablation Study
Figure 4 for Maximizing Audio Event Detection Model Performance on Small Datasets Through Knowledge Transfer, Data Augmentation, And Pretraining: An Ablation Study
Viaarxiv icon

Sequence-level Confidence Classifier for ASR Utterance Accuracy and Application to Acoustic Models

Add code
Bookmark button
Alert button
Jun 30, 2021
Amber Afshan, Kshitiz Kumar, Jian Wu

Figure 1 for Sequence-level Confidence Classifier for ASR Utterance Accuracy and Application to Acoustic Models
Figure 2 for Sequence-level Confidence Classifier for ASR Utterance Accuracy and Application to Acoustic Models
Figure 3 for Sequence-level Confidence Classifier for ASR Utterance Accuracy and Application to Acoustic Models
Figure 4 for Sequence-level Confidence Classifier for ASR Utterance Accuracy and Application to Acoustic Models
Viaarxiv icon

Transfer Learning Approaches for Streaming End-to-End Speech Recognition System

Add code
Bookmark button
Alert button
Aug 17, 2020
Vikas Joshi, Rui Zhao, Rupesh R. Mehta, Kshitiz Kumar, Jinyu Li

Figure 1 for Transfer Learning Approaches for Streaming End-to-End Speech Recognition System
Figure 2 for Transfer Learning Approaches for Streaming End-to-End Speech Recognition System
Figure 3 for Transfer Learning Approaches for Streaming End-to-End Speech Recognition System
Figure 4 for Transfer Learning Approaches for Streaming End-to-End Speech Recognition System
Viaarxiv icon

Speaker Adaptation for End-to-End CTC Models

Add code
Bookmark button
Alert button
Jan 04, 2019
Ke Li, Jinyu Li, Yong Zhao, Kshitiz Kumar, Yifan Gong

Figure 1 for Speaker Adaptation for End-to-End CTC Models
Figure 2 for Speaker Adaptation for End-to-End CTC Models
Figure 3 for Speaker Adaptation for End-to-End CTC Models
Figure 4 for Speaker Adaptation for End-to-End CTC Models
Viaarxiv icon