Alert button
Picture for Viet Anh Trinh

Viet Anh Trinh

Alert button

CUNY Graduate Center

Two-pass Endpoint Detection for Speech Recognition

Add code
Bookmark button
Alert button
Jan 17, 2024
Anirudh Raju, Aparna Khare, Di He, Ilya Sklyar, Long Chen, Sam Alptekin, Viet Anh Trinh, Zhe Zhang, Colin Vaz, Venkatesh Ravichandran, Roland Maas, Ariya Rastrow

Viaarxiv icon

Adaptive Endpointing with Deep Contextual Multi-armed Bandits

Add code
Bookmark button
Alert button
Mar 23, 2023
Do June Min, Andreas Stolcke, Anirudh Raju, Colin Vaz, Di He, Venkatesh Ravichandran, Viet Anh Trinh

Figure 1 for Adaptive Endpointing with Deep Contextual Multi-armed Bandits
Figure 2 for Adaptive Endpointing with Deep Contextual Multi-armed Bandits
Figure 3 for Adaptive Endpointing with Deep Contextual Multi-armed Bandits
Figure 4 for Adaptive Endpointing with Deep Contextual Multi-armed Bandits
Viaarxiv icon

Reducing Geographic Disparities in Automatic Speech Recognition via Elastic Weight Consolidation

Add code
Bookmark button
Alert button
Jul 16, 2022
Viet Anh Trinh, Pegah Ghahremani, Brian King, Jasha Droppo, Andreas Stolcke, Roland Maas

Figure 1 for Reducing Geographic Disparities in Automatic Speech Recognition via Elastic Weight Consolidation
Figure 2 for Reducing Geographic Disparities in Automatic Speech Recognition via Elastic Weight Consolidation
Viaarxiv icon

ImportantAug: a data augmentation agent for speech

Add code
Bookmark button
Alert button
Dec 14, 2021
Viet Anh Trinh, Hassan Salami Kavaki, Michael I Mandel

Figure 1 for ImportantAug: a data augmentation agent for speech
Figure 2 for ImportantAug: a data augmentation agent for speech
Figure 3 for ImportantAug: a data augmentation agent for speech
Figure 4 for ImportantAug: a data augmentation agent for speech
Viaarxiv icon

Unsupervised Speech Enhancement with speech recognition embedding and disentanglement losses

Add code
Bookmark button
Alert button
Nov 16, 2021
Viet Anh Trinh, Sebastian Braun

Figure 1 for Unsupervised Speech Enhancement with speech recognition embedding and disentanglement losses
Figure 2 for Unsupervised Speech Enhancement with speech recognition embedding and disentanglement losses
Figure 3 for Unsupervised Speech Enhancement with speech recognition embedding and disentanglement losses
Figure 4 for Unsupervised Speech Enhancement with speech recognition embedding and disentanglement losses
Viaarxiv icon

Combining Spatial Clustering with LSTM Speech Models for Multichannel Speech Enhancement

Add code
Bookmark button
Alert button
Dec 02, 2020
Felix Grezes, Zhaoheng Ni, Viet Anh Trinh, Michael Mandel

Figure 1 for Combining Spatial Clustering with LSTM Speech Models for Multichannel Speech Enhancement
Figure 2 for Combining Spatial Clustering with LSTM Speech Models for Multichannel Speech Enhancement
Figure 3 for Combining Spatial Clustering with LSTM Speech Models for Multichannel Speech Enhancement
Figure 4 for Combining Spatial Clustering with LSTM Speech Models for Multichannel Speech Enhancement
Viaarxiv icon

Improved MVDR Beamforming Using LSTM Speech Models to Clean Spatial Clustering Masks

Add code
Bookmark button
Alert button
Dec 02, 2020
Zhaoheng Ni, Felix Grezes, Viet Anh Trinh, Michael I. Mandel

Figure 1 for Improved MVDR Beamforming Using LSTM Speech Models to Clean Spatial Clustering Masks
Figure 2 for Improved MVDR Beamforming Using LSTM Speech Models to Clean Spatial Clustering Masks
Figure 3 for Improved MVDR Beamforming Using LSTM Speech Models to Clean Spatial Clustering Masks
Figure 4 for Improved MVDR Beamforming Using LSTM Speech Models to Clean Spatial Clustering Masks
Viaarxiv icon

Enhancement of Spatial Clustering-Based Time-Frequency Masks using LSTM Neural Networks

Add code
Bookmark button
Alert button
Dec 02, 2020
Felix Grezes, Zhaoheng Ni, Viet Anh Trinh, Michael Mandel

Figure 1 for Enhancement of Spatial Clustering-Based Time-Frequency Masks using LSTM Neural Networks
Figure 2 for Enhancement of Spatial Clustering-Based Time-Frequency Masks using LSTM Neural Networks
Figure 3 for Enhancement of Spatial Clustering-Based Time-Frequency Masks using LSTM Neural Networks
Figure 4 for Enhancement of Spatial Clustering-Based Time-Frequency Masks using LSTM Neural Networks
Viaarxiv icon

Large scale evaluation of importance maps in automatic speech recognition

Add code
Bookmark button
Alert button
May 21, 2020
Viet Anh Trinh, Michael I Mandel

Figure 1 for Large scale evaluation of importance maps in automatic speech recognition
Figure 2 for Large scale evaluation of importance maps in automatic speech recognition
Figure 3 for Large scale evaluation of importance maps in automatic speech recognition
Figure 4 for Large scale evaluation of importance maps in automatic speech recognition
Viaarxiv icon