Alert button
Picture for Ke Tan

Ke Tan

Alert button

A Closer Look at Wav2Vec2 Embeddings for On-Device Single-Channel Speech Enhancement

Add code
Bookmark button
Alert button
Mar 03, 2024
Ravi Shankar, Ke Tan, Buye Xu, Anurag Kumar

Figure 1 for A Closer Look at Wav2Vec2 Embeddings for On-Device Single-Channel Speech Enhancement
Figure 2 for A Closer Look at Wav2Vec2 Embeddings for On-Device Single-Channel Speech Enhancement
Figure 3 for A Closer Look at Wav2Vec2 Embeddings for On-Device Single-Channel Speech Enhancement
Figure 4 for A Closer Look at Wav2Vec2 Embeddings for On-Device Single-Channel Speech Enhancement
Viaarxiv icon

Holmes: Towards Distributed Training Across Clusters with Heterogeneous NIC Environment

Add code
Bookmark button
Alert button
Dec 11, 2023
Fei Yang, Shuang Peng, Ning Sun, Fangyu Wang, Ke Tan, Fu Wu, Jiezhong Qiu, Aimin Pan

Viaarxiv icon

TorchAudio-Squim: Reference-less Speech Quality and Intelligibility measures in TorchAudio

Add code
Bookmark button
Alert button
Apr 04, 2023
Anurag Kumar, Ke Tan, Zhaoheng Ni, Pranay Manocha, Xiaohui Zhang, Ethan Henderson, Buye Xu

Figure 1 for TorchAudio-Squim: Reference-less Speech Quality and Intelligibility measures in TorchAudio
Figure 2 for TorchAudio-Squim: Reference-less Speech Quality and Intelligibility measures in TorchAudio
Figure 3 for TorchAudio-Squim: Reference-less Speech Quality and Intelligibility measures in TorchAudio
Figure 4 for TorchAudio-Squim: Reference-less Speech Quality and Intelligibility measures in TorchAudio
Viaarxiv icon

Rethinking complex-valued deep neural networks for monaural speech enhancement

Add code
Bookmark button
Alert button
Jan 11, 2023
Haibin Wu, Ke Tan, Buye Xu, Anurag Kumar, Daniel Wong

Figure 1 for Rethinking complex-valued deep neural networks for monaural speech enhancement
Figure 2 for Rethinking complex-valued deep neural networks for monaural speech enhancement
Figure 3 for Rethinking complex-valued deep neural networks for monaural speech enhancement
Figure 4 for Rethinking complex-valued deep neural networks for monaural speech enhancement
Viaarxiv icon

Leveraging Heteroscedastic Uncertainty in Learning Complex Spectral Mapping for Single-channel Speech Enhancement

Add code
Bookmark button
Alert button
Nov 16, 2022
Kuan-Lin Chen, Daniel D. E. Wong, Ke Tan, Buye Xu, Anurag Kumar, Vamsi Krishna Ithapu

Figure 1 for Leveraging Heteroscedastic Uncertainty in Learning Complex Spectral Mapping for Single-channel Speech Enhancement
Figure 2 for Leveraging Heteroscedastic Uncertainty in Learning Complex Spectral Mapping for Single-channel Speech Enhancement
Figure 3 for Leveraging Heteroscedastic Uncertainty in Learning Complex Spectral Mapping for Single-channel Speech Enhancement
Viaarxiv icon

Location-based training for multi-channel talker-independent speaker separation

Add code
Bookmark button
Alert button
Oct 08, 2021
Hassan Taherian, Ke Tan, DeLiang Wang

Figure 1 for Location-based training for multi-channel talker-independent speaker separation
Figure 2 for Location-based training for multi-channel talker-independent speaker separation
Figure 3 for Location-based training for multi-channel talker-independent speaker separation
Figure 4 for Location-based training for multi-channel talker-independent speaker separation
Viaarxiv icon

Bridging the Gap Between Monaural Speech Enhancement and Recognition with Distortion-Independent Acoustic Modeling

Add code
Bookmark button
Alert button
Mar 13, 2019
Peidong Wang, Ke Tan, DeLiang Wang

Figure 1 for Bridging the Gap Between Monaural Speech Enhancement and Recognition with Distortion-Independent Acoustic Modeling
Figure 2 for Bridging the Gap Between Monaural Speech Enhancement and Recognition with Distortion-Independent Acoustic Modeling
Figure 3 for Bridging the Gap Between Monaural Speech Enhancement and Recognition with Distortion-Independent Acoustic Modeling
Figure 4 for Bridging the Gap Between Monaural Speech Enhancement and Recognition with Distortion-Independent Acoustic Modeling
Viaarxiv icon

Deep Learning Based Phase Reconstruction for Speaker Separation: A Trigonometric Perspective

Add code
Bookmark button
Alert button
Nov 22, 2018
Zhong-Qiu Wang, Ke Tan, DeLiang Wang

Figure 1 for Deep Learning Based Phase Reconstruction for Speaker Separation: A Trigonometric Perspective
Figure 2 for Deep Learning Based Phase Reconstruction for Speaker Separation: A Trigonometric Perspective
Figure 3 for Deep Learning Based Phase Reconstruction for Speaker Separation: A Trigonometric Perspective
Figure 4 for Deep Learning Based Phase Reconstruction for Speaker Separation: A Trigonometric Perspective
Viaarxiv icon