"speech": models, code, and papers

Speaker and Direction Inferred Dual-channel Speech Separation

Feb 08, 2021
Chenxing Li, Jiaming Xu, Nima Mesgarani, Bo Xu

Fast Development of ASR in African Languages using Self Supervised Speech Representation Learning

Mar 16, 2021
Jama Hussein Mohamud, Lloyd Acquaye Thompson, Aissatou Ndoye, Laurent Besacier

Exploiting Adapters for Cross-lingual Low-resource Speech Recognition

May 18, 2021
Wenxin Hou, Han Zhu, Yidong Wang, Jindong Wang, Tao Qin, Renjun Xu, Takahiro Shinozaki

Computing with Hypervectors for Efficient Speaker Identification

Aug 28, 2022
Ping-Chen Huang, Denis Kleyko, Jan M. Rabaey, Bruno A. Olshausen, Pentti Kanerva

WaDeNet: Wavelet Decomposition based CNN for Speech Processing

Nov 11, 2020
Prithvi Suresh, Abhijith Ragav

Visual Speech Enhancement Without A Real Visual Stream

Dec 20, 2020
Sindhu B Hegde, K R Prajwal, Rudrabha Mukhopadhyay, Vinay Namboodiri, C. V. Jawahar

Enhancing the Intelligibility of Cleft Lip and Palate Speech using Cycle-consistent Adversarial Networks

Jan 30, 2021
Protima Nomo Sudro, Rohan Kumar Das, Rohit Sinha, S R Mahadeva Prasanna

Speaker Separation Using Speaker Inventories and Estimated Speech

Oct 20, 2020
Peidong Wang, Zhuo Chen, DeLiang Wang, Jinyu Li, Yifan Gong

Bridging the Modality Gap for Speech-to-Text Translation

Oct 28, 2020
Yuchen Liu, Junnan Zhu, Jiajun Zhang, Chengqing Zong

Learning ASR pathways: A sparse multilingual ASR model

Sep 13, 2022
Mu Yang, Andros Tjandra, Chunxi Liu, David Zhang, Duc Le, John H. L. Hansen, Ozlem Kalinli
