Alert button

"speech": models, code, and papers
Alert button

Conditional Diffusion Probabilistic Model for Speech Enhancement

Add code
Bookmark button
Alert button
Feb 10, 2022
Yen-Ju Lu, Zhong-Qiu Wang, Shinji Watanabe, Alexander Richard, Cheng Yu, Yu Tsao

Figure 1 for Conditional Diffusion Probabilistic Model for Speech Enhancement
Figure 2 for Conditional Diffusion Probabilistic Model for Speech Enhancement
Figure 3 for Conditional Diffusion Probabilistic Model for Speech Enhancement
Figure 4 for Conditional Diffusion Probabilistic Model for Speech Enhancement
Viaarxiv icon

SepIt: Approaching a Single Channel Speech Separation Bound

May 25, 2022
Shahar Lutati, Eliya Nachmani, Lior Wolf

Figure 1 for SepIt: Approaching a Single Channel Speech Separation Bound
Figure 2 for SepIt: Approaching a Single Channel Speech Separation Bound
Figure 3 for SepIt: Approaching a Single Channel Speech Separation Bound
Figure 4 for SepIt: Approaching a Single Channel Speech Separation Bound
Viaarxiv icon

Joint Representations of Text and Knowledge Graphs for Retrieval and Evaluation

Feb 28, 2023
Teven Le Scao, Claire Gardent

Figure 1 for Joint Representations of Text and Knowledge Graphs for Retrieval and Evaluation
Figure 2 for Joint Representations of Text and Knowledge Graphs for Retrieval and Evaluation
Figure 3 for Joint Representations of Text and Knowledge Graphs for Retrieval and Evaluation
Figure 4 for Joint Representations of Text and Knowledge Graphs for Retrieval and Evaluation
Viaarxiv icon

A Novel Frame Structure for Cloud-Based Audio-Visual Speech Enhancement in Multimodal Hearing-aids

Oct 24, 2022
Abhijeet Bishnu, Ankit Gupta, Mandar Gogate, Kia Dashtipour, Ahsan Adeel, Amir Hussain, Mathini Sellathurai, Tharmalingam Ratnarajah

Figure 1 for A Novel Frame Structure for Cloud-Based Audio-Visual Speech Enhancement in Multimodal Hearing-aids
Figure 2 for A Novel Frame Structure for Cloud-Based Audio-Visual Speech Enhancement in Multimodal Hearing-aids
Figure 3 for A Novel Frame Structure for Cloud-Based Audio-Visual Speech Enhancement in Multimodal Hearing-aids
Figure 4 for A Novel Frame Structure for Cloud-Based Audio-Visual Speech Enhancement in Multimodal Hearing-aids
Viaarxiv icon

Disentangled Latent Speech Representation for Automatic Pathological Intelligibility Assessment

Add code
Bookmark button
Alert button
Apr 08, 2022
Tobias Weise, Philipp Klumpp, Andreas Maier, Elmar Noeth, Bjoern Heismann, Maria Schuster, Seung Hee Yang

Figure 1 for Disentangled Latent Speech Representation for Automatic Pathological Intelligibility Assessment
Figure 2 for Disentangled Latent Speech Representation for Automatic Pathological Intelligibility Assessment
Figure 3 for Disentangled Latent Speech Representation for Automatic Pathological Intelligibility Assessment
Viaarxiv icon

Joint Speech Recognition and Audio Captioning

Add code
Bookmark button
Alert button
Feb 03, 2022
Chaitanya Narisetty, Emiru Tsunoo, Xuankai Chang, Yosuke Kashiwagi, Michael Hentschel, Shinji Watanabe

Figure 1 for Joint Speech Recognition and Audio Captioning
Figure 2 for Joint Speech Recognition and Audio Captioning
Figure 3 for Joint Speech Recognition and Audio Captioning
Figure 4 for Joint Speech Recognition and Audio Captioning
Viaarxiv icon

SepIt Approaching a Single Channel Speech Separation Bound

May 24, 2022
Shahar Lutati, Eliya Nachmani, Lior Wolf

Figure 1 for SepIt Approaching a Single Channel Speech Separation Bound
Figure 2 for SepIt Approaching a Single Channel Speech Separation Bound
Figure 3 for SepIt Approaching a Single Channel Speech Separation Bound
Figure 4 for SepIt Approaching a Single Channel Speech Separation Bound
Viaarxiv icon

Improving Monaural Speech Enhancement with Multi-head Self and Cross Attention

Add code
Bookmark button
Alert button
May 20, 2022
Xinmeng Xu, Jianjun Hao

Figure 1 for Improving Monaural Speech Enhancement with Multi-head Self and Cross Attention
Figure 2 for Improving Monaural Speech Enhancement with Multi-head Self and Cross Attention
Figure 3 for Improving Monaural Speech Enhancement with Multi-head Self and Cross Attention
Figure 4 for Improving Monaural Speech Enhancement with Multi-head Self and Cross Attention
Viaarxiv icon

Wav2Seq: Pre-training Speech-to-Text Encoder-Decoder Models Using Pseudo Languages

Add code
Bookmark button
Alert button
May 02, 2022
Felix Wu, Kwangyoun Kim, Shinji Watanabe, Kyu Han, Ryan McDonald, Kilian Q. Weinberger, Yoav Artzi

Figure 1 for Wav2Seq: Pre-training Speech-to-Text Encoder-Decoder Models Using Pseudo Languages
Figure 2 for Wav2Seq: Pre-training Speech-to-Text Encoder-Decoder Models Using Pseudo Languages
Figure 3 for Wav2Seq: Pre-training Speech-to-Text Encoder-Decoder Models Using Pseudo Languages
Figure 4 for Wav2Seq: Pre-training Speech-to-Text Encoder-Decoder Models Using Pseudo Languages
Viaarxiv icon

Automatic Detection of Speech Sound Disorder in Child Speech Using Posterior-based Speaker Representations

Mar 29, 2022
Si-Ioi Ng, Cymie Wing-Yee Ng, Jiarui Wang, Tan Lee

Figure 1 for Automatic Detection of Speech Sound Disorder in Child Speech Using Posterior-based Speaker Representations
Figure 2 for Automatic Detection of Speech Sound Disorder in Child Speech Using Posterior-based Speaker Representations
Figure 3 for Automatic Detection of Speech Sound Disorder in Child Speech Using Posterior-based Speaker Representations
Figure 4 for Automatic Detection of Speech Sound Disorder in Child Speech Using Posterior-based Speaker Representations
Viaarxiv icon