"speech": models, code, and papers

Speech Enhancement for Virtual Meetings on Cellular Networks

Feb 02, 2023
Hojeong Lee, Minseon Gwak, Kawon Lee, Minjeong Kim, Joseph Konan, Ojas Bhargave

Via arXiv

Factors Affecting the Performance of Automated Speaker Verification in Alzheimer's Disease Clinical Trials

Jun 20, 2023
Malikeh Ehghaghi, Marija Stanojevic, Ali Akram, Jekaterina Novikova

Via arXiv

A processing framework to access large quantities of whispered speech found in ASMR

Mar 13, 2023
Pablo Perez Zarazaga, Gustav Eje Henter, Zofia Malisz

Via arXiv

Multi-Dimensional and Multi-Scale Modeling for Speech Separation Optimized by Discriminative Learning

Mar 07, 2023
Zhaoxi Mu, Xinyu Yang, Wenjing Zhu

Via arXiv

VoxWatch: An open-set speaker recognition benchmark on VoxCeleb

Jun 30, 2023
Raghuveer Peri, Seyed Omid Sadjadi, Daniel Garcia-Romero

Via arXiv

Automatic Identification of Alzheimer's Disease using Lexical Features extracted from Language Samples

Jul 16, 2023
M. Zakaria Kurdi

Via arXiv

Imaginary Voice: Face-styled Diffusion Model for Text-to-Speech

Feb 27, 2023
Jiyoung Lee, Joon Son Chung, Soo-Whan Chung

Via arXiv

A Holistic Cascade System, Benchmark, and Human Evaluation Protocol for Expressive Speech-to-Speech Translation

Jan 25, 2023
Wen-Chin Huang, Benjamin Peloquin, Justine Kao, Changhan Wang, Hongyu Gong, Elizabeth Salesky, Yossi Adi, Ann Lee, Peng-Jen Chen

Via arXiv

Improving Meeting Inclusiveness using Speech Interruption Analysis

Apr 02, 2023
Szu-Wei Fu, Yaran Fan, Yasaman Hosseinkashi, Jayant Gupchup, Ross Cutler

Via arXiv

Transcribing Educational Videos Using Whisper: A preliminary study on using AI for transcribing educational videos

Jul 04, 2023
Ashwin Rao

Via arXiv