"speech recognition": models, code, and papers

GIST-AiTeR System for the Diarization Task of the 2022 VoxCeleb Speaker Recognition Challenge

Oct 06, 2022
Dongkeon Park, Yechan Yu, Kyeong Wan Park, Ji Won Kim, Hong Kook Kim

SEMOUR: A Scripted Emotional Speech Repository for Urdu

May 19, 2021
Nimra Zaheer, Obaid Ullah Ahmad, Ammar Ahmed, Muhammad Shehryar Khan, Mudassir Shabbir

Large-Scale Pre-Training of End-to-End Multi-Talker ASR for Meeting Transcription with Single Distant Microphone

Apr 12, 2021
Naoyuki Kanda, Guoli Ye, Yu Wu, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Takuya Yoshioka

Lightweight dynamic filter for keyword spotting

Sep 24, 2021
Donghyeon Kim, Kyungdeuk Ko, David K. Han, Hanseok Ko

One model to enhance them all: array geometry agnostic multi-channel personalized speech enhancement

Oct 20, 2021
Hassan Taherian, Sefik Emre Eskimez, Takuya Yoshioka, Huaming Wang, Zhuo Chen, Xuedong Huang

Automatic Learning of Subword Dependent Model Scales

Oct 18, 2021
Felix Meyer, Wilfried Michel, Mohammad Zeineldeen, Ralf Schlüter, Hermann Ney

Intent Classification Using Pre-Trained Embeddings For Low Resource Languages

Oct 18, 2021
Hemant Yadav, Akshat Gupta, Sai Krishna Rallabandi, Alan W Black, Rajiv Ratn Shah

BBS-KWS: The Mandarin Keyword Spotting System Won the Video Keyword Wakeup Challenge

Dec 03, 2021
Yuting Yang, Binbin Du, Yingxin Zhang, Wenxuan Wang, Yuke Li

A higher order Minkowski loss for improved prediction ability of acoustic model in ASR

Dec 02, 2021
Vishwanath Pratap Singh, Shakti P. Rath, Abhishek Pandey

Model-Based Approach for Measuring the Fairness in ASR

Sep 19, 2021
Zhe Liu, Irina-Elena Veliche, Fuchun Peng
