Alert button

"speech recognition": models, code, and papers
Alert button

Deep Learning Enabled Semantic Communications with Speech Recognition and Synthesis

May 09, 2022
Zhenzi Weng, Zhijin Qin, Xiaoming Tao, Chengkang Pan, Guangyi Liu, Geoffrey Ye Li

Figure 1 for Deep Learning Enabled Semantic Communications with Speech Recognition and Synthesis
Figure 2 for Deep Learning Enabled Semantic Communications with Speech Recognition and Synthesis
Figure 3 for Deep Learning Enabled Semantic Communications with Speech Recognition and Synthesis
Figure 4 for Deep Learning Enabled Semantic Communications with Speech Recognition and Synthesis
Viaarxiv icon

DisfluencyFixer: A tool to enhance Language Learning through Speech To Speech Disfluency Correction

Add code
Bookmark button
Alert button
May 26, 2023
Vineet Bhat, Preethi Jyothi, Pushpak Bhattacharyya

Figure 1 for DisfluencyFixer: A tool to enhance Language Learning through Speech To Speech Disfluency Correction
Figure 2 for DisfluencyFixer: A tool to enhance Language Learning through Speech To Speech Disfluency Correction
Viaarxiv icon

Improved DeepFake Detection Using Whisper Features

Add code
Bookmark button
Alert button
Jun 02, 2023
Piotr Kawa, Marcin Plata, Michał Czuba, Piotr Szymański, Piotr Syga

Figure 1 for Improved DeepFake Detection Using Whisper Features
Figure 2 for Improved DeepFake Detection Using Whisper Features
Figure 3 for Improved DeepFake Detection Using Whisper Features
Figure 4 for Improved DeepFake Detection Using Whisper Features
Viaarxiv icon

Can Contextual Biasing Remain Effective with Whisper and GPT-2?

Add code
Bookmark button
Alert button
Jun 02, 2023
Guangzhi Sun, Xianrui Zheng, Chao Zhang, Philip C. Woodland

Figure 1 for Can Contextual Biasing Remain Effective with Whisper and GPT-2?
Figure 2 for Can Contextual Biasing Remain Effective with Whisper and GPT-2?
Figure 3 for Can Contextual Biasing Remain Effective with Whisper and GPT-2?
Figure 4 for Can Contextual Biasing Remain Effective with Whisper and GPT-2?
Viaarxiv icon

Master-ASR: Achieving Multilingual Scalability and Low-Resource Adaptation in ASR with Modular Learning

Jun 23, 2023
Zhongzhi Yu, Yang Zhang, Kaizhi Qian, Yonggan Fu, Yingyan Lin

Figure 1 for Master-ASR: Achieving Multilingual Scalability and Low-Resource Adaptation in ASR with Modular Learning
Figure 2 for Master-ASR: Achieving Multilingual Scalability and Low-Resource Adaptation in ASR with Modular Learning
Figure 3 for Master-ASR: Achieving Multilingual Scalability and Low-Resource Adaptation in ASR with Modular Learning
Figure 4 for Master-ASR: Achieving Multilingual Scalability and Low-Resource Adaptation in ASR with Modular Learning
Viaarxiv icon

Align With Purpose: Optimize Desired Properties in CTC Models with a General Plug-and-Play Framework

Jul 06, 2023
Eliya Segev, Maya Alroy, Ronen Katsir, Noam Wies, Ayana Shenhav, Yael Ben-Oren, David Zar, Oren Tadmor, Jacob Bitterman, Amnon Shashua, Tal Rosenwein

Figure 1 for Align With Purpose: Optimize Desired Properties in CTC Models with a General Plug-and-Play Framework
Figure 2 for Align With Purpose: Optimize Desired Properties in CTC Models with a General Plug-and-Play Framework
Figure 3 for Align With Purpose: Optimize Desired Properties in CTC Models with a General Plug-and-Play Framework
Figure 4 for Align With Purpose: Optimize Desired Properties in CTC Models with a General Plug-and-Play Framework
Viaarxiv icon

Unified End-to-End Speech Recognition and Endpointing for Fast and Efficient Speech Systems

Nov 01, 2022
Shaan Bijwadia, Shuo-yiin Chang, Bo Li, Tara Sainath, Chao Zhang, Yanzhang He

Figure 1 for Unified End-to-End Speech Recognition and Endpointing for Fast and Efficient Speech Systems
Figure 2 for Unified End-to-End Speech Recognition and Endpointing for Fast and Efficient Speech Systems
Figure 3 for Unified End-to-End Speech Recognition and Endpointing for Fast and Efficient Speech Systems
Figure 4 for Unified End-to-End Speech Recognition and Endpointing for Fast and Efficient Speech Systems
Viaarxiv icon

End-to-End Integration of Speech Recognition, Dereverberation, Beamforming, and Self-Supervised Learning Representation

Add code
Bookmark button
Alert button
Oct 19, 2022
Yoshiki Masuyama, Xuankai Chang, Samuele Cornell, Shinji Watanabe, Nobutaka Ono

Figure 1 for End-to-End Integration of Speech Recognition, Dereverberation, Beamforming, and Self-Supervised Learning Representation
Figure 2 for End-to-End Integration of Speech Recognition, Dereverberation, Beamforming, and Self-Supervised Learning Representation
Figure 3 for End-to-End Integration of Speech Recognition, Dereverberation, Beamforming, and Self-Supervised Learning Representation
Figure 4 for End-to-End Integration of Speech Recognition, Dereverberation, Beamforming, and Self-Supervised Learning Representation
Viaarxiv icon

Filter and evolve: progressive pseudo label refining for semi-supervised automatic speech recognition

Oct 28, 2022
Zezhong Jin, Dading Zhong, Xiao Song, Zhaoyi Liu, Naipeng Ye, Qingcheng Zeng

Figure 1 for Filter and evolve: progressive pseudo label refining for semi-supervised automatic speech recognition
Figure 2 for Filter and evolve: progressive pseudo label refining for semi-supervised automatic speech recognition
Figure 3 for Filter and evolve: progressive pseudo label refining for semi-supervised automatic speech recognition
Figure 4 for Filter and evolve: progressive pseudo label refining for semi-supervised automatic speech recognition
Viaarxiv icon

From Audio to Symbolic Encoding

Add code
Bookmark button
Alert button
Feb 26, 2023
Shenli Yuan, Lingjie Kong, Jiushuang Guo

Figure 1 for From Audio to Symbolic Encoding
Figure 2 for From Audio to Symbolic Encoding
Figure 3 for From Audio to Symbolic Encoding
Figure 4 for From Audio to Symbolic Encoding
Viaarxiv icon