"speech": models, code, and papers

How Bad Are Artifacts?: Analyzing the Impact of Speech Enhancement Errors on ASR

Jan 18, 2022
Kazuma Iwamoto, Tsubasa Ochiai, Marc Delcroix, Rintaro Ikeshita, Hiroshi Sato, Shoko Araki, Shigeru Katagiri

Semi-Supervised Learning Based on Reference Model for Low-resource TTS

Oct 25, 2022
Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao

Efficient Training of Neural Transducer for Speech Recognition

Apr 22, 2022
Wei Zhou, Wilfried Michel, Ralf Schlüter, Hermann Ney

Compressing Transformer-based self-supervised models for speech processing

Nov 17, 2022
Tzu-Quan Lin, Tsung-Huan Yang, Chun-Yao Chang, Kuang-Ming Chen, Tzu-hsun Feng, Hung-yi Lee, Hao Tang

Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition

Apr 13, 2022
Shaojin Ding, Rajeev Rikhye, Qiao Liang, Yanzhang He, Quan Wang, Arun Narayanan, Tom O'Malley, Ian McGraw

Effectiveness of Text, Acoustic, and Lattice-based representations in Spoken Language Understanding tasks

Dec 16, 2022
Esaú Villatoro-Tello, Srikanth Madikeri, Juan Zuluaga-Gomez, Bidisha Sharma, Seyyed Saeed Sarfjoo, Iuliia Nigmatulina, Petr Motlicek, Alexei V. Ivanov, Aravind Ganapathiraju

Dodging the Data Bottleneck: Automatic Subtitling with Automatically Segmented ST Corpora

Sep 21, 2022
Sara Papi, Alina Karakanta, Matteo Negri, Marco Turchi

Interpretable Acoustic Representation Learning on Breathing and Speech Signals for COVID-19 Detection

Jun 27, 2022
Debottam Dutta, Debarpan Bhattacharya, Sriram Ganapathy, Amir H. Poorjam, Deepak Mittal, Maneesh Singh

Prabhupadavani: A Code-mixed Speech Translation Data for 25 Languages

Jan 27, 2022
Jivnesh Sandhan, Ayush Daksh, Om Adideva Paranjay, Laxmidhar Behera, Pawan Goyal

Predicting Knowledge Gain for MOOC Video Consumption

Dec 13, 2022
Christian Otto, Markos Stamatakis, Anett Hoppe, Ralph Ewerth
