"speech": models, code, and papers

Classification of Emotions and Evaluation of Customer Satisfaction from Speech in Real World Acoustic Environments

Aug 26, 2021
Luis Felipe Parra-Gallego, Juan Rafael Orozco-Arroyave

Grammar Detection for Sentiment Analysis through Improved Viterbi Algorithm

May 26, 2022
Surya Teja Chavali, Charan Tej Kandavalli, Sugash T M

Differentially Private Speaker Anonymization

Feb 23, 2022
Ali Shahin Shamsabadi, Brij Mohan Lal Srivastava, Aurélien Bellet, Nathalie Vauquier, Emmanuel Vincent, Mohamed Maouche, Marc Tommasi, Nicolas Papernot

Applying wav2vec2.0 to Speech Recognition in various low-resource languages

Dec 22, 2020
Cheng Yi, Jianzhong Wang, Ning Cheng, Shiyu Zhou, Bo Xu

Improving Data Driven Inverse Text Normalization using Data Augmentation

Jul 20, 2022
Laxmi Pandey, Debjyoti Paul, Pooja Chitkara, Yutong Pang, Xuedong Zhang, Kjell Schubert, Mark Chou, Shu Liu, Yatharth Saraf

fairseq S2T: Fast Speech-to-Text Modeling with fairseq

Oct 11, 2020
Changhan Wang, Yun Tang, Xutai Ma, Anne Wu, Dmytro Okhonko, Juan Pino

End-to-End Spectro-Temporal Graph Attention Networks for Speaker Verification Anti-Spoofing and Speech Deepfake Detection

Aug 23, 2021
Hemlata Tak, Jee-weon Jung, Jose Patino, Madhu Kamble, Massimiliano Todisco, Nicholas Evans

ASR Error Correction with Constrained Decoding on Operation Prediction

Aug 09, 2022
Jingyuan Yang, Rongjun Li, Wei Peng

Efficient conformer-based speech recognition with linear attention

Apr 14, 2021
Shengqiang Li, Menglong Xu, Xiao-Lei Zhang

Similarity Analysis of Self-Supervised Speech Representations

Oct 22, 2020
Yu-An Chung, Yonatan Belinkov, James Glass
