Alert button

"speech": models, code, and papers
Alert button

Evaluation of Automated Speech Recognition Systems for Conversational Speech: A Linguistic Perspective

Nov 05, 2022
Hannaneh B. Pasandi, Haniyeh B. Pasandi

Figure 1 for Evaluation of Automated Speech Recognition Systems for Conversational Speech: A Linguistic Perspective
Figure 2 for Evaluation of Automated Speech Recognition Systems for Conversational Speech: A Linguistic Perspective
Figure 3 for Evaluation of Automated Speech Recognition Systems for Conversational Speech: A Linguistic Perspective
Figure 4 for Evaluation of Automated Speech Recognition Systems for Conversational Speech: A Linguistic Perspective
Viaarxiv icon

Exploring Non-Verbal Predicates in Semantic Role Labeling: Challenges and Opportunities

Add code
Bookmark button
Alert button
Jul 04, 2023
Riccardo Orlando, Simone Conia, Roberto Navigli

Figure 1 for Exploring Non-Verbal Predicates in Semantic Role Labeling: Challenges and Opportunities
Figure 2 for Exploring Non-Verbal Predicates in Semantic Role Labeling: Challenges and Opportunities
Figure 3 for Exploring Non-Verbal Predicates in Semantic Role Labeling: Challenges and Opportunities
Figure 4 for Exploring Non-Verbal Predicates in Semantic Role Labeling: Challenges and Opportunities
Viaarxiv icon

The Double Helix inside the NLP Transformer

Jun 23, 2023
Jason H. J. Lu, Qingzhen Guo

Figure 1 for The Double Helix inside the NLP Transformer
Figure 2 for The Double Helix inside the NLP Transformer
Figure 3 for The Double Helix inside the NLP Transformer
Figure 4 for The Double Helix inside the NLP Transformer
Viaarxiv icon

Implementing contextual biasing in GPU decoder for online ASR

Add code
Bookmark button
Alert button
Jun 23, 2023
Iuliia Nigmatulina, Srikanth Madikeri, Esaú Villatoro-Tello, Petr Motliček, Juan Zuluaga-Gomez, Karthik Pandia, Aravind Ganapathiraju

Figure 1 for Implementing contextual biasing in GPU decoder for online ASR
Figure 2 for Implementing contextual biasing in GPU decoder for online ASR
Figure 3 for Implementing contextual biasing in GPU decoder for online ASR
Viaarxiv icon

ADAPTERMIX: Exploring the Efficacy of Mixture of Adapters for Low-Resource TTS Adaptation

Add code
Bookmark button
Alert button
May 29, 2023
Ambuj Mehrish, Abhinav Ramesh Kashyap, Li Yingting, Navonil Majumder, Soujanya Poria

Figure 1 for ADAPTERMIX: Exploring the Efficacy of Mixture of Adapters for Low-Resource TTS Adaptation
Figure 2 for ADAPTERMIX: Exploring the Efficacy of Mixture of Adapters for Low-Resource TTS Adaptation
Figure 3 for ADAPTERMIX: Exploring the Efficacy of Mixture of Adapters for Low-Resource TTS Adaptation
Figure 4 for ADAPTERMIX: Exploring the Efficacy of Mixture of Adapters for Low-Resource TTS Adaptation
Viaarxiv icon

Generating Holistic 3D Human Motion from Speech

Dec 08, 2022
Hongwei Yi, Hualin Liang, Yifei Liu, Qiong Cao, Yandong Wen, Timo Bolkart, Dacheng Tao, Michael J. Black

Figure 1 for Generating Holistic 3D Human Motion from Speech
Figure 2 for Generating Holistic 3D Human Motion from Speech
Figure 3 for Generating Holistic 3D Human Motion from Speech
Figure 4 for Generating Holistic 3D Human Motion from Speech
Viaarxiv icon

Multi-resolution location-based training for multi-channel continuous speech separation

Jan 16, 2023
Hassan Taherian, DeLiang Wang

Figure 1 for Multi-resolution location-based training for multi-channel continuous speech separation
Figure 2 for Multi-resolution location-based training for multi-channel continuous speech separation
Viaarxiv icon

Modality Influence in Multimodal Machine Learning

Add code
Bookmark button
Alert button
Jun 10, 2023
Abdelhamid Haouhat, Slimane Bellaouar, Attia Nehar, Hadda Cherroun

Figure 1 for Modality Influence in Multimodal Machine Learning
Figure 2 for Modality Influence in Multimodal Machine Learning
Figure 3 for Modality Influence in Multimodal Machine Learning
Figure 4 for Modality Influence in Multimodal Machine Learning
Viaarxiv icon

Improving Prosody for Cross-Speaker Style Transfer by Semi-Supervised Style Extractor and Hierarchical Modeling in Speech Synthesis

Add code
Bookmark button
Alert button
Mar 14, 2023
Chunyu Qiang, Peng Yang, Hao Che, Ying Zhang, Xiaorui Wang, Zhongyuan Wang

Figure 1 for Improving Prosody for Cross-Speaker Style Transfer by Semi-Supervised Style Extractor and Hierarchical Modeling in Speech Synthesis
Figure 2 for Improving Prosody for Cross-Speaker Style Transfer by Semi-Supervised Style Extractor and Hierarchical Modeling in Speech Synthesis
Figure 3 for Improving Prosody for Cross-Speaker Style Transfer by Semi-Supervised Style Extractor and Hierarchical Modeling in Speech Synthesis
Figure 4 for Improving Prosody for Cross-Speaker Style Transfer by Semi-Supervised Style Extractor and Hierarchical Modeling in Speech Synthesis
Viaarxiv icon

Evaluation of Virtual Acoustic Environments with Different Acoustic Level of Detail

Jun 29, 2023
Stefan Fichna, Steven van de Par, Stephan D. Ewert

Figure 1 for Evaluation of Virtual Acoustic Environments with Different Acoustic Level of Detail
Figure 2 for Evaluation of Virtual Acoustic Environments with Different Acoustic Level of Detail
Figure 3 for Evaluation of Virtual Acoustic Environments with Different Acoustic Level of Detail
Viaarxiv icon