Alert button

"speech": models, code, and papers
Alert button

Fillers in Spoken Language Understanding: Computational and Psycholinguistic Perspectives

Add code
Bookmark button
Alert button
Jan 25, 2023
Tanvi Dinkar, Chloé Clavel, Ioana Vasilescu

Figure 1 for Fillers in Spoken Language Understanding: Computational and Psycholinguistic Perspectives
Figure 2 for Fillers in Spoken Language Understanding: Computational and Psycholinguistic Perspectives
Viaarxiv icon

Active Learning of Non-semantic Speech Tasks with Pretrained Models

Add code
Bookmark button
Alert button
Nov 03, 2022
Harlin Lee, Aaqib Saeed, Andrea L. Bertozzi

Figure 1 for Active Learning of Non-semantic Speech Tasks with Pretrained Models
Figure 2 for Active Learning of Non-semantic Speech Tasks with Pretrained Models
Figure 3 for Active Learning of Non-semantic Speech Tasks with Pretrained Models
Figure 4 for Active Learning of Non-semantic Speech Tasks with Pretrained Models
Viaarxiv icon

UniCT DMI Solution for 3rd COV19D Competition on COVID-19 Detection through attention-based CNN for CT Scan

Mar 22, 2023
Alessia Rondinella, Francesco Guarnera, Oliver Giudice, Alessandro Ortis, Francesco Rundo, Sebastiano Battiato

Figure 1 for UniCT DMI Solution for 3rd COV19D Competition on COVID-19 Detection through attention-based CNN for CT Scan
Figure 2 for UniCT DMI Solution for 3rd COV19D Competition on COVID-19 Detection through attention-based CNN for CT Scan
Viaarxiv icon

Radio2Speech: High Quality Speech Recovery from Radio Frequency Signals

Add code
Bookmark button
Alert button
Jun 22, 2022
Running Zhao, Jiangtao Yu, Tingle Li, Hang Zhao, Edith C. H. Ngai

Figure 1 for Radio2Speech: High Quality Speech Recovery from Radio Frequency Signals
Figure 2 for Radio2Speech: High Quality Speech Recovery from Radio Frequency Signals
Figure 3 for Radio2Speech: High Quality Speech Recovery from Radio Frequency Signals
Figure 4 for Radio2Speech: High Quality Speech Recovery from Radio Frequency Signals
Viaarxiv icon

SEM-POS: Grammatically and Semantically Correct Video Captioning

Apr 04, 2023
Asmar Nadeem, Adrian Hilton, Robert Dawes, Graham Thomas, Armin Mustafa

Figure 1 for SEM-POS: Grammatically and Semantically Correct Video Captioning
Figure 2 for SEM-POS: Grammatically and Semantically Correct Video Captioning
Figure 3 for SEM-POS: Grammatically and Semantically Correct Video Captioning
Figure 4 for SEM-POS: Grammatically and Semantically Correct Video Captioning
Viaarxiv icon

Self-Supervised Speech Representation Learning: A Review

Add code
Bookmark button
Alert button
May 21, 2022
Abdelrahman Mohamed, Hung-yi Lee, Lasse Borgholt, Jakob D. Havtorn, Joakim Edin, Christian Igel, Katrin Kirchhoff, Shang-Wen Li, Karen Livescu, Lars Maaløe, Tara N. Sainath, Shinji Watanabe

Figure 1 for Self-Supervised Speech Representation Learning: A Review
Figure 2 for Self-Supervised Speech Representation Learning: A Review
Figure 3 for Self-Supervised Speech Representation Learning: A Review
Figure 4 for Self-Supervised Speech Representation Learning: A Review
Viaarxiv icon

Stabilizing Transformer Training by Preventing Attention Entropy Collapse

Add code
Bookmark button
Alert button
Mar 11, 2023
Shuangfei Zhai, Tatiana Likhomanenko, Etai Littwin, Dan Busbridge, Jason Ramapuram, Yizhe Zhang, Jiatao Gu, Josh Susskind

Figure 1 for Stabilizing Transformer Training by Preventing Attention Entropy Collapse
Figure 2 for Stabilizing Transformer Training by Preventing Attention Entropy Collapse
Figure 3 for Stabilizing Transformer Training by Preventing Attention Entropy Collapse
Figure 4 for Stabilizing Transformer Training by Preventing Attention Entropy Collapse
Viaarxiv icon

End-to-End Integration of Speech Recognition, Speech Enhancement, and Self-Supervised Learning Representation

Add code
Bookmark button
Alert button
Apr 01, 2022
Xuankai Chang, Takashi Maekaku, Yuya Fujita, Shinji Watanabe

Figure 1 for End-to-End Integration of Speech Recognition, Speech Enhancement, and Self-Supervised Learning Representation
Figure 2 for End-to-End Integration of Speech Recognition, Speech Enhancement, and Self-Supervised Learning Representation
Figure 3 for End-to-End Integration of Speech Recognition, Speech Enhancement, and Self-Supervised Learning Representation
Figure 4 for End-to-End Integration of Speech Recognition, Speech Enhancement, and Self-Supervised Learning Representation
Viaarxiv icon

Learning Speaker-specific Lip-to-Speech Generation

Jun 04, 2022
Munender Varshney, Ravindra Yadav, Vinay P. Namboodiri, Rajesh M Hegde

Figure 1 for Learning Speaker-specific Lip-to-Speech Generation
Figure 2 for Learning Speaker-specific Lip-to-Speech Generation
Figure 3 for Learning Speaker-specific Lip-to-Speech Generation
Figure 4 for Learning Speaker-specific Lip-to-Speech Generation
Viaarxiv icon

A Deliberation-based Joint Acoustic and Text Decoder

Mar 23, 2023
Sepand Mavandadi, Tara N. Sainath, Ke Hu, Zelin Wu

Figure 1 for A Deliberation-based Joint Acoustic and Text Decoder
Figure 2 for A Deliberation-based Joint Acoustic and Text Decoder
Figure 3 for A Deliberation-based Joint Acoustic and Text Decoder
Figure 4 for A Deliberation-based Joint Acoustic and Text Decoder
Viaarxiv icon