Alert button

"speech": models, code, and papers
Alert button

Improving TTS for Shanghainese: Addressing Tone Sandhi via Word Segmentation

Jul 30, 2023
Yuanhao Chen

Figure 1 for Improving TTS for Shanghainese: Addressing Tone Sandhi via Word Segmentation
Figure 2 for Improving TTS for Shanghainese: Addressing Tone Sandhi via Word Segmentation
Figure 3 for Improving TTS for Shanghainese: Addressing Tone Sandhi via Word Segmentation
Figure 4 for Improving TTS for Shanghainese: Addressing Tone Sandhi via Word Segmentation
Viaarxiv icon

AudioPaLM: A Large Language Model That Can Speak and Listen

Add code
Bookmark button
Alert button
Jun 22, 2023
Paul K. Rubenstein, Chulayuth Asawaroengchai, Duc Dung Nguyen, Ankur Bapna, Zalán Borsos, Félix de Chaumont Quitry, Peter Chen, Dalia El Badawy, Wei Han, Eugene Kharitonov, Hannah Muckenhirn, Dirk Padfield, James Qin, Danny Rozenberg, Tara Sainath, Johan Schalkwyk, Matt Sharifi, Michelle Tadmor Ramanovich, Marco Tagliasacchi, Alexandru Tudor, Mihajlo Velimirović, Damien Vincent, Jiahui Yu, Yongqiang Wang, Vicky Zayats, Neil Zeghidour, Yu Zhang, Zhishuai Zhang, Lukas Zilka, Christian Frank

Figure 1 for AudioPaLM: A Large Language Model That Can Speak and Listen
Figure 2 for AudioPaLM: A Large Language Model That Can Speak and Listen
Figure 3 for AudioPaLM: A Large Language Model That Can Speak and Listen
Figure 4 for AudioPaLM: A Large Language Model That Can Speak and Listen
Viaarxiv icon

TSSR: A Truncated and Signed Square Root Activation Function for Neural Networks

Aug 09, 2023
Yuanhao Gong

Figure 1 for TSSR: A Truncated and Signed Square Root Activation Function for Neural Networks
Figure 2 for TSSR: A Truncated and Signed Square Root Activation Function for Neural Networks
Figure 3 for TSSR: A Truncated and Signed Square Root Activation Function for Neural Networks
Figure 4 for TSSR: A Truncated and Signed Square Root Activation Function for Neural Networks
Viaarxiv icon

Classifying Rhoticity of /r/ in Speech Sound Disorder using Age-and-Sex Normalized Formants

May 25, 2023
Nina R Benway, Jonathan L Preston, Asif Salekin, Yi Xiao, Harshit Sharma, Tara McAllister

Figure 1 for Classifying Rhoticity of /r/ in Speech Sound Disorder using Age-and-Sex Normalized Formants
Figure 2 for Classifying Rhoticity of /r/ in Speech Sound Disorder using Age-and-Sex Normalized Formants
Figure 3 for Classifying Rhoticity of /r/ in Speech Sound Disorder using Age-and-Sex Normalized Formants
Figure 4 for Classifying Rhoticity of /r/ in Speech Sound Disorder using Age-and-Sex Normalized Formants
Viaarxiv icon

Investigating the Sensitivity of Automatic Speech Recognition Systems to Phonetic Variation in L2 Englishes

May 12, 2023
Emma O'Neill, Julie Carson-Berndsen

Figure 1 for Investigating the Sensitivity of Automatic Speech Recognition Systems to Phonetic Variation in L2 Englishes
Figure 2 for Investigating the Sensitivity of Automatic Speech Recognition Systems to Phonetic Variation in L2 Englishes
Figure 3 for Investigating the Sensitivity of Automatic Speech Recognition Systems to Phonetic Variation in L2 Englishes
Figure 4 for Investigating the Sensitivity of Automatic Speech Recognition Systems to Phonetic Variation in L2 Englishes
Viaarxiv icon

LAHM : Large Annotated Dataset for Multi-Domain and Multilingual Hate Speech Identification

Add code
Bookmark button
Alert button
Apr 03, 2023
Ankit Yadav, Shubham Chandel, Sushant Chatufale, Anil Bandhakavi

Figure 1 for LAHM : Large Annotated Dataset for Multi-Domain and Multilingual Hate Speech Identification
Figure 2 for LAHM : Large Annotated Dataset for Multi-Domain and Multilingual Hate Speech Identification
Figure 3 for LAHM : Large Annotated Dataset for Multi-Domain and Multilingual Hate Speech Identification
Figure 4 for LAHM : Large Annotated Dataset for Multi-Domain and Multilingual Hate Speech Identification
Viaarxiv icon

Basic syntax from speech: Spontaneous concatenation in unsupervised deep neural networks

May 02, 2023
Gašper Beguš, Thomas Lu, Zili Wang

Figure 1 for Basic syntax from speech: Spontaneous concatenation in unsupervised deep neural networks
Figure 2 for Basic syntax from speech: Spontaneous concatenation in unsupervised deep neural networks
Figure 3 for Basic syntax from speech: Spontaneous concatenation in unsupervised deep neural networks
Figure 4 for Basic syntax from speech: Spontaneous concatenation in unsupervised deep neural networks
Viaarxiv icon

MC-DRE: Multi-Aspect Cross Integration for Drug Event/Entity Extraction

Add code
Bookmark button
Alert button
Aug 15, 2023
Jie Yang, Soyeon Caren Han, Siqu Long, Josiah Poon, Goran Nenadic

Figure 1 for MC-DRE: Multi-Aspect Cross Integration for Drug Event/Entity Extraction
Figure 2 for MC-DRE: Multi-Aspect Cross Integration for Drug Event/Entity Extraction
Figure 3 for MC-DRE: Multi-Aspect Cross Integration for Drug Event/Entity Extraction
Figure 4 for MC-DRE: Multi-Aspect Cross Integration for Drug Event/Entity Extraction
Viaarxiv icon

End-to-End Open Vocabulary Keyword Search With Multilingual Neural Representations

Aug 15, 2023
Bolaji Yusuf, Jan Cernocky, Murat Saraclar

Viaarxiv icon

AI in the Gray: Exploring Moderation Policies in Dialogic Large Language Models vs. Human Answers in Controversial Topics

Add code
Bookmark button
Alert button
Aug 28, 2023
Vahid Ghafouri, Vibhor Agarwal, Yong Zhang, Nishanth Sastry, Jose Such, Guillermo Suarez-Tangil

Figure 1 for AI in the Gray: Exploring Moderation Policies in Dialogic Large Language Models vs. Human Answers in Controversial Topics
Figure 2 for AI in the Gray: Exploring Moderation Policies in Dialogic Large Language Models vs. Human Answers in Controversial Topics
Figure 3 for AI in the Gray: Exploring Moderation Policies in Dialogic Large Language Models vs. Human Answers in Controversial Topics
Figure 4 for AI in the Gray: Exploring Moderation Policies in Dialogic Large Language Models vs. Human Answers in Controversial Topics
Viaarxiv icon