Alert button

"speech": models, code, and papers
Alert button

Deliberation Model for On-Device Spoken Language Understanding

Apr 04, 2022
Duc Le, Akshat Shrivastava, Paden Tomasello, Suyoun Kim, Aleksandr Livshits, Ozlem Kalinli, Michael L. Seltzer

Figure 1 for Deliberation Model for On-Device Spoken Language Understanding
Figure 2 for Deliberation Model for On-Device Spoken Language Understanding
Figure 3 for Deliberation Model for On-Device Spoken Language Understanding
Figure 4 for Deliberation Model for On-Device Spoken Language Understanding
Viaarxiv icon

Model Blending for Text Classification

Aug 05, 2022
Ramit Pahwa

Figure 1 for Model Blending for Text Classification
Figure 2 for Model Blending for Text Classification
Figure 3 for Model Blending for Text Classification
Figure 4 for Model Blending for Text Classification
Viaarxiv icon

Towards Semi-Supervised Semantics Understanding from Speech

Add code
Bookmark button
Alert button
Nov 11, 2020
Cheng-I Lai, Jin Cao, Sravan Bodapati, Shang-Wen Li

Figure 1 for Towards Semi-Supervised Semantics Understanding from Speech
Figure 2 for Towards Semi-Supervised Semantics Understanding from Speech
Figure 3 for Towards Semi-Supervised Semantics Understanding from Speech
Figure 4 for Towards Semi-Supervised Semantics Understanding from Speech
Viaarxiv icon

Learning from human perception to improve automatic speaker verification in style-mismatched conditions

Jun 28, 2022
Amber Afshan, Abeer Alwan

Figure 1 for Learning from human perception to improve automatic speaker verification in style-mismatched conditions
Figure 2 for Learning from human perception to improve automatic speaker verification in style-mismatched conditions
Figure 3 for Learning from human perception to improve automatic speaker verification in style-mismatched conditions
Viaarxiv icon

TweetBLM: A Hate Speech Dataset and Analysis of Black Lives Matter-related Microblogs on Twitter

Aug 27, 2021
Sumit Kumar, Raj Ratn Pranesh

Figure 1 for TweetBLM: A Hate Speech Dataset and Analysis of Black Lives Matter-related Microblogs on Twitter
Figure 2 for TweetBLM: A Hate Speech Dataset and Analysis of Black Lives Matter-related Microblogs on Twitter
Figure 3 for TweetBLM: A Hate Speech Dataset and Analysis of Black Lives Matter-related Microblogs on Twitter
Figure 4 for TweetBLM: A Hate Speech Dataset and Analysis of Black Lives Matter-related Microblogs on Twitter
Viaarxiv icon

Analysis and Synthesis of Hypo and Hyperarticulated Speech

Jun 07, 2020
Benjamin Picart, Thomas Drugman, Thierry Dutoit

Figure 1 for Analysis and Synthesis of Hypo and Hyperarticulated Speech
Figure 2 for Analysis and Synthesis of Hypo and Hyperarticulated Speech
Figure 3 for Analysis and Synthesis of Hypo and Hyperarticulated Speech
Figure 4 for Analysis and Synthesis of Hypo and Hyperarticulated Speech
Viaarxiv icon

QUARC: Quaternion Multi-Modal Fusion Architecture For Hate Speech Classification

Add code
Bookmark button
Alert button
Dec 15, 2020
Deepak Kumar, Nalin Kumar, Subhankar Mishra

Figure 1 for QUARC: Quaternion Multi-Modal Fusion Architecture For Hate Speech Classification
Figure 2 for QUARC: Quaternion Multi-Modal Fusion Architecture For Hate Speech Classification
Figure 3 for QUARC: Quaternion Multi-Modal Fusion Architecture For Hate Speech Classification
Figure 4 for QUARC: Quaternion Multi-Modal Fusion Architecture For Hate Speech Classification
Viaarxiv icon

Zero-Shot Personalized Speech Enhancement through Speaker-Informed Model Selection

May 08, 2021
Aswin Sivaraman, Minje Kim

Figure 1 for Zero-Shot Personalized Speech Enhancement through Speaker-Informed Model Selection
Figure 2 for Zero-Shot Personalized Speech Enhancement through Speaker-Informed Model Selection
Figure 3 for Zero-Shot Personalized Speech Enhancement through Speaker-Informed Model Selection
Viaarxiv icon

Diarization of Legal Proceedings. Identifying and Transcribing Judicial Speech from Recorded Court Audio

Add code
Bookmark button
Alert button
Apr 03, 2021
Jeffrey Tumminia, Amanda Kuznecov, Sophia Tsilerides, Ilana Weinstein, Brian McFee, Michael Picheny, Aaron R. Kaufman

Figure 1 for Diarization of Legal Proceedings. Identifying and Transcribing Judicial Speech from Recorded Court Audio
Figure 2 for Diarization of Legal Proceedings. Identifying and Transcribing Judicial Speech from Recorded Court Audio
Figure 3 for Diarization of Legal Proceedings. Identifying and Transcribing Judicial Speech from Recorded Court Audio
Figure 4 for Diarization of Legal Proceedings. Identifying and Transcribing Judicial Speech from Recorded Court Audio
Viaarxiv icon

Into-TTS : Intonation Template based Prosody Control System

Add code
Bookmark button
Alert button
Apr 04, 2022
Jihwan Lee, Joun Yeop Lee, Heejin Choi, Seongkyu Mun, Sangjun Park, Chanwoo Kim

Figure 1 for Into-TTS : Intonation Template based Prosody Control System
Figure 2 for Into-TTS : Intonation Template based Prosody Control System
Figure 3 for Into-TTS : Intonation Template based Prosody Control System
Figure 4 for Into-TTS : Intonation Template based Prosody Control System
Viaarxiv icon