Alert button

"speech": models, code, and papers
Alert button

Consistent Transcription and Translation of Speech

Add code
Bookmark button
Alert button
Jul 24, 2020
Matthias Sperber, Hendra Setiawan, Christian Gollan, Udhyakumar Nallasamy, Matthias Paulik

Figure 1 for Consistent Transcription and Translation of Speech
Figure 2 for Consistent Transcription and Translation of Speech
Figure 3 for Consistent Transcription and Translation of Speech
Figure 4 for Consistent Transcription and Translation of Speech
Viaarxiv icon

Alzheimer's Dementia Recognition through Spontaneous Speech: The ADReSS Challenge

Apr 30, 2020
Saturnino Luz, Fasih Haider, Sofia de la Fuente, Davida Fromm, Brian MacWhinney

Figure 1 for Alzheimer's Dementia Recognition through Spontaneous Speech: The ADReSS Challenge
Figure 2 for Alzheimer's Dementia Recognition through Spontaneous Speech: The ADReSS Challenge
Figure 3 for Alzheimer's Dementia Recognition through Spontaneous Speech: The ADReSS Challenge
Figure 4 for Alzheimer's Dementia Recognition through Spontaneous Speech: The ADReSS Challenge
Viaarxiv icon

The Perceptimatic English Benchmark for Speech Perception Models

Add code
Bookmark button
Alert button
May 07, 2020
Juliette Millet, Ewan Dunbar

Figure 1 for The Perceptimatic English Benchmark for Speech Perception Models
Figure 2 for The Perceptimatic English Benchmark for Speech Perception Models
Figure 3 for The Perceptimatic English Benchmark for Speech Perception Models
Viaarxiv icon

Applying Speech Tempo-Derived Features, BoAW and Fisher Vectors to Detect Elderly Emotion and Speech in Surgical Masks

Aug 07, 2020
Gábor Gosztolya, László Tóth

Figure 1 for Applying Speech Tempo-Derived Features, BoAW and Fisher Vectors to Detect Elderly Emotion and Speech in Surgical Masks
Figure 2 for Applying Speech Tempo-Derived Features, BoAW and Fisher Vectors to Detect Elderly Emotion and Speech in Surgical Masks
Figure 3 for Applying Speech Tempo-Derived Features, BoAW and Fisher Vectors to Detect Elderly Emotion and Speech in Surgical Masks
Viaarxiv icon

Continuous speech separation: dataset and analysis

Add code
Bookmark button
Alert button
Jan 30, 2020
Zhuo Chen, Takuya Yoshioka, Liang Lu, Tianyan Zhou, Zhong Meng, Yi Luo, Jian Wu, Jinyu Li

Figure 1 for Continuous speech separation: dataset and analysis
Figure 2 for Continuous speech separation: dataset and analysis
Figure 3 for Continuous speech separation: dataset and analysis
Figure 4 for Continuous speech separation: dataset and analysis
Viaarxiv icon

Gaze-enhanced Crossmodal Embeddings for Emotion Recognition

Apr 30, 2022
Ahmed Abdou, Ekta Sood, Philipp Müller, Andreas Bulling

Figure 1 for Gaze-enhanced Crossmodal Embeddings for Emotion Recognition
Figure 2 for Gaze-enhanced Crossmodal Embeddings for Emotion Recognition
Figure 3 for Gaze-enhanced Crossmodal Embeddings for Emotion Recognition
Figure 4 for Gaze-enhanced Crossmodal Embeddings for Emotion Recognition
Viaarxiv icon

dictNN: A Dictionary-Enhanced CNN Approach for Classifying Hate Speech on Twitter

Add code
Bookmark button
Alert button
Mar 16, 2021
Maximilian Kupi, Michael Bodnar, Nikolas Schmidt, Carlos Eduardo Posada

Figure 1 for dictNN: A Dictionary-Enhanced CNN Approach for Classifying Hate Speech on Twitter
Figure 2 for dictNN: A Dictionary-Enhanced CNN Approach for Classifying Hate Speech on Twitter
Figure 3 for dictNN: A Dictionary-Enhanced CNN Approach for Classifying Hate Speech on Twitter
Figure 4 for dictNN: A Dictionary-Enhanced CNN Approach for Classifying Hate Speech on Twitter
Viaarxiv icon

A General Multi-Task Learning Framework to Leverage Text Data for Speech to Text Tasks

Oct 21, 2020
Yun Tang, Juan Pino, Changhan Wang, Xutai Ma, Dmitriy Genzel

Figure 1 for A General Multi-Task Learning Framework to Leverage Text Data for Speech to Text Tasks
Figure 2 for A General Multi-Task Learning Framework to Leverage Text Data for Speech to Text Tasks
Figure 3 for A General Multi-Task Learning Framework to Leverage Text Data for Speech to Text Tasks
Figure 4 for A General Multi-Task Learning Framework to Leverage Text Data for Speech to Text Tasks
Viaarxiv icon

Compute Cost Amortized Transformer for Streaming ASR

Jul 05, 2022
Yi Xie, Jonathan Macoskey, Martin Radfar, Feng-Ju Chang, Brian King, Ariya Rastrow, Athanasios Mouchtaris, Grant P. Strimel

Figure 1 for Compute Cost Amortized Transformer for Streaming ASR
Figure 2 for Compute Cost Amortized Transformer for Streaming ASR
Figure 3 for Compute Cost Amortized Transformer for Streaming ASR
Figure 4 for Compute Cost Amortized Transformer for Streaming ASR
Viaarxiv icon

SLNSpeech: solving extended speech separation problem by the help of sign language

Jul 21, 2020
Jiasong Wu, Taotao Li, Youyong Kong, Guanyu Yang, Lotfi Senhadji, Huazhong Shu

Figure 1 for SLNSpeech: solving extended speech separation problem by the help of sign language
Figure 2 for SLNSpeech: solving extended speech separation problem by the help of sign language
Figure 3 for SLNSpeech: solving extended speech separation problem by the help of sign language
Figure 4 for SLNSpeech: solving extended speech separation problem by the help of sign language
Viaarxiv icon