Alert button

"speech": models, code, and papers
Alert button

FV2ES: A Fully End2End Multimodal System for Fast Yet Effective Video Emotion Recognition Inference

Add code
Bookmark button
Alert button
Sep 21, 2022
Qinglan Wei, Xuling Huang, Yuan Zhang

Figure 1 for FV2ES: A Fully End2End Multimodal System for Fast Yet Effective Video Emotion Recognition Inference
Figure 2 for FV2ES: A Fully End2End Multimodal System for Fast Yet Effective Video Emotion Recognition Inference
Figure 3 for FV2ES: A Fully End2End Multimodal System for Fast Yet Effective Video Emotion Recognition Inference
Figure 4 for FV2ES: A Fully End2End Multimodal System for Fast Yet Effective Video Emotion Recognition Inference
Viaarxiv icon

Attention based end to end Speech Recognition for Voice Search in Hindi and English

Nov 15, 2021
Raviraj Joshi, Venkateshan Kannan

Figure 1 for Attention based end to end Speech Recognition for Voice Search in Hindi and English
Figure 2 for Attention based end to end Speech Recognition for Voice Search in Hindi and English
Viaarxiv icon

MLS: A Large-Scale Multilingual Dataset for Speech Research

Add code
Bookmark button
Alert button
Dec 19, 2020
Vineel Pratap, Qiantong Xu, Anuroop Sriram, Gabriel Synnaeve, Ronan Collobert

Figure 1 for MLS: A Large-Scale Multilingual Dataset for Speech Research
Figure 2 for MLS: A Large-Scale Multilingual Dataset for Speech Research
Figure 3 for MLS: A Large-Scale Multilingual Dataset for Speech Research
Figure 4 for MLS: A Large-Scale Multilingual Dataset for Speech Research
Viaarxiv icon

Exploring Timbre Disentanglement in Non-Autoregressive Cross-Lingual Text-to-Speech

Add code
Bookmark button
Alert button
Oct 14, 2021
Haoyue Zhan, Xinyuan Yu, Haitong Zhang, Yang Zhang, Yue Lin

Figure 1 for Exploring Timbre Disentanglement in Non-Autoregressive Cross-Lingual Text-to-Speech
Figure 2 for Exploring Timbre Disentanglement in Non-Autoregressive Cross-Lingual Text-to-Speech
Figure 3 for Exploring Timbre Disentanglement in Non-Autoregressive Cross-Lingual Text-to-Speech
Figure 4 for Exploring Timbre Disentanglement in Non-Autoregressive Cross-Lingual Text-to-Speech
Viaarxiv icon

An Optimized Signal Processing Pipeline for Syllable Detection and Speech Rate Estimation

Mar 07, 2021
Kamini Sabu, Syomantak Chaudhuri, Preeti Rao, Mahesh Patil

Figure 1 for An Optimized Signal Processing Pipeline for Syllable Detection and Speech Rate Estimation
Figure 2 for An Optimized Signal Processing Pipeline for Syllable Detection and Speech Rate Estimation
Figure 3 for An Optimized Signal Processing Pipeline for Syllable Detection and Speech Rate Estimation
Figure 4 for An Optimized Signal Processing Pipeline for Syllable Detection and Speech Rate Estimation
Viaarxiv icon

Contextual Lexicon-Based Approach for Hate Speech and Offensive Language Detection

Add code
Bookmark button
Alert button
May 09, 2021
Francielle Alves Vargas, Fabiana Rodrigues de Góes, Isabelle Carvalho, Fabrício Benevenuto, Thiago Alexandre Salgueiro Pardo

Figure 1 for Contextual Lexicon-Based Approach for Hate Speech and Offensive Language Detection
Figure 2 for Contextual Lexicon-Based Approach for Hate Speech and Offensive Language Detection
Figure 3 for Contextual Lexicon-Based Approach for Hate Speech and Offensive Language Detection
Figure 4 for Contextual Lexicon-Based Approach for Hate Speech and Offensive Language Detection
Viaarxiv icon

Speech-Image Semantic Alignment Does Not Depend on Any Prior Classification Tasks

Oct 29, 2020
Masood S. Mortazavi

Figure 1 for Speech-Image Semantic Alignment Does Not Depend on Any Prior Classification Tasks
Figure 2 for Speech-Image Semantic Alignment Does Not Depend on Any Prior Classification Tasks
Figure 3 for Speech-Image Semantic Alignment Does Not Depend on Any Prior Classification Tasks
Figure 4 for Speech-Image Semantic Alignment Does Not Depend on Any Prior Classification Tasks
Viaarxiv icon

BART based semantic correction for Mandarin automatic speech recognition system

Mar 26, 2021
Yun Zhao, Xuerui Yang, Jinchao Wang, Yongyu Gao, Chao Yan, Yuanfu Zhou

Figure 1 for BART based semantic correction for Mandarin automatic speech recognition system
Figure 2 for BART based semantic correction for Mandarin automatic speech recognition system
Figure 3 for BART based semantic correction for Mandarin automatic speech recognition system
Figure 4 for BART based semantic correction for Mandarin automatic speech recognition system
Viaarxiv icon

Supervised and Self-supervised Pretraining Based COVID-19 Detection Using Acoustic Breathing/Cough/Speech Signals

Jan 22, 2022
Xing-Yu Chen, Qiu-Shi Zhu, Jie Zhang, Li-Rong Dai

Figure 1 for Supervised and Self-supervised Pretraining Based COVID-19 Detection Using Acoustic Breathing/Cough/Speech Signals
Figure 2 for Supervised and Self-supervised Pretraining Based COVID-19 Detection Using Acoustic Breathing/Cough/Speech Signals
Figure 3 for Supervised and Self-supervised Pretraining Based COVID-19 Detection Using Acoustic Breathing/Cough/Speech Signals
Figure 4 for Supervised and Self-supervised Pretraining Based COVID-19 Detection Using Acoustic Breathing/Cough/Speech Signals
Viaarxiv icon

End-to-end Spoken Conversational Question Answering: Task, Dataset and Model

Apr 29, 2022
Chenyu You, Nuo Chen, Fenglin Liu, Shen Ge, Xian Wu, Yuexian Zou

Figure 1 for End-to-end Spoken Conversational Question Answering: Task, Dataset and Model
Figure 2 for End-to-end Spoken Conversational Question Answering: Task, Dataset and Model
Figure 3 for End-to-end Spoken Conversational Question Answering: Task, Dataset and Model
Figure 4 for End-to-end Spoken Conversational Question Answering: Task, Dataset and Model
Viaarxiv icon