Alert button

"speech": models, code, and papers
Alert button

A deep complex network with multi-frame filtering for stereophonic acoustic echo cancellation

Feb 03, 2022
Linjuan Cheng, Chengshi Zheng, Andong Li, Renhua Peng, Xiaodong Li

Figure 1 for A deep complex network with multi-frame filtering for stereophonic acoustic echo cancellation
Figure 2 for A deep complex network with multi-frame filtering for stereophonic acoustic echo cancellation
Figure 3 for A deep complex network with multi-frame filtering for stereophonic acoustic echo cancellation
Figure 4 for A deep complex network with multi-frame filtering for stereophonic acoustic echo cancellation
Viaarxiv icon

ESPnet: End-to-End Speech Processing Toolkit

Mar 30, 2018
Shinji Watanabe, Takaaki Hori, Shigeki Karita, Tomoki Hayashi, Jiro Nishitoba, Yuya Unno, Nelson Enrique Yalta Soplin, Jahn Heymann, Matthew Wiesner, Nanxin Chen, Adithya Renduchintala, Tsubasa Ochiai

Figure 1 for ESPnet: End-to-End Speech Processing Toolkit
Figure 2 for ESPnet: End-to-End Speech Processing Toolkit
Figure 3 for ESPnet: End-to-End Speech Processing Toolkit
Figure 4 for ESPnet: End-to-End Speech Processing Toolkit
Viaarxiv icon

Hate Speech Dataset from a White Supremacy Forum

Sep 12, 2018
Ona de Gibert, Naiara Perez, Aitor García-Pablos, Montse Cuadros

Figure 1 for Hate Speech Dataset from a White Supremacy Forum
Figure 2 for Hate Speech Dataset from a White Supremacy Forum
Figure 3 for Hate Speech Dataset from a White Supremacy Forum
Figure 4 for Hate Speech Dataset from a White Supremacy Forum
Viaarxiv icon

End-to-end Microphone Permutation and Number Invariant Multi-channel Speech Separation

Nov 26, 2019
Yi Luo, Zhuo Chen, Nima Mesgarani, Takuya Yoshioka

Figure 1 for End-to-end Microphone Permutation and Number Invariant Multi-channel Speech Separation
Figure 2 for End-to-end Microphone Permutation and Number Invariant Multi-channel Speech Separation
Figure 3 for End-to-end Microphone Permutation and Number Invariant Multi-channel Speech Separation
Viaarxiv icon

Query Expansion and Entity Weighting for Query Reformulation Retrieval in Voice Assistant Systems

Feb 22, 2022
Zhongkai Sun, Sixing Lu, Chengyuan Ma, Xiaohu Liu, Chenlei Guo

Figure 1 for Query Expansion and Entity Weighting for Query Reformulation Retrieval in Voice Assistant Systems
Figure 2 for Query Expansion and Entity Weighting for Query Reformulation Retrieval in Voice Assistant Systems
Figure 3 for Query Expansion and Entity Weighting for Query Reformulation Retrieval in Voice Assistant Systems
Figure 4 for Query Expansion and Entity Weighting for Query Reformulation Retrieval in Voice Assistant Systems
Viaarxiv icon

Improving the Robustness of Speech Translation

Nov 02, 2018
Xiang Li, Haiyang Xue, Wei Chen, Yang Liu, Yang Feng, Qun Liu

Figure 1 for Improving the Robustness of Speech Translation
Figure 2 for Improving the Robustness of Speech Translation
Figure 3 for Improving the Robustness of Speech Translation
Figure 4 for Improving the Robustness of Speech Translation
Viaarxiv icon

Joint Multi-Channel Dereverberation and Noise Reduction Using a Unified Convolutional Beamformer With Sparse Priors

Jun 03, 2021
Henri Gode, Marvin Tammen, Simon Doclo

Figure 1 for Joint Multi-Channel Dereverberation and Noise Reduction Using a Unified Convolutional Beamformer With Sparse Priors
Figure 2 for Joint Multi-Channel Dereverberation and Noise Reduction Using a Unified Convolutional Beamformer With Sparse Priors
Viaarxiv icon

Improving Stability of LS-GANs for Audio and Speech Signals

Aug 12, 2020
Mohammad Esmaeilpour, Raymel Alfonso Sallo, Olivier St-Georges, Patrick Cardinal, Alessandro Lameiras Koerich

Figure 1 for Improving Stability of LS-GANs for Audio and Speech Signals
Figure 2 for Improving Stability of LS-GANs for Audio and Speech Signals
Figure 3 for Improving Stability of LS-GANs for Audio and Speech Signals
Figure 4 for Improving Stability of LS-GANs for Audio and Speech Signals
Viaarxiv icon

Where are we in semantic concept extraction for Spoken Language Understanding?

Jun 24, 2021
Sahar Ghannay, Antoine Caubrière, Salima Mdhaffar, Gaëlle Laperrière, Bassam Jabaian, Yannick Estève

Figure 1 for Where are we in semantic concept extraction for Spoken Language Understanding?
Figure 2 for Where are we in semantic concept extraction for Spoken Language Understanding?
Figure 3 for Where are we in semantic concept extraction for Spoken Language Understanding?
Viaarxiv icon

Streaming End-to-end Speech Recognition For Mobile Devices

Nov 15, 2018
Yanzhang He, Tara N. Sainath, Rohit Prabhavalkar, Ian McGraw, Raziel Alvarez, Ding Zhao, David Rybach, Anjuli Kannan, Yonghui Wu, Ruoming Pang, Qiao Liang, Deepti Bhatia, Yuan Shangguan, Bo Li, Golan Pundak, Khe Chai Sim, Tom Bagby, Shuo-yiin Chang, Kanishka Rao, Alexander Gruenstein

Figure 1 for Streaming End-to-end Speech Recognition For Mobile Devices
Figure 2 for Streaming End-to-end Speech Recognition For Mobile Devices
Figure 3 for Streaming End-to-end Speech Recognition For Mobile Devices
Figure 4 for Streaming End-to-end Speech Recognition For Mobile Devices
Viaarxiv icon