"speech": models, code, and papers

Multilingual Zero Resource Speech Recognition Base on Self-Supervise Pre-Trained Acoustic Models

Oct 13, 2022
Haoyu Wang, Wei-Qiang Zhang, Hongbin Suo, Yulong Wan

Via arXiv

Disentangled Speech Representation Learning for One-Shot Cross-lingual Voice Conversion Using $β$-VAE

Oct 25, 2022
Hui Lu, Disong Wang, Xixin Wu, Zhiyong Wu, Xunying Liu, Helen Meng

Via arXiv

Joint Speech Activity and Overlap Detection with Multi-Exit Architecture

Sep 24, 2022
Ziqing Du, Kai Liu, Xucheng Wan, Huan Zhou

Via arXiv

A Review of Challenges in Machine Learning based Automated Hate Speech Detection

Sep 12, 2022
Abhishek Velankar, Hrushikesh Patil, Raviraj Joshi

Via arXiv

Speech Segmentation Optimization using Segmented Bilingual Speech Corpus for End-to-end Speech Translation

Mar 29, 2022
Ryo Fukuda, Katsuhito Sudoh, Satoshi Nakamura

Via arXiv

Channel-Aware Pretraining of Joint Encoder-Decoder Self-Supervised Model for Telephonic-Speech ASR

Nov 03, 2022
Vrunda N. Sukhadia, A. Arunkumar, S. Umesh

Via arXiv

Mispronunciation Detection of Basic Quranic Recitation Rules using Deep Learning

May 10, 2023
Ahmad Al Harere, Khloud Al Jallad

Via arXiv

GestureDiffuCLIP: Gesture Diffusion Model with CLIP Latents

Mar 26, 2023
Tenglong Ao, Zeyi Zhang, Libin Liu

Via arXiv

Localizing Spatial Information in Neural Spatiospectral Filters

Mar 14, 2023
Annika Briegleb, Thomas Haubner, Vasileios Belagiannis, Walter Kellermann

Via arXiv

SumREN: Summarizing Reported Speech about Events in News

Dec 02, 2022
Revanth Gangi Reddy, Heba Elfardy, Hou Pong Chan, Kevin Small, Heng Ji

Via arXiv