Alert button

"speech": models, code, and papers
Alert button

Two-Stage Voice Anonymization for Enhanced Privacy

Jun 28, 2023
Francesco Nespoli, Daniel Barreda, Joerg Bitzer, Patrick A. Naylor

Figure 1 for Two-Stage Voice Anonymization for Enhanced Privacy
Figure 2 for Two-Stage Voice Anonymization for Enhanced Privacy
Figure 3 for Two-Stage Voice Anonymization for Enhanced Privacy
Figure 4 for Two-Stage Voice Anonymization for Enhanced Privacy
Viaarxiv icon

TAPLoss: A Temporal Acoustic Parameter Loss for Speech Enhancement

Add code
Bookmark button
Alert button
Feb 16, 2023
Yunyang Zeng, Joseph Konan, Shuo Han, David Bick, Muqiao Yang, Anurag Kumar, Shinji Watanabe, Bhiksha Raj

Figure 1 for TAPLoss: A Temporal Acoustic Parameter Loss for Speech Enhancement
Figure 2 for TAPLoss: A Temporal Acoustic Parameter Loss for Speech Enhancement
Figure 3 for TAPLoss: A Temporal Acoustic Parameter Loss for Speech Enhancement
Figure 4 for TAPLoss: A Temporal Acoustic Parameter Loss for Speech Enhancement
Viaarxiv icon

Developmental Bootstrapping of AIs

Aug 08, 2023
Mark Stefik, Robert Price

Figure 1 for Developmental Bootstrapping of AIs
Figure 2 for Developmental Bootstrapping of AIs
Figure 3 for Developmental Bootstrapping of AIs
Figure 4 for Developmental Bootstrapping of AIs
Viaarxiv icon

Pretraining Conformer with ASR or ASV for Anti-Spoofing Countermeasure

Add code
Bookmark button
Alert button
Jul 04, 2023
Yikang Wang, Hiromitsu Nishizaki, Ming Li

Figure 1 for Pretraining Conformer with ASR or ASV for Anti-Spoofing Countermeasure
Figure 2 for Pretraining Conformer with ASR or ASV for Anti-Spoofing Countermeasure
Figure 3 for Pretraining Conformer with ASR or ASV for Anti-Spoofing Countermeasure
Figure 4 for Pretraining Conformer with ASR or ASV for Anti-Spoofing Countermeasure
Viaarxiv icon

Exploration of Language Dependency for Japanese Self-Supervised Speech Representation Models

Add code
Bookmark button
Alert button
May 09, 2023
Takanori Ashihara, Takafumi Moriya, Kohei Matsuura, Tomohiro Tanaka

Figure 1 for Exploration of Language Dependency for Japanese Self-Supervised Speech Representation Models
Figure 2 for Exploration of Language Dependency for Japanese Self-Supervised Speech Representation Models
Figure 3 for Exploration of Language Dependency for Japanese Self-Supervised Speech Representation Models
Figure 4 for Exploration of Language Dependency for Japanese Self-Supervised Speech Representation Models
Viaarxiv icon

SpeechLMScore: Evaluating speech generation using speech language model

Add code
Bookmark button
Alert button
Dec 08, 2022
Soumi Maiti, Yifan Peng, Takaaki Saeki, Shinji Watanabe

Figure 1 for SpeechLMScore: Evaluating speech generation using speech language model
Figure 2 for SpeechLMScore: Evaluating speech generation using speech language model
Figure 3 for SpeechLMScore: Evaluating speech generation using speech language model
Figure 4 for SpeechLMScore: Evaluating speech generation using speech language model
Viaarxiv icon

DSARSR: Deep Stacked Auto-encoders Enhanced Robust Speaker Recognition

Jul 06, 2023
Zhifeng Wang, Chunyan Zeng, Surong Duan, Hongjie Ouyang, Hongmin Xu

Figure 1 for DSARSR: Deep Stacked Auto-encoders Enhanced Robust Speaker Recognition
Figure 2 for DSARSR: Deep Stacked Auto-encoders Enhanced Robust Speaker Recognition
Figure 3 for DSARSR: Deep Stacked Auto-encoders Enhanced Robust Speaker Recognition
Figure 4 for DSARSR: Deep Stacked Auto-encoders Enhanced Robust Speaker Recognition
Viaarxiv icon

Speech Signal Improvement Using Causal Generative Diffusion Models

Add code
Bookmark button
Alert button
Mar 15, 2023
Julius Richter, Simon Welker, Jean-Marie Lemercier, Bunlong Lay, Tal Peer, Timo Gerkmann

Figure 1 for Speech Signal Improvement Using Causal Generative Diffusion Models
Figure 2 for Speech Signal Improvement Using Causal Generative Diffusion Models
Viaarxiv icon

A Time-Frequency Generative Adversarial based method for Audio Packet Loss Concealment

Add code
Bookmark button
Alert button
Jul 28, 2023
Carlo Aironi, Samuele Cornell, Luca Serafini, Stefano Squartini

Figure 1 for A Time-Frequency Generative Adversarial based method for Audio Packet Loss Concealment
Figure 2 for A Time-Frequency Generative Adversarial based method for Audio Packet Loss Concealment
Figure 3 for A Time-Frequency Generative Adversarial based method for Audio Packet Loss Concealment
Figure 4 for A Time-Frequency Generative Adversarial based method for Audio Packet Loss Concealment
Viaarxiv icon

Signal Reconstruction from Mel-spectrogram Based on Bi-level Consistency of Full-band Magnitude and Phase

Add code
Bookmark button
Alert button
Jul 23, 2023
Yoshiki Masuyama, Natsuki Ueno, Nobutaka Ono

Figure 1 for Signal Reconstruction from Mel-spectrogram Based on Bi-level Consistency of Full-band Magnitude and Phase
Figure 2 for Signal Reconstruction from Mel-spectrogram Based on Bi-level Consistency of Full-band Magnitude and Phase
Figure 3 for Signal Reconstruction from Mel-spectrogram Based on Bi-level Consistency of Full-band Magnitude and Phase
Figure 4 for Signal Reconstruction from Mel-spectrogram Based on Bi-level Consistency of Full-band Magnitude and Phase
Viaarxiv icon