Alert button
Picture for Jan Skoglund

Jan Skoglund

Alert button

NOMAD: Unsupervised Learning of Perceptual Embeddings for Speech Enhancement and Non-matching Reference Audio Quality Assessment

Add code
Bookmark button
Alert button
Sep 28, 2023
Alessandro Ragano, Jan Skoglund, Andrew Hines

Viaarxiv icon

LMCodec: A Low Bitrate Speech Codec With Causal Transformer Models

Add code
Bookmark button
Alert button
Mar 23, 2023
Teerapat Jenrungrot, Michael Chinen, W. Bastiaan Kleijn, Jan Skoglund, Zalán Borsos, Neil Zeghidour, Marco Tagliasacchi

Figure 1 for LMCodec: A Low Bitrate Speech Codec With Causal Transformer Models
Figure 2 for LMCodec: A Low Bitrate Speech Codec With Causal Transformer Models
Figure 3 for LMCodec: A Low Bitrate Speech Codec With Causal Transformer Models
Figure 4 for LMCodec: A Low Bitrate Speech Codec With Causal Transformer Models
Viaarxiv icon

Using Rater and System Metadata to Explain Variance in the VoiceMOS Challenge 2022 Dataset

Add code
Bookmark button
Alert button
Sep 14, 2022
Michael Chinen, Jan Skoglund, Chandan K A Reddy, Alessandro Ragano, Andrew Hines

Figure 1 for Using Rater and System Metadata to Explain Variance in the VoiceMOS Challenge 2022 Dataset
Figure 2 for Using Rater and System Metadata to Explain Variance in the VoiceMOS Challenge 2022 Dataset
Figure 3 for Using Rater and System Metadata to Explain Variance in the VoiceMOS Challenge 2022 Dataset
Viaarxiv icon

Ultra-Low-Bitrate Speech Coding with Pretrained Transformers

Add code
Bookmark button
Alert button
Jul 05, 2022
Ali Siahkoohi, Michael Chinen, Tom Denton, W. Bastiaan Kleijn, Jan Skoglund

Figure 1 for Ultra-Low-Bitrate Speech Coding with Pretrained Transformers
Figure 2 for Ultra-Low-Bitrate Speech Coding with Pretrained Transformers
Figure 3 for Ultra-Low-Bitrate Speech Coding with Pretrained Transformers
Viaarxiv icon

A Comparison of Deep Learning MOS Predictors for Speech Synthesis Quality

Add code
Bookmark button
Alert button
Apr 05, 2022
Alessandro Ragano, Emmanouil Benetos, Michael Chinen, Helard B. Martinez, Chandan K. A. Reddy, Jan Skoglund, Andrew Hines

Figure 1 for A Comparison of Deep Learning MOS Predictors for Speech Synthesis Quality
Figure 2 for A Comparison of Deep Learning MOS Predictors for Speech Synthesis Quality
Figure 3 for A Comparison of Deep Learning MOS Predictors for Speech Synthesis Quality
Figure 4 for A Comparison of Deep Learning MOS Predictors for Speech Synthesis Quality
Viaarxiv icon

SoundStream: An End-to-End Neural Audio Codec

Add code
Bookmark button
Alert button
Jul 07, 2021
Neil Zeghidour, Alejandro Luebs, Ahmed Omran, Jan Skoglund, Marco Tagliasacchi

Figure 1 for SoundStream: An End-to-End Neural Audio Codec
Figure 2 for SoundStream: An End-to-End Neural Audio Codec
Figure 3 for SoundStream: An End-to-End Neural Audio Codec
Figure 4 for SoundStream: An End-to-End Neural Audio Codec
Viaarxiv icon

Handling Background Noise in Neural Speech Generation

Add code
Bookmark button
Alert button
Feb 23, 2021
Tom Denton, Alejandro Luebs, Felicia S. C. Lim, Andrew Storus, Hengchin Yeh, W. Bastiaan Kleijn, Jan Skoglund

Figure 1 for Handling Background Noise in Neural Speech Generation
Figure 2 for Handling Background Noise in Neural Speech Generation
Figure 3 for Handling Background Noise in Neural Speech Generation
Figure 4 for Handling Background Noise in Neural Speech Generation
Viaarxiv icon

WARP-Q: Quality Prediction For Generative Neural Speech Codecs

Add code
Bookmark button
Alert button
Feb 20, 2021
Wissam A. Jassim, Jan Skoglund, Michael Chinen, Andrew Hines

Figure 1 for WARP-Q: Quality Prediction For Generative Neural Speech Codecs
Figure 2 for WARP-Q: Quality Prediction For Generative Neural Speech Codecs
Figure 3 for WARP-Q: Quality Prediction For Generative Neural Speech Codecs
Figure 4 for WARP-Q: Quality Prediction For Generative Neural Speech Codecs
Viaarxiv icon

Generative Speech Coding with Predictive Variance Regularization

Add code
Bookmark button
Alert button
Feb 18, 2021
W. Bastiaan Kleijn, Andrew Storus, Michael Chinen, Tom Denton, Felicia S. C. Lim, Alejandro Luebs, Jan Skoglund, Hengchin Yeh

Figure 1 for Generative Speech Coding with Predictive Variance Regularization
Figure 2 for Generative Speech Coding with Predictive Variance Regularization
Figure 3 for Generative Speech Coding with Predictive Variance Regularization
Viaarxiv icon

A Real-Time Wideband Neural Vocoder at 1.6 kb/s Using LPCNet

Add code
Bookmark button
Alert button
Mar 28, 2019
Jean-Marc Valin, Jan Skoglund

Figure 1 for A Real-Time Wideband Neural Vocoder at 1.6 kb/s Using LPCNet
Figure 2 for A Real-Time Wideband Neural Vocoder at 1.6 kb/s Using LPCNet
Figure 3 for A Real-Time Wideband Neural Vocoder at 1.6 kb/s Using LPCNet
Figure 4 for A Real-Time Wideband Neural Vocoder at 1.6 kb/s Using LPCNet
Viaarxiv icon