Alert button
Picture for Pavel Denisov

Pavel Denisov

Alert button

The IMS Toucan System for the Blizzard Challenge 2023

Add code
Bookmark button
Alert button
Oct 26, 2023
Florian Lux, Julia Koch, Sarina Meyer, Thomas Bott, Nadja Schauffler, Pavel Denisov, Antje Schweitzer, Ngoc Thang Vu

Viaarxiv icon

Leveraging Multilingual Self-Supervised Pretrained Models for Sequence-to-Sequence End-to-End Spoken Language Understanding

Add code
Bookmark button
Alert button
Oct 09, 2023
Pavel Denisov, Ngoc Thang Vu

Viaarxiv icon

Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study

Add code
Bookmark button
Alert button
Sep 27, 2023
Xuankai Chang, Brian Yan, Kwanghee Choi, Jeeweon Jung, Yichen Lu, Soumi Maiti, Roshan Sharma, Jiatong Shi, Jinchuan Tian, Shinji Watanabe, Yuya Fujita, Takashi Maekaku, Pengcheng Guo, Yao-Fei Cheng, Pavel Denisov, Kohei Saijo, Hsiu-Hsuan Wang

Figure 1 for Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study
Figure 2 for Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study
Figure 3 for Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study
Figure 4 for Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study
Viaarxiv icon

Anonymizing Speech with Generative Adversarial Networks to Preserve Speaker Privacy

Add code
Bookmark button
Alert button
Oct 20, 2022
Sarina Meyer, Pascal Tilli, Pavel Denisov, Florian Lux, Julia Koch, Ngoc Thang Vu

Figure 1 for Anonymizing Speech with Generative Adversarial Networks to Preserve Speaker Privacy
Figure 2 for Anonymizing Speech with Generative Adversarial Networks to Preserve Speaker Privacy
Figure 3 for Anonymizing Speech with Generative Adversarial Networks to Preserve Speaker Privacy
Figure 4 for Anonymizing Speech with Generative Adversarial Networks to Preserve Speaker Privacy
Viaarxiv icon

Speaker Anonymization with Phonetic Intermediate Representations

Add code
Bookmark button
Alert button
Jul 11, 2022
Sarina Meyer, Florian Lux, Pavel Denisov, Julia Koch, Pascal Tilli, Ngoc Thang Vu

Figure 1 for Speaker Anonymization with Phonetic Intermediate Representations
Figure 2 for Speaker Anonymization with Phonetic Intermediate Representations
Figure 3 for Speaker Anonymization with Phonetic Intermediate Representations
Figure 4 for Speaker Anonymization with Phonetic Intermediate Representations
Viaarxiv icon

ESPnet-SLU: Advancing Spoken Language Understanding through ESPnet

Add code
Bookmark button
Alert button
Nov 29, 2021
Siddhant Arora, Siddharth Dalmia, Pavel Denisov, Xuankai Chang, Yushi Ueda, Yifan Peng, Yuekai Zhang, Sujay Kumar, Karthik Ganesan, Brian Yan, Ngoc Thang Vu, Alan W Black, Shinji Watanabe

Figure 1 for ESPnet-SLU: Advancing Spoken Language Understanding through ESPnet
Figure 2 for ESPnet-SLU: Advancing Spoken Language Understanding through ESPnet
Figure 3 for ESPnet-SLU: Advancing Spoken Language Understanding through ESPnet
Figure 4 for ESPnet-SLU: Advancing Spoken Language Understanding through ESPnet
Viaarxiv icon

Investigations on Speech Recognition Systems for Low-Resource Dialectal Arabic-English Code-Switching Speech

Add code
Bookmark button
Alert button
Aug 29, 2021
Injy Hamed, Pavel Denisov, Chia-Yu Li, Mohamed Elmahdy, Slim Abdennadher, Ngoc Thang Vu

Figure 1 for Investigations on Speech Recognition Systems for Low-Resource Dialectal Arabic-English Code-Switching Speech
Figure 2 for Investigations on Speech Recognition Systems for Low-Resource Dialectal Arabic-English Code-Switching Speech
Figure 3 for Investigations on Speech Recognition Systems for Low-Resource Dialectal Arabic-English Code-Switching Speech
Figure 4 for Investigations on Speech Recognition Systems for Low-Resource Dialectal Arabic-English Code-Switching Speech
Viaarxiv icon

IMS' Systems for the IWSLT 2021 Low-Resource Speech Translation Task

Add code
Bookmark button
Alert button
Jun 30, 2021
Pavel Denisov, Manuel Mager, Ngoc Thang Vu

Figure 1 for IMS' Systems for the IWSLT 2021 Low-Resource Speech Translation Task
Figure 2 for IMS' Systems for the IWSLT 2021 Low-Resource Speech Translation Task
Figure 3 for IMS' Systems for the IWSLT 2021 Low-Resource Speech Translation Task
Figure 4 for IMS' Systems for the IWSLT 2021 Low-Resource Speech Translation Task
Viaarxiv icon

Pretrained Semantic Speech Embeddings for End-to-End Spoken Language Understanding via Cross-Modal Teacher-Student Learning

Add code
Bookmark button
Alert button
Jul 03, 2020
Pavel Denisov, Ngoc Thang Vu

Figure 1 for Pretrained Semantic Speech Embeddings for End-to-End Spoken Language Understanding via Cross-Modal Teacher-Student Learning
Figure 2 for Pretrained Semantic Speech Embeddings for End-to-End Spoken Language Understanding via Cross-Modal Teacher-Student Learning
Figure 3 for Pretrained Semantic Speech Embeddings for End-to-End Spoken Language Understanding via Cross-Modal Teacher-Student Learning
Figure 4 for Pretrained Semantic Speech Embeddings for End-to-End Spoken Language Understanding via Cross-Modal Teacher-Student Learning
Viaarxiv icon

ADVISER: A Toolkit for Developing Multi-modal, Multi-domain and Socially-engaged Conversational Agents

Add code
Bookmark button
Alert button
May 04, 2020
Chia-Yu Li, Daniel Ortega, Dirk Väth, Florian Lux, Lindsey Vanderlyn, Maximilian Schmidt, Michael Neumann, Moritz Völkel, Pavel Denisov, Sabrina Jenne, Zorica Kacarevic, Ngoc Thang Vu

Figure 1 for ADVISER: A Toolkit for Developing Multi-modal, Multi-domain and Socially-engaged Conversational Agents
Figure 2 for ADVISER: A Toolkit for Developing Multi-modal, Multi-domain and Socially-engaged Conversational Agents
Viaarxiv icon