Erik Visser

Stylebook: Content-Dependent Speaking Style Modeling for Any-to-Any Voice Conversion using Only Speech Data

Sep 12, 2023
Hyungseob Lim, Kyungguen Byun, Sunkuk Moon, Erik Visser

Via arXiv

Highly Controllable Diffusion-based Any-to-Any Voice Conversion Model with Frame-level Prosody Feature

Sep 06, 2023
Kyungguen Byun, Sunkuk Moon, Erik Visser

Via arXiv

Parameter Efficient Audio Captioning With Faithful Guidance Using Audio-text Shared Latent Representation

Sep 06, 2023
Arvind Krishna Sridhar, Yinyi Guo, Erik Visser, Rehana Mahfuz

Via arXiv

Improved Beam Search for Hallucination Mitigation in Abstractive Summarization

Dec 06, 2022
Arvind Krishna Sridhar, Erik Visser

Via arXiv

Application of Knowledge Distillation to Multi-task Speech Representation Learning

Oct 29, 2022
Mine Kerpicci, Van Nguyen, Shuhua Zhang, Erik Visser

Via arXiv

Activity report analysis with automatic single or multispan answer extraction

Sep 09, 2022
Ravi Choudhary, Arvind Krishna Sridhar, Erik Visser

Via arXiv

Multi-task Voice Activated Framework using Self-supervised Learning

Oct 12, 2021
Shehzeen Hussain, Van Nguyen, Shuhua Zhang, Erik Visser

Via arXiv

Incremental Learning Algorithm for Sound Event Detection

Mar 26, 2020
Eunjeong Koh, Fatemeh Saki, Yinyi Guo, Cheng-Yu Hung, Erik Visser

Via arXiv