Alert button
Picture for Dung Tran

Dung Tran

Alert button

uaMix-MAE: Efficient Tuning of Pretrained Audio Transformers with Unsupervised Audio Mixtures

Add code
Bookmark button
Alert button
Mar 14, 2024
Afrina Tabassum, Dung Tran, Trung Dang, Ismini Lourentzou, Kazuhito Koishida

Figure 1 for uaMix-MAE: Efficient Tuning of Pretrained Audio Transformers with Unsupervised Audio Mixtures
Figure 2 for uaMix-MAE: Efficient Tuning of Pretrained Audio Transformers with Unsupervised Audio Mixtures
Figure 3 for uaMix-MAE: Efficient Tuning of Pretrained Audio Transformers with Unsupervised Audio Mixtures
Figure 4 for uaMix-MAE: Efficient Tuning of Pretrained Audio Transformers with Unsupervised Audio Mixtures
Viaarxiv icon

Learned Image Compression with Text Quality Enhancement

Add code
Bookmark button
Alert button
Feb 13, 2024
Chih-Yu Lai, Dung Tran, Kazuhito Koishida

Viaarxiv icon

Accelerating Diffusion-Based Text-to-Audio Generation with Consistency Distillation

Add code
Bookmark button
Alert button
Sep 19, 2023
Yatong Bai, Trung Dang, Dung Tran, Kazuhito Koishida, Somayeh Sojoudi

Figure 1 for Accelerating Diffusion-Based Text-to-Audio Generation with Consistency Distillation
Figure 2 for Accelerating Diffusion-Based Text-to-Audio Generation with Consistency Distillation
Figure 3 for Accelerating Diffusion-Based Text-to-Audio Generation with Consistency Distillation
Viaarxiv icon

Corrupting Data to Remove Deceptive Perturbation: Using Preprocessing Method to Improve System Robustness

Add code
Bookmark button
Alert button
Jan 05, 2022
Hieu Le, Hans Walker, Dung Tran, Peter Chin

Figure 1 for Corrupting Data to Remove Deceptive Perturbation: Using Preprocessing Method to Improve System Robustness
Figure 2 for Corrupting Data to Remove Deceptive Perturbation: Using Preprocessing Method to Improve System Robustness
Figure 3 for Corrupting Data to Remove Deceptive Perturbation: Using Preprocessing Method to Improve System Robustness
Figure 4 for Corrupting Data to Remove Deceptive Perturbation: Using Preprocessing Method to Improve System Robustness
Viaarxiv icon

Augmented Contrastive Self-Supervised Learning for Audio Invariant Representations

Add code
Bookmark button
Alert button
Dec 21, 2021
Melikasadat Emami, Dung Tran, Kazuhito Koishida

Figure 1 for Augmented Contrastive Self-Supervised Learning for Audio Invariant Representations
Figure 2 for Augmented Contrastive Self-Supervised Learning for Audio Invariant Representations
Figure 3 for Augmented Contrastive Self-Supervised Learning for Audio Invariant Representations
Figure 4 for Augmented Contrastive Self-Supervised Learning for Audio Invariant Representations
Viaarxiv icon

Training Robust Zero-Shot Voice Conversion Models with Self-supervised Features

Add code
Bookmark button
Alert button
Dec 08, 2021
Trung Dang, Dung Tran, Peter Chin, Kazuhito Koishida

Figure 1 for Training Robust Zero-Shot Voice Conversion Models with Self-supervised Features
Figure 2 for Training Robust Zero-Shot Voice Conversion Models with Self-supervised Features
Figure 3 for Training Robust Zero-Shot Voice Conversion Models with Self-supervised Features
Viaarxiv icon

An Investigation of End-to-End Multichannel Speech Recognition for Reverberant and Mismatch Conditions

Add code
Bookmark button
Alert button
Apr 28, 2019
Aswin Shanmugam Subramanian, Xiaofei Wang, Shinji Watanabe, Toru Taniguchi, Dung Tran, Yuya Fujita

Figure 1 for An Investigation of End-to-End Multichannel Speech Recognition for Reverberant and Mismatch Conditions
Figure 2 for An Investigation of End-to-End Multichannel Speech Recognition for Reverberant and Mismatch Conditions
Figure 3 for An Investigation of End-to-End Multichannel Speech Recognition for Reverberant and Mismatch Conditions
Viaarxiv icon

Speaker Selective Beamformer with Keyword Mask Estimation

Add code
Bookmark button
Alert button
Oct 25, 2018
Yusuke Kida, Dung Tran, Motoi Omachi, Toru Taniguchi, Yuya Fujita

Figure 1 for Speaker Selective Beamformer with Keyword Mask Estimation
Figure 2 for Speaker Selective Beamformer with Keyword Mask Estimation
Figure 3 for Speaker Selective Beamformer with Keyword Mask Estimation
Figure 4 for Speaker Selective Beamformer with Keyword Mask Estimation
Viaarxiv icon