Kazuhito Koishida

Weakly-supervised Audio Separation via Bi-modal Semantic Similarity

Apr 02, 2024
Tanvir Mahmud, Saeed Amizadeh, Kazuhito Koishida, Diana Marculescu

Via arXiv

uaMix-MAE: Efficient Tuning of Pretrained Audio Transformers with Unsupervised Audio Mixtures

Mar 14, 2024
Afrina Tabassum, Dung Tran, Trung Dang, Ismini Lourentzou, Kazuhito Koishida

Via arXiv

Learned Image Compression with Text Quality Enhancement

Feb 13, 2024
Chih-Yu Lai, Dung Tran, Kazuhito Koishida

Via arXiv

Single-channel speech enhancement using learnable loss mixup

Dec 20, 2023
Oscar Chang, Dung N. Tran, Kazuhito Koishida

Via arXiv

Automatic Disfluency Detection from Untranscribed Speech

Nov 01, 2023
Amrit Romana, Kazuhito Koishida, Emily Mower Provost

Via arXiv

Accelerating Diffusion-Based Text-to-Audio Generation with Consistency Distillation

Sep 19, 2023
Yatong Bai, Trung Dang, Dung Tran, Kazuhito Koishida, Somayeh Sojoudi

Via arXiv

Progressive Knowledge Distillation: Building Ensembles for Efficient Inference

Feb 20, 2023
Don Kurian Dennis, Abhishek Shetty, Anish Sevekari, Kazuhito Koishida, Virginia Smith

Via arXiv

SCP-GAN: Self-Correcting Discriminator Optimization for Training Consistency Preserving Metric GAN on Speech Enhancement Tasks

Oct 26, 2022
Vasily Zadorozhnyy, Qiang Ye, Kazuhito Koishida

Via arXiv

Augmented Contrastive Self-Supervised Learning for Audio Invariant Representations

Dec 21, 2021
Melikasadat Emami, Dung Tran, Kazuhito Koishida

Via arXiv

A Training Framework for Stereo-Aware Speech Enhancement using Deep Neural Networks

Dec 09, 2021
Bahareh Tolooshams, Kazuhito Koishida

Via arXiv