Alert button
Picture for Yuki Mitsufuji

Yuki Mitsufuji

Alert button

The Whole Is Greater than the Sum of Its Parts: Improving DNN-based Music Source Separation

Add code
Bookmark button
Alert button
May 13, 2023
Ryosuke Sawata, Naoya Takahashi, Stefan Uhlich, Shusuke Takahashi, Yuki Mitsufuji

Figure 1 for The Whole Is Greater than the Sum of Its Parts: Improving DNN-based Music Source Separation
Figure 2 for The Whole Is Greater than the Sum of Its Parts: Improving DNN-based Music Source Separation
Figure 3 for The Whole Is Greater than the Sum of Its Parts: Improving DNN-based Music Source Separation
Figure 4 for The Whole Is Greater than the Sum of Its Parts: Improving DNN-based Music Source Separation
Viaarxiv icon

Diffusion-based Signal Refiner for Speech Separation

Add code
Bookmark button
Alert button
May 12, 2023
Masato Hirano, Kazuki Shimada, Yuichiro Koyama, Shusuke Takahashi, Yuki Mitsufuji

Figure 1 for Diffusion-based Signal Refiner for Speech Separation
Figure 2 for Diffusion-based Signal Refiner for Speech Separation
Figure 3 for Diffusion-based Signal Refiner for Speech Separation
Figure 4 for Diffusion-based Signal Refiner for Speech Separation
Viaarxiv icon

Extending Audio Masked Autoencoders Toward Audio Restoration

Add code
Bookmark button
Alert button
May 11, 2023
Zhi Zhong, Hao Shi, Masato Hirano, Kazuki Shimada, Kazuya Tateishi, Takashi Shibuya, Shusuke Takahashi, Yuki Mitsufuji

Figure 1 for Extending Audio Masked Autoencoders Toward Audio Restoration
Figure 2 for Extending Audio Masked Autoencoders Toward Audio Restoration
Figure 3 for Extending Audio Masked Autoencoders Toward Audio Restoration
Figure 4 for Extending Audio Masked Autoencoders Toward Audio Restoration
Viaarxiv icon

PeaCoK: Persona Commonsense Knowledge for Consistent and Engaging Narratives

Add code
Bookmark button
Alert button
May 03, 2023
Silin Gao, Beatriz Borges, Soyoung Oh, Deniz Bayazit, Saya Kanno, Hiromi Wakaki, Yuki Mitsufuji, Antoine Bosselut

Figure 1 for PeaCoK: Persona Commonsense Knowledge for Consistent and Engaging Narratives
Figure 2 for PeaCoK: Persona Commonsense Knowledge for Consistent and Engaging Narratives
Figure 3 for PeaCoK: Persona Commonsense Knowledge for Consistent and Engaging Narratives
Figure 4 for PeaCoK: Persona Commonsense Knowledge for Consistent and Engaging Narratives
Viaarxiv icon

Cross-modal Face- and Voice-style Transfer

Add code
Bookmark button
Alert button
Mar 01, 2023
Naoya Takahashi, Mayank K. Singh, Yuki Mitsufuji

Figure 1 for Cross-modal Face- and Voice-style Transfer
Figure 2 for Cross-modal Face- and Voice-style Transfer
Figure 3 for Cross-modal Face- and Voice-style Transfer
Figure 4 for Cross-modal Face- and Voice-style Transfer
Viaarxiv icon

An Attention-based Approach to Hierarchical Multi-label Music Instrument Classification

Add code
Bookmark button
Alert button
Feb 16, 2023
Zhi Zhong, Masato Hirano, Kazuki Shimada, Kazuya Tateishi, Shusuke Takahashi, Yuki Mitsufuji

Figure 1 for An Attention-based Approach to Hierarchical Multi-label Music Instrument Classification
Figure 2 for An Attention-based Approach to Hierarchical Multi-label Music Instrument Classification
Figure 3 for An Attention-based Approach to Hierarchical Multi-label Music Instrument Classification
Viaarxiv icon

Adversarially Slicing Generative Networks: Discriminator Slices Feature for One-Dimensional Optimal Transport

Add code
Bookmark button
Alert button
Jan 30, 2023
Yuhta Takida, Masaaki Imaizumi, Chieh-Hsin Lai, Toshimitsu Uesaka, Naoki Murata, Yuki Mitsufuji

Figure 1 for Adversarially Slicing Generative Networks: Discriminator Slices Feature for One-Dimensional Optimal Transport
Figure 2 for Adversarially Slicing Generative Networks: Discriminator Slices Feature for One-Dimensional Optimal Transport
Figure 3 for Adversarially Slicing Generative Networks: Discriminator Slices Feature for One-Dimensional Optimal Transport
Figure 4 for Adversarially Slicing Generative Networks: Discriminator Slices Feature for One-Dimensional Optimal Transport
Viaarxiv icon

GibbsDDRM: A Partially Collapsed Gibbs Sampler for Solving Blind Inverse Problems with Denoising Diffusion Restoration

Add code
Bookmark button
Alert button
Jan 30, 2023
Naoki Murata, Koichi Saito, Chieh-Hsin Lai, Yuhta Takida, Toshimitsu Uesaka, Yuki Mitsufuji, Stefano Ermon

Figure 1 for GibbsDDRM: A Partially Collapsed Gibbs Sampler for Solving Blind Inverse Problems with Denoising Diffusion Restoration
Figure 2 for GibbsDDRM: A Partially Collapsed Gibbs Sampler for Solving Blind Inverse Problems with Denoising Diffusion Restoration
Figure 3 for GibbsDDRM: A Partially Collapsed Gibbs Sampler for Solving Blind Inverse Problems with Denoising Diffusion Restoration
Figure 4 for GibbsDDRM: A Partially Collapsed Gibbs Sampler for Solving Blind Inverse Problems with Denoising Diffusion Restoration
Viaarxiv icon

CLIPSep: Learning Text-queried Sound Separation with Noisy Unlabeled Videos

Add code
Bookmark button
Alert button
Dec 14, 2022
Hao-Wen Dong, Naoya Takahashi, Yuki Mitsufuji, Julian McAuley, Taylor Berg-Kirkpatrick

Figure 1 for CLIPSep: Learning Text-queried Sound Separation with Noisy Unlabeled Videos
Figure 2 for CLIPSep: Learning Text-queried Sound Separation with Noisy Unlabeled Videos
Figure 3 for CLIPSep: Learning Text-queried Sound Separation with Noisy Unlabeled Videos
Figure 4 for CLIPSep: Learning Text-queried Sound Separation with Noisy Unlabeled Videos
Viaarxiv icon

Unsupervised vocal dereverberation with diffusion-based generative models

Add code
Bookmark button
Alert button
Nov 08, 2022
Koichi Saito, Naoki Murata, Toshimitsu Uesaka, Chieh-Hsin Lai, Yuhta Takida, Takao Fukui, Yuki Mitsufuji

Figure 1 for Unsupervised vocal dereverberation with diffusion-based generative models
Figure 2 for Unsupervised vocal dereverberation with diffusion-based generative models
Figure 3 for Unsupervised vocal dereverberation with diffusion-based generative models
Figure 4 for Unsupervised vocal dereverberation with diffusion-based generative models
Viaarxiv icon