Ryosuke Sawata

The Whole Is Greater than the Sum of Its Parts: Improving DNN-based Music Source Separation

May 13, 2023
Ryosuke Sawata, Naoya Takahashi, Stefan Uhlich, Shusuke Takahashi, Yuki Mitsufuji

A Versatile Diffusion-based Generative Refiner for Speech Enhancement

Oct 27, 2022
Ryosuke Sawata, Naoki Murata, Yuhta Takida, Toshimitsu Uesaka, Takashi Shibuya, Shusuke Takahashi, Yuki Mitsufuji

DiffRoll: Diffusion-based Generative Music Transcription with Unsupervised Pretraining Capability

Oct 11, 2022
Kin Wai Cheuk, Ryosuke Sawata, Toshimitsu Uesaka, Naoki Murata, Naoya Takahashi, Shusuke Takahashi, Dorien Herremans, Yuki Mitsufuji

Improving Character Error Rate Is Not Equal to Having Clean Speech: Speech Enhancement for ASR Systems with Black-box Acoustic Models

Oct 12, 2021
Ryosuke Sawata, Yosuke Kashiwagi, Shusuke Takahashi

Manifold-Aware Deep Clustering: Maximizing Angles between Embedding Vectors Based on Regular Simplex

Jun 16, 2021
Keitaro Tanaka, Ryosuke Sawata, Shusuke Takahashi
