Yuichiro Koyama

Zero- and Few-shot Sound Event Localization and Detection

Sep 17, 2023
Kazuki Shimada, Kengo Uchida, Yuichiro Koyama, Takashi Shibuya, Shusuke Takahashi, Yuki Mitsufuji, Tatsuya Kawahara

STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events

Jun 15, 2023
Kazuki Shimada, Archontis Politis, Parthasaarathy Sudarsanam, Daniel Krause, Kengo Uchida, Sharath Adavanne, Aapo Hakala, Yuichiro Koyama, Naoya Takahashi, Shusuke Takahashi, Tuomas Virtanen, Yuki Mitsufuji

Diffusion-Based Speech Enhancement with Joint Generative and Predictive Decoders

May 18, 2023
Hao Shi, Kazuki Shimada, Masato Hirano, Takashi Shibuya, Yuichiro Koyama, Zhi Zhong, Shusuke Takahashi, Tatsuya Kawahara, Yuki Mitsufuji

Diffusion-based Signal Refiner for Speech Separation

May 12, 2023
Masato Hirano, Kazuki Shimada, Yuichiro Koyama, Shusuke Takahashi, Yuki Mitsufuji

STARSS22: A dataset of spatial recordings of real scenes with spatiotemporal annotations of sound events

Jun 04, 2022
Archontis Politis, Kazuki Shimada, Parthasaarathy Sudarsanam, Sharath Adavanne, Daniel Krause, Yuichiro Koyama, Naoya Takahashi, Shusuke Takahashi, Yuki Mitsufuji, Tuomas Virtanen

Removing Distortion Effects in Music Using Deep Neural Networks

Feb 03, 2022
Johannes Imort, Giorgio Fabbro, Marco A. Martínez Ramírez, Stefan Uhlich, Yuichiro Koyama, Yuki Mitsufuji

Multi-ACCDOA: Localizing and Detecting Overlapping Sounds from the Same Class with Auxiliary Duplicating Permutation Invariant Training

Oct 14, 2021
Kazuki Shimada, Yuichiro Koyama, Shusuke Takahashi, Naoya Takahashi, Emiru Tsunoo, Yuki Mitsufuji

Spatial Data Augmentation with Simulated Room Impulse Responses for Sound Event Localization and Detection

Oct 13, 2021
Yuichiro Koyama, Kazuhide Shigemi, Masafumi Takahashi, Kazuki Shimada, Naoya Takahashi, Emiru Tsunoo, Shusuke Takahashi, Yuki Mitsufuji

Music Source Separation with Deep Equilibrium Models

Oct 13, 2021
Yuichiro Koyama, Naoki Murata, Stefan Uhlich, Giorgio Fabbro, Shusuke Takahashi, Yuki Mitsufuji

Spatial mixup: Directional loudness modification as data augmentation for sound event localization and detection

Oct 12, 2021
Ricardo Falcon-Perez, Kazuki Shimada, Yuichiro Koyama, Shusuke Takahashi, Yuki Mitsufuji
