Alert button
Picture for Sharath Adavanne

Sharath Adavanne

Alert button

STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events

Add code
Bookmark button
Alert button
Jun 15, 2023
Kazuki Shimada, Archontis Politis, Parthasaarathy Sudarsanam, Daniel Krause, Kengo Uchida, Sharath Adavanne, Aapo Hakala, Yuichiro Koyama, Naoya Takahashi, Shusuke Takahashi, Tuomas Virtanen, Yuki Mitsufuji

Figure 1 for STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events
Figure 2 for STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events
Figure 3 for STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events
Figure 4 for STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events
Viaarxiv icon

Improving Speech Prosody of Audiobook Text-to-Speech Synthesis with Acoustic and Textual Contexts

Add code
Bookmark button
Alert button
Nov 04, 2022
Detai Xin, Sharath Adavanne, Federico Ang, Ashish Kulkarni, Shinnosuke Takamichi, Hiroshi Saruwatari

Figure 1 for Improving Speech Prosody of Audiobook Text-to-Speech Synthesis with Acoustic and Textual Contexts
Figure 2 for Improving Speech Prosody of Audiobook Text-to-Speech Synthesis with Acoustic and Textual Contexts
Figure 3 for Improving Speech Prosody of Audiobook Text-to-Speech Synthesis with Acoustic and Textual Contexts
Figure 4 for Improving Speech Prosody of Audiobook Text-to-Speech Synthesis with Acoustic and Textual Contexts
Viaarxiv icon

Context-based out-of-vocabulary word recovery for ASR systems in Indian languages

Add code
Bookmark button
Alert button
Jun 09, 2022
Arun Baby, Saranya Vinnaitherthan, Akhil Kerhalkar, Pranav Jawale, Sharath Adavanne, Nagaraj Adiga

Figure 1 for Context-based out-of-vocabulary word recovery for ASR systems in Indian languages
Figure 2 for Context-based out-of-vocabulary word recovery for ASR systems in Indian languages
Figure 3 for Context-based out-of-vocabulary word recovery for ASR systems in Indian languages
Figure 4 for Context-based out-of-vocabulary word recovery for ASR systems in Indian languages
Viaarxiv icon

STARSS22: A dataset of spatial recordings of real scenes with spatiotemporal annotations of sound events

Add code
Bookmark button
Alert button
Jun 04, 2022
Archontis Politis, Kazuki Shimada, Parthasaarathy Sudarsanam, Sharath Adavanne, Daniel Krause, Yuichiro Koyama, Naoya Takahashi, Shusuke Takahashi, Yuki Mitsufuji, Tuomas Virtanen

Figure 1 for STARSS22: A dataset of spatial recordings of real scenes with spatiotemporal annotations of sound events
Figure 2 for STARSS22: A dataset of spatial recordings of real scenes with spatiotemporal annotations of sound events
Figure 3 for STARSS22: A dataset of spatial recordings of real scenes with spatiotemporal annotations of sound events
Figure 4 for STARSS22: A dataset of spatial recordings of real scenes with spatiotemporal annotations of sound events
Viaarxiv icon

Differentiable Tracking-Based Training of Deep Learning Sound Source Localizers

Add code
Bookmark button
Alert button
Oct 29, 2021
Sharath Adavanne, Archontis Politis, Tuomas Virtanen

Figure 1 for Differentiable Tracking-Based Training of Deep Learning Sound Source Localizers
Figure 2 for Differentiable Tracking-Based Training of Deep Learning Sound Source Localizers
Figure 3 for Differentiable Tracking-Based Training of Deep Learning Sound Source Localizers
Viaarxiv icon

A Dataset of Dynamic Reverberant Sound Scenes with Directional Interferers for Sound Event Localization and Detection

Add code
Bookmark button
Alert button
Jul 04, 2021
Archontis Politis, Sharath Adavanne, Daniel Krause, Antoine Deleforge, Prerak Srivastava, Tuomas Virtanen

Figure 1 for A Dataset of Dynamic Reverberant Sound Scenes with Directional Interferers for Sound Event Localization and Detection
Figure 2 for A Dataset of Dynamic Reverberant Sound Scenes with Directional Interferers for Sound Event Localization and Detection
Figure 3 for A Dataset of Dynamic Reverberant Sound Scenes with Directional Interferers for Sound Event Localization and Detection
Figure 4 for A Dataset of Dynamic Reverberant Sound Scenes with Directional Interferers for Sound Event Localization and Detection
Viaarxiv icon

Non-native English lexicon creation for bilingual speech synthesis

Add code
Bookmark button
Alert button
Jun 21, 2021
Arun Baby, Pranav Jawale, Saranya Vinnaitherthan, Sumukh Badam, Nagaraj Adiga, Sharath Adavanne

Figure 1 for Non-native English lexicon creation for bilingual speech synthesis
Figure 2 for Non-native English lexicon creation for bilingual speech synthesis
Figure 3 for Non-native English lexicon creation for bilingual speech synthesis
Figure 4 for Non-native English lexicon creation for bilingual speech synthesis
Viaarxiv icon

Localization, Detection and Tracking of Multiple Moving Sound Sources with a Convolutional Recurrent Neural Network

Add code
Bookmark button
Alert button
Apr 29, 2019
Sharath Adavanne, Archontis Politis, Tuomas Virtanen

Figure 1 for Localization, Detection and Tracking of Multiple Moving Sound Sources with a Convolutional Recurrent Neural Network
Figure 2 for Localization, Detection and Tracking of Multiple Moving Sound Sources with a Convolutional Recurrent Neural Network
Figure 3 for Localization, Detection and Tracking of Multiple Moving Sound Sources with a Convolutional Recurrent Neural Network
Figure 4 for Localization, Detection and Tracking of Multiple Moving Sound Sources with a Convolutional Recurrent Neural Network
Viaarxiv icon

Direction of arrival estimation for multiple sound sources using convolutional recurrent neural network

Add code
Bookmark button
Alert button
Aug 05, 2018
Sharath Adavanne, Archontis Politis, Tuomas Virtanen

Figure 1 for Direction of arrival estimation for multiple sound sources using convolutional recurrent neural network
Figure 2 for Direction of arrival estimation for multiple sound sources using convolutional recurrent neural network
Figure 3 for Direction of arrival estimation for multiple sound sources using convolutional recurrent neural network
Figure 4 for Direction of arrival estimation for multiple sound sources using convolutional recurrent neural network
Viaarxiv icon