Néstor Becerra Yoma

Multichannel Robot Speech Recognition Database: MChRSR

Dec 30, 2017

José Novoa, Juan Pablo Escudero, Josué Fredes, Jorge Wuth, Rodrigo Mahu, Néstor Becerra Yoma

Abstract: In real human-robot interaction (HRI) scenarios, speech recognition is a major challenge because of robot noise, background noise, and a time-varying acoustic channel. This document describes the procedure used to obtain the Multichannel Robot Speech Recognition Database (MChRSR). The database comprises 12 hours of multichannel evaluation data recorded in a real mobile HRI scenario with a PR2 robot performing different translational and azimuthal movements. In total, 16 evaluation sets were obtained by re-recording the clean set of the Aurora-4 database under different movement conditions.

DNN-based uncertainty estimation for weighted DNN-HMM ASR

May 29, 2017

José Novoa, Josué Fredes, Néstor Becerra Yoma

Abstract: In this paper, the uncertainty is defined as the mean square error between a given enhanced noisy observation vector and the corresponding clean one. A DNN is then trained on a training database, with enhanced noisy observation vectors as input and the uncertainty as output. In testing, the DNN receives an enhanced noisy observation vector and delivers the estimated uncertainty. This uncertainty is employed in combination with a weighted DNN-HMM based speech recognition system and compared with an existing estimate of the noise-cancelling uncertainty variance based on an additive noise model. Experiments were carried out with the Aurora-4 task. Results with clean, multi-noise, and multi-condition training are presented.
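
The uncertainty definition in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `uncertainty_targets`, the frames-by-features array layout, and the choice to average the squared error over the feature dimension (rather than keep one value per feature) are all assumptions made for illustration. The resulting values are what a regression DNN could be trained to predict from the enhanced vectors.

```python
import numpy as np


def uncertainty_targets(enhanced, clean):
    """Per-frame mean square error between enhanced and clean feature vectors.

    Both inputs are assumed to have shape (num_frames, num_features).
    Averaging over the feature axis is an assumption of this sketch.
    """
    enhanced = np.asarray(enhanced, dtype=float)
    clean = np.asarray(clean, dtype=float)
    if enhanced.shape != clean.shape:
        raise ValueError("enhanced and clean features must have the same shape")
    # Squared error per feature, then mean over the feature axis of each frame.
    return np.mean((enhanced - clean) ** 2, axis=-1)


# Toy example: two frames with two features each (made-up numbers).
enhanced = [[1.0, 2.0], [3.0, 5.0]]
clean = [[1.0, 0.0], [3.0, 3.0]]
targets = uncertainty_targets(enhanced, clean)
```

In a training setup along the lines described above, `enhanced` would be the network input and `targets` the regression output; at test time only the enhanced vectors are available and the network supplies the estimate.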
