Alert button
Picture for Maurizio Omologo

Maurizio Omologo

Alert button

Leveraging Redundancy in Multiple Audio Signals for Far-Field Speech Recognition

Mar 01, 2023
Feng-Ju Chang, Anastasios Alexandridis, Rupak Vignesh Swaminathan, Martin Radfar, Harish Mallidi, Maurizio Omologo, Athanasios Mouchtaris, Brian King, Roland Maas

Figure 1 for Leveraging Redundancy in Multiple Audio Signals for Far-Field Speech Recognition
Figure 2 for Leveraging Redundancy in Multiple Audio Signals for Far-Field Speech Recognition
Figure 3 for Leveraging Redundancy in Multiple Audio Signals for Far-Field Speech Recognition
Figure 4 for Leveraging Redundancy in Multiple Audio Signals for Far-Field Speech Recognition
Viaarxiv icon

A neural prosody encoder for end-ro-end dialogue act classification

May 11, 2022
Kai Wei, Dillon Knox, Martin Radfar, Thanh Tran, Markus Muller, Grant P. Strimel, Nathan Susanj, Athanasios Mouchtaris, Maurizio Omologo

Figure 1 for A neural prosody encoder for end-ro-end dialogue act classification
Figure 2 for A neural prosody encoder for end-ro-end dialogue act classification
Figure 3 for A neural prosody encoder for end-ro-end dialogue act classification
Figure 4 for A neural prosody encoder for end-ro-end dialogue act classification
Viaarxiv icon

Context-Aware Transformer Transducer for Speech Recognition

Nov 05, 2021
Feng-Ju Chang, Jing Liu, Martin Radfar, Athanasios Mouchtaris, Maurizio Omologo, Ariya Rastrow, Siegfried Kunzmann

Figure 1 for Context-Aware Transformer Transducer for Speech Recognition
Figure 2 for Context-Aware Transformer Transducer for Speech Recognition
Figure 3 for Context-Aware Transformer Transducer for Speech Recognition
Figure 4 for Context-Aware Transformer Transducer for Speech Recognition
Viaarxiv icon

Multi-Channel Transformer Transducer for Speech Recognition

Aug 30, 2021
Feng-Ju Chang, Martin Radfar, Athanasios Mouchtaris, Maurizio Omologo

Figure 1 for Multi-Channel Transformer Transducer for Speech Recognition
Figure 2 for Multi-Channel Transformer Transducer for Speech Recognition
Figure 3 for Multi-Channel Transformer Transducer for Speech Recognition
Figure 4 for Multi-Channel Transformer Transducer for Speech Recognition
Viaarxiv icon

DiPCo -- Dinner Party Corpus

Sep 30, 2019
Maarten Van Segbroeck, Ahmed Zaid, Ksenia Kutsenko, Cirenia Huerta, Tinh Nguyen, Xuewen Luo, Björn Hoffmeister, Jan Trmal, Maurizio Omologo, Roland Maas

Figure 1 for DiPCo -- Dinner Party Corpus
Figure 2 for DiPCo -- Dinner Party Corpus
Figure 3 for DiPCo -- Dinner Party Corpus
Figure 4 for DiPCo -- Dinner Party Corpus
Viaarxiv icon

Automatic context window composition for distant speech recognition

May 26, 2018
Mirco Ravanelli, Maurizio Omologo

Figure 1 for Automatic context window composition for distant speech recognition
Figure 2 for Automatic context window composition for distant speech recognition
Figure 3 for Automatic context window composition for distant speech recognition
Figure 4 for Automatic context window composition for distant speech recognition
Viaarxiv icon

Light Gated Recurrent Units for Speech Recognition

Mar 26, 2018
Mirco Ravanelli, Philemon Brakel, Maurizio Omologo, Yoshua Bengio

Figure 1 for Light Gated Recurrent Units for Speech Recognition
Figure 2 for Light Gated Recurrent Units for Speech Recognition
Figure 3 for Light Gated Recurrent Units for Speech Recognition
Figure 4 for Light Gated Recurrent Units for Speech Recognition
Viaarxiv icon

Contaminated speech training methods for robust DNN-HMM distant speech recognition

Oct 10, 2017
Mirco Ravanelli, Maurizio Omologo

Figure 1 for Contaminated speech training methods for robust DNN-HMM distant speech recognition
Figure 2 for Contaminated speech training methods for robust DNN-HMM distant speech recognition
Figure 3 for Contaminated speech training methods for robust DNN-HMM distant speech recognition
Figure 4 for Contaminated speech training methods for robust DNN-HMM distant speech recognition
Viaarxiv icon

The DIRHA-English corpus and related tasks for distant-speech recognition in domestic environments

Oct 06, 2017
Mirco Ravanelli, Maurizio Omologo

Figure 1 for The DIRHA-English corpus and related tasks for distant-speech recognition in domestic environments
Figure 2 for The DIRHA-English corpus and related tasks for distant-speech recognition in domestic environments
Figure 3 for The DIRHA-English corpus and related tasks for distant-speech recognition in domestic environments
Figure 4 for The DIRHA-English corpus and related tasks for distant-speech recognition in domestic environments
Viaarxiv icon

Improving speech recognition by revising gated recurrent units

Sep 29, 2017
Mirco Ravanelli, Philemon Brakel, Maurizio Omologo, Yoshua Bengio

Figure 1 for Improving speech recognition by revising gated recurrent units
Figure 2 for Improving speech recognition by revising gated recurrent units
Figure 3 for Improving speech recognition by revising gated recurrent units
Figure 4 for Improving speech recognition by revising gated recurrent units
Viaarxiv icon