Roland Maas

Do You Listen with One or Two Microphones? A Unified ASR Model for Single and Multi-Channel Audio


Jun 28, 2021
Gokce Keskin, Minhua Wu, Brian King, Harish Mallidi, Yang Gao, Jasha Droppo, Ariya Rastrow, Roland Maas


SynthASR: Unlocking Synthetic Data for Speech Recognition


Jun 14, 2021
Amin Fazel, Wei Yang, Yulan Liu, Roberto Barra-Chicote, Yixiong Meng, Roland Maas, Jasha Droppo

* Accepted to Interspeech 2021 

Attention-based Neural Beamforming Layers for Multi-channel Speech Recognition


May 14, 2021
Bhargav Pulugundla, Yang Gao, Brian King, Gokce Keskin, Harish Mallidi, Minhua Wu, Jasha Droppo, Roland Maas


Wav2vec-C: A Self-supervised Model for Speech Representation Learning


Mar 09, 2021
Samik Sadhu, Di He, Che-Wei Huang, Sri Harish Mallidi, Minhua Wu, Ariya Rastrow, Andreas Stolcke, Jasha Droppo, Roland Maas


REDAT: Accent-Invariant Representation for End-to-End ASR by Domain Adversarial Training with Relabeling


Dec 14, 2020
Hu Hu, Xuesong Yang, Zeynab Raeesy, Jinxi Guo, Gokce Keskin, Harish Arsikere, Ariya Rastrow, Andreas Stolcke, Roland Maas

* Submitted to ICASSP 2021

Efficient minimum word error rate training of RNN-Transducer for end-to-end speech recognition


Jul 27, 2020
Jinxi Guo, Gautam Tiwari, Jasha Droppo, Maarten Van Segbroeck, Che-Wei Huang, Andreas Stolcke, Roland Maas

* Accepted to Interspeech 2020 

Streaming End-to-End Bilingual ASR Systems with Joint Language Identification


Jul 08, 2020
Surabhi Punjabi, Harish Arsikere, Zeynab Raeesy, Chander Chandak, Nikhil Bhave, Ankish Bansal, Markus Müller, Sergio Murillo, Ariya Rastrow, Sri Garimella, Roland Maas, Mat Hans, Athanasios Mouchtaris, Siegfried Kunzmann


Multi-view Frequency LSTM: An Efficient Frontend for Automatic Speech Recognition


Jun 30, 2020
Maarten Van Segbroeck, Harish Mallidi, Brian King, I-Fan Chen, Gurpreet Chadha, Roland Maas


Streaming Language Identification using Combination of Acoustic Representations and ASR Hypotheses


Jun 01, 2020
Chander Chandak, Zeynab Raeesy, Ariya Rastrow, Yuzong Liu, Xiangyang Huang, Siyu Wang, Dong Kwon Joo, Roland Maas

* 5 pages, 2 figures 

DiPCo -- Dinner Party Corpus


Sep 30, 2019
Maarten Van Segbroeck, Ahmed Zaid, Ksenia Kutsenko, Cirenia Huerta, Tinh Nguyen, Xuewen Luo, Björn Hoffmeister, Jan Trmal, Maurizio Omologo, Roland Maas


Improving noise robustness of automatic speech recognition via parallel data and teacher-student learning


Jan 11, 2019
Ladislav Mošner, Minhua Wu, Anirudh Raju, Sree Hari Krishnan Parthasarathi, Kenichi Kumatani, Shiva Sundaram, Roland Maas, Björn Hoffmeister


LSTM-based Whisper Detection


Sep 20, 2018
Zeynab Raeesy, Kellen Gillespie, Chengyuan Ma, Thomas Drugman, Jiacheng Gu, Roland Maas, Ariya Rastrow, Björn Hoffmeister


Device-directed Utterance Detection


Aug 07, 2018
Sri Harish Mallidi, Roland Maas, Kyle Goehner, Ariya Rastrow, Spyros Matsoukas, Björn Hoffmeister

* Interspeech 2018 (accepted) 

Estimating parameters of nonlinear systems using the elitist particle filter based on evolutionary strategies


May 25, 2016
Christian Huemmer, Christian Hofmann, Roland Maas, Walter Kellermann

* 13 pages, 13 figures 

Spatial Diffuseness Features for DNN-Based Speech Recognition in Noisy and Reverberant Environments


Feb 16, 2015
Andreas Schwarz, Christian Huemmer, Roland Maas, Walter Kellermann

* Accepted for ICASSP 2015

The NLMS algorithm with time-variant optimum stepsize derived from a Bayesian network perspective


Nov 18, 2014
Christian Huemmer, Roland Maas, Walter Kellermann

* 4 pages, 1 page of references 

A Bayesian Network View on Acoustic Model-Based Techniques for Robust Speech Recognition


Sep 22, 2014
Roland Maas, Christian Huemmer, Armin Sehr, Walter Kellermann

