Get our free extension to see links to code for papers anywhere online!

Chrome logo Add to Chrome

Firefox logo Add to Firefox

Picture for Roland Maas

Do You Listen with One or Two Microphones? A Unified ASR Model for Single and Multi-Channel Audio

Jun 28, 2021
Gokce Keskin, Minhua Wu, Brian King, Harish Mallidi, Yang Gao, Jasha Droppo, Ariya Rastrow, Roland Maas

  Access Paper or Ask Questions

SynthASR: Unlocking Synthetic Data for Speech Recognition

Jun 14, 2021
Amin Fazel, Wei Yang, Yulan Liu, Roberto Barra-Chicote, Yixiong Meng, Roland Maas, Jasha Droppo

* Accepted to Interspeech 2021 

  Access Paper or Ask Questions

Attention-based Neural Beamforming Layers for Multi-channel Speech Recognition

May 14, 2021
Bhargav Pulugundla, Yang Gao, Brian King, Gokce Keskin, Harish Mallidi, Minhua Wu, Jasha Droppo, Roland Maas

  Access Paper or Ask Questions

Wav2vec-C: A Self-supervised Model for Speech Representation Learning

Mar 09, 2021
Samik Sadhu, Di He, Che-Wei Huang, Sri Harish Mallidi, Minhua Wu, Ariya Rastrow, Andreas Stolcke, Jasha Droppo, Roland Maas

  Access Paper or Ask Questions

REDAT: Accent-Invariant Representation for End-to-End ASR by Domain Adversarial Training with Relabeling

Dec 14, 2020
Hu Hu, Xuesong Yang, Zeynab Raeesy, Jinxi Guo, Gokce Keskin, Harish Arsikere, Ariya Rastrow, Andreas Stolcke, Roland Maas

* Submitted in ICASSP 2021 

  Access Paper or Ask Questions

Efficient minimum word error rate training of RNN-Transducer for end-to-end speech recognition

Jul 27, 2020
Jinxi Guo, Gautam Tiwari, Jasha Droppo, Maarten Van Segbroeck, Che-Wei Huang, Andreas Stolcke, Roland Maas

* Accepted to Interspeech 2020 

  Access Paper or Ask Questions

Streaming End-to-End Bilingual ASR Systems with Joint Language Identification

Jul 08, 2020
Surabhi Punjabi, Harish Arsikere, Zeynab Raeesy, Chander Chandak, Nikhil Bhave, Ankish Bansal, Markus Müller, Sergio Murillo, Ariya Rastrow, Sri Garimella, Roland Maas, Mat Hans, Athanasios Mouchtaris, Siegfried Kunzmann

  Access Paper or Ask Questions

Multi-view Frequency LSTM: An Efficient Frontend for Automatic Speech Recognition

Jun 30, 2020
Maarten Van Segbroeck, Harish Mallidih, Brian King, I-Fan Chen, Gurpreet Chadha, Roland Maas

  Access Paper or Ask Questions

Streaming Language Identification using Combination of Acoustic Representations and ASR Hypotheses

Jun 01, 2020
Chander Chandak, Zeynab Raeesy, Ariya Rastrow, Yuzong Liu, Xiangyang Huang, Siyu Wang, Dong Kwon Joo, Roland Maas

* 5 pages, 2 figures 

  Access Paper or Ask Questions

DiPCo -- Dinner Party Corpus

Sep 30, 2019
Maarten Van Segbroeck, Ahmed Zaid, Ksenia Kutsenko, Cirenia Huerta, Tinh Nguyen, Xuewen Luo, Björn Hoffmeister, Jan Trmal, Maurizio Omologo, Roland Maas

  Access Paper or Ask Questions

Improving noise robustness of automatic speech recognition via parallel data and teacher-student learning

Jan 11, 2019
Ladislav Mošner, Minhua Wu, Anirudh Raju, Sree Hari Krishnan Parthasarathi, Kenichi Kumatani, Shiva Sundaram, Roland Maas, Björn Hoffmeister

  Access Paper or Ask Questions

LSTM-based Whisper Detection

Sep 20, 2018
Zeynab Raeesy, Kellen Gillespie, Chengyuan Ma, Thomas Drugman, Jiacheng Gu, Roland Maas, Ariya Rastrow, Björn Hoffmeister

  Access Paper or Ask Questions

Device-directed Utterance Detection

Aug 07, 2018
Sri Harish Mallidi, Roland Maas, Kyle Goehner, Ariya Rastrow, Spyros Matsoukas, Björn Hoffmeister

* Interspeech 2018 (accepted) 

  Access Paper or Ask Questions

Estimating parameters of nonlinear systems using the elitist particle filter based on evolutionary strategies

May 25, 2016
Christian Huemmer, Christian Hofmann, Roland Maas, Walter Kellermann

* 13 pages, 13 figures 

  Access Paper or Ask Questions

Spatial Diffuseness Features for DNN-Based Speech Recognition in Noisy and Reverberant Environments

Feb 16, 2015
Andreas Schwarz, Christian Huemmer, Roland Maas, Walter Kellermann

* accepted for ICASSP2015 

  Access Paper or Ask Questions

The NLMS algorithm with time-variant optimum stepsize derived from a Bayesian network perspective

Nov 18, 2014
Christian Huemmer, Roland Maas, Walter Kellermann

* 4 pages, 1 page of references 

  Access Paper or Ask Questions

A Bayesian Network View on Acoustic Model-Based Techniques for Robust Speech Recognition

Sep 22, 2014
Roland Maas, Christian Huemmer, Armin Sehr, Walter Kellermann

  Access Paper or Ask Questions