Alert button
Picture for Andreas Stolcke

Andreas Stolcke

Alert button

Fusion of Embeddings Networks for Robust Combination of Text Dependent and Independent Speaker Recognition

Add code
Bookmark button
Alert button
Jun 18, 2021
Ruirui Li, Chelsea J. -T. Ju, Zeya Chen, Hongda Mao, Oguz Elibol, Andreas Stolcke

Figure 1 for Fusion of Embeddings Networks for Robust Combination of Text Dependent and Independent Speaker Recognition
Figure 2 for Fusion of Embeddings Networks for Robust Combination of Text Dependent and Independent Speaker Recognition
Figure 3 for Fusion of Embeddings Networks for Robust Combination of Text Dependent and Independent Speaker Recognition
Viaarxiv icon

Graph-based Label Propagation for Semi-Supervised Speaker Identification

Add code
Bookmark button
Alert button
Jun 15, 2021
Long Chen, Venkatesh Ravichandran, Andreas Stolcke

Figure 1 for Graph-based Label Propagation for Semi-Supervised Speaker Identification
Figure 2 for Graph-based Label Propagation for Semi-Supervised Speaker Identification
Figure 3 for Graph-based Label Propagation for Semi-Supervised Speaker Identification
Figure 4 for Graph-based Label Propagation for Semi-Supervised Speaker Identification
Viaarxiv icon

End-to-end Neural Diarization: From Transformer to Conformer

Add code
Bookmark button
Alert button
Jun 14, 2021
Yi Chieh Liu, Eunjung Han, Chul Lee, Andreas Stolcke

Figure 1 for End-to-end Neural Diarization: From Transformer to Conformer
Figure 2 for End-to-end Neural Diarization: From Transformer to Conformer
Figure 3 for End-to-end Neural Diarization: From Transformer to Conformer
Figure 4 for End-to-end Neural Diarization: From Transformer to Conformer
Viaarxiv icon

Attention-based Contextual Language Model Adaptation for Speech Recognition

Add code
Bookmark button
Alert button
Jun 02, 2021
Richard Diehl Martinez, Scott Novotney, Ivan Bulyko, Ariya Rastrow, Andreas Stolcke, Ankur Gandhe

Figure 1 for Attention-based Contextual Language Model Adaptation for Speech Recognition
Figure 2 for Attention-based Contextual Language Model Adaptation for Speech Recognition
Figure 3 for Attention-based Contextual Language Model Adaptation for Speech Recognition
Figure 4 for Attention-based Contextual Language Model Adaptation for Speech Recognition
Viaarxiv icon

Listen with Intent: Improving Speech Recognition with Audio-to-Intent Front-End

Add code
Bookmark button
Alert button
May 14, 2021
Swayambhu Nath Ray, Minhua Wu, Anirudh Raju, Pegah Ghahremani, Raghavendra Bilgi, Milind Rao, Harish Arsikere, Ariya Rastrow, Andreas Stolcke, Jasha Droppo

Figure 1 for Listen with Intent: Improving Speech Recognition with Audio-to-Intent Front-End
Figure 2 for Listen with Intent: Improving Speech Recognition with Audio-to-Intent Front-End
Figure 3 for Listen with Intent: Improving Speech Recognition with Audio-to-Intent Front-End
Figure 4 for Listen with Intent: Improving Speech Recognition with Audio-to-Intent Front-End
Viaarxiv icon

Reranking Machine Translation Hypotheses with Structured and Web-based Language Models

Add code
Bookmark button
Alert button
Apr 25, 2021
Wen Wang, Andreas Stolcke, Jing Zheng

Figure 1 for Reranking Machine Translation Hypotheses with Structured and Web-based Language Models
Figure 2 for Reranking Machine Translation Hypotheses with Structured and Web-based Language Models
Figure 3 for Reranking Machine Translation Hypotheses with Structured and Web-based Language Models
Figure 4 for Reranking Machine Translation Hypotheses with Structured and Web-based Language Models
Viaarxiv icon

Wav2vec-C: A Self-supervised Model for Speech Representation Learning

Add code
Bookmark button
Alert button
Mar 09, 2021
Samik Sadhu, Di He, Che-Wei Huang, Sri Harish Mallidi, Minhua Wu, Ariya Rastrow, Andreas Stolcke, Jasha Droppo, Roland Maas

Figure 1 for Wav2vec-C: A Self-supervised Model for Speech Representation Learning
Figure 2 for Wav2vec-C: A Self-supervised Model for Speech Representation Learning
Figure 3 for Wav2vec-C: A Self-supervised Model for Speech Representation Learning
Figure 4 for Wav2vec-C: A Self-supervised Model for Speech Representation Learning
Viaarxiv icon

Personalization Strategies for End-to-End Speech Recognition Systems

Add code
Bookmark button
Alert button
Feb 15, 2021
Aditya Gourav, Linda Liu, Ankur Gandhe, Yile Gu, Guitang Lan, Xiangyang Huang, Shashank Kalmane, Gautam Tiwari, Denis Filimonov, Ariya Rastrow, Andreas Stolcke, Ivan Bulyko

Figure 1 for Personalization Strategies for End-to-End Speech Recognition Systems
Figure 2 for Personalization Strategies for End-to-End Speech Recognition Systems
Figure 3 for Personalization Strategies for End-to-End Speech Recognition Systems
Figure 4 for Personalization Strategies for End-to-End Speech Recognition Systems
Viaarxiv icon

Do as I mean, not as I say: Sequence Loss Training for Spoken Language Understanding

Add code
Bookmark button
Alert button
Feb 12, 2021
Milind Rao, Pranav Dheram, Gautam Tiwari, Anirudh Raju, Jasha Droppo, Ariya Rastrow, Andreas Stolcke

Figure 1 for Do as I mean, not as I say: Sequence Loss Training for Spoken Language Understanding
Figure 2 for Do as I mean, not as I say: Sequence Loss Training for Spoken Language Understanding
Figure 3 for Do as I mean, not as I say: Sequence Loss Training for Spoken Language Understanding
Viaarxiv icon

Contrastive Unsupervised Learning for Speech Emotion Recognition

Add code
Bookmark button
Alert button
Feb 12, 2021
Mao Li, Bo Yang, Joshua Levy, Andreas Stolcke, Viktor Rozgic, Spyros Matsoukas, Constantinos Papayiannis, Daniel Bone, Chao Wang

Figure 1 for Contrastive Unsupervised Learning for Speech Emotion Recognition
Figure 2 for Contrastive Unsupervised Learning for Speech Emotion Recognition
Viaarxiv icon