"speech recognition": models, code, and papers

General-Purpose Speech Representation Learning through a Self-Supervised Multi-Granularity Framework

Feb 03, 2021
Yucheng Zhao, Dacheng Yin, Chong Luo, Zhiyuan Zhao, Chuanxin Tang, Wenjun Zeng, Zheng-Jun Zha

Via arXiv

Data Fusion for Audiovisual Speaker Localization: Extending Dynamic Stream Weights to the Spatial Domain

Feb 23, 2021
Julio Wissing, Benedikt Boenninghoff, Dorothea Kolossa, Tsubasa Ochiai, Marc Delcroix, Keisuke Kinoshita, Tomohiro Nakatani, Shoko Araki, Christopher Schymura

Via arXiv

Overcoming Domain Mismatch in Low Resource Sequence-to-Sequence ASR Models using Hybrid Generated Pseudotranscripts

Jun 14, 2021
Chak-Fai Li, Francis Keith, William Hartmann, Matthew Snover, Owen Kimball

Via arXiv

Uncertainty-guided Model Generalization to Unseen Domains

Mar 12, 2021
Fengchun Qiao, Xi Peng

Via arXiv

Improving N-gram Language Models with Pre-trained Deep Transformer

Nov 22, 2019
Yiren Wang, Hongzhao Huang, Zhe Liu, Yutong Pang, Yongqiang Wang, ChengXiang Zhai, Fuchun Peng

Via arXiv

A Method to Reveal Speaker Identity in Distributed ASR Training, and How to Counter It

Apr 15, 2021
Trung Dang, Om Thakkar, Swaroop Ramaswamy, Rajiv Mathews, Peter Chin, Françoise Beaufays

Via arXiv

Sentence Boundary Augmentation For Neural Machine Translation Robustness

Oct 21, 2020
Daniel Li, Te I, Naveen Arivazhagan, Colin Cherry, Dirk Padfield

Via arXiv

End-to-end training of time domain audio separation and recognition

Dec 25, 2019
Thilo von Neumann, Keisuke Kinoshita, Lukas Drude, Christoph Boeddeker, Marc Delcroix, Tomohiro Nakatani, Reinhold Haeb-Umbach

Via arXiv

Large-scale Transfer Learning for Low-resource Spoken Language Understanding

Aug 13, 2020
Xueli Jia, Jianzong Wang, Zhiyong Zhang, Ning Cheng, Jing Xiao

Via arXiv

Using Topological Framework for the Design of Activation Function and Model Pruning in Deep Neural Networks

Sep 03, 2021
Yogesh Kochar, Sunil Kumar Vengalil, Neelam Sinha

Via arXiv