Picture for Shinji Watanabe

Shinji Watanabe

Carnegie Mellon University

Reducing Barriers to Self-Supervised Learning: HuBERT Pre-training with Academic Compute

Add code
Jun 11, 2023
Figure 1 for Reducing Barriers to Self-Supervised Learning: HuBERT Pre-training with Academic Compute
Figure 2 for Reducing Barriers to Self-Supervised Learning: HuBERT Pre-training with Academic Compute
Figure 3 for Reducing Barriers to Self-Supervised Learning: HuBERT Pre-training with Academic Compute
Figure 4 for Reducing Barriers to Self-Supervised Learning: HuBERT Pre-training with Academic Compute
Viaarxiv icon

Tensor decomposition for minimization of E2E SLU model toward on-device processing

Add code
Jun 02, 2023
Figure 1 for Tensor decomposition for minimization of E2E SLU model toward on-device processing
Figure 2 for Tensor decomposition for minimization of E2E SLU model toward on-device processing
Figure 3 for Tensor decomposition for minimization of E2E SLU model toward on-device processing
Figure 4 for Tensor decomposition for minimization of E2E SLU model toward on-device processing
Viaarxiv icon

UNSSOR: Unsupervised Neural Speech Separation by Leveraging Over-determined Training Mixtures

Add code
May 31, 2023
Viaarxiv icon

Exploration of Efficient End-to-End ASR using Discretized Input from Self-Supervised Learning

Add code
May 29, 2023
Figure 1 for Exploration of Efficient End-to-End ASR using Discretized Input from Self-Supervised Learning
Figure 2 for Exploration of Efficient End-to-End ASR using Discretized Input from Self-Supervised Learning
Figure 3 for Exploration of Efficient End-to-End ASR using Discretized Input from Self-Supervised Learning
Figure 4 for Exploration of Efficient End-to-End ASR using Discretized Input from Self-Supervised Learning
Viaarxiv icon

DPHuBERT: Joint Distillation and Pruning of Self-Supervised Speech Models

Add code
May 28, 2023
Viaarxiv icon

A New Benchmark of Aphasia Speech Recognition and Detection Based on E-Branchformer and Multi-task Learning

Add code
May 19, 2023
Figure 1 for A New Benchmark of Aphasia Speech Recognition and Detection Based on E-Branchformer and Multi-task Learning
Figure 2 for A New Benchmark of Aphasia Speech Recognition and Detection Based on E-Branchformer and Multi-task Learning
Figure 3 for A New Benchmark of Aphasia Speech Recognition and Detection Based on E-Branchformer and Multi-task Learning
Figure 4 for A New Benchmark of Aphasia Speech Recognition and Detection Based on E-Branchformer and Multi-task Learning
Viaarxiv icon

Prompting the Hidden Talent of Web-Scale Speech Models for Zero-Shot Task Generalization

Add code
May 18, 2023
Figure 1 for Prompting the Hidden Talent of Web-Scale Speech Models for Zero-Shot Task Generalization
Figure 2 for Prompting the Hidden Talent of Web-Scale Speech Models for Zero-Shot Task Generalization
Figure 3 for Prompting the Hidden Talent of Web-Scale Speech Models for Zero-Shot Task Generalization
Figure 4 for Prompting the Hidden Talent of Web-Scale Speech Models for Zero-Shot Task Generalization
Viaarxiv icon

A Comparative Study on E-Branchformer vs Conformer in Speech Recognition, Translation, and Understanding Tasks

Add code
May 18, 2023
Viaarxiv icon

ML-SUPERB: Multilingual Speech Universal PERformance Benchmark

Add code
May 18, 2023
Figure 1 for ML-SUPERB: Multilingual Speech Universal PERformance Benchmark
Figure 2 for ML-SUPERB: Multilingual Speech Universal PERformance Benchmark
Viaarxiv icon

Improving Cascaded Unsupervised Speech Translation with Denoising Back-translation

Add code
May 12, 2023
Viaarxiv icon