Picture for Xuankai Chang

Xuankai Chang

Exploring the Integration of Speech Separation and Recognition with Self-Supervised Learning Representation

Add code
Jul 23, 2023
Figure 1 for Exploring the Integration of Speech Separation and Recognition with Self-Supervised Learning Representation
Figure 2 for Exploring the Integration of Speech Separation and Recognition with Self-Supervised Learning Representation
Figure 3 for Exploring the Integration of Speech Separation and Recognition with Self-Supervised Learning Representation
Viaarxiv icon

The CHiME-7 DASR Challenge: Distant Meeting Transcription with Multiple Devices in Diverse Scenarios

Add code
Jul 14, 2023
Figure 1 for The CHiME-7 DASR Challenge: Distant Meeting Transcription with Multiple Devices in Diverse Scenarios
Figure 2 for The CHiME-7 DASR Challenge: Distant Meeting Transcription with Multiple Devices in Diverse Scenarios
Figure 3 for The CHiME-7 DASR Challenge: Distant Meeting Transcription with Multiple Devices in Diverse Scenarios
Figure 4 for The CHiME-7 DASR Challenge: Distant Meeting Transcription with Multiple Devices in Diverse Scenarios
Viaarxiv icon

Reducing Barriers to Self-Supervised Learning: HuBERT Pre-training with Academic Compute

Add code
Jun 11, 2023
Figure 1 for Reducing Barriers to Self-Supervised Learning: HuBERT Pre-training with Academic Compute
Figure 2 for Reducing Barriers to Self-Supervised Learning: HuBERT Pre-training with Academic Compute
Figure 3 for Reducing Barriers to Self-Supervised Learning: HuBERT Pre-training with Academic Compute
Figure 4 for Reducing Barriers to Self-Supervised Learning: HuBERT Pre-training with Academic Compute
Viaarxiv icon

Exploration of Efficient End-to-End ASR using Discretized Input from Self-Supervised Learning

Add code
May 29, 2023
Figure 1 for Exploration of Efficient End-to-End ASR using Discretized Input from Self-Supervised Learning
Figure 2 for Exploration of Efficient End-to-End ASR using Discretized Input from Self-Supervised Learning
Figure 3 for Exploration of Efficient End-to-End ASR using Discretized Input from Self-Supervised Learning
Figure 4 for Exploration of Efficient End-to-End ASR using Discretized Input from Self-Supervised Learning
Viaarxiv icon

A New Benchmark of Aphasia Speech Recognition and Detection Based on E-Branchformer and Multi-task Learning

Add code
May 19, 2023
Figure 1 for A New Benchmark of Aphasia Speech Recognition and Detection Based on E-Branchformer and Multi-task Learning
Figure 2 for A New Benchmark of Aphasia Speech Recognition and Detection Based on E-Branchformer and Multi-task Learning
Figure 3 for A New Benchmark of Aphasia Speech Recognition and Detection Based on E-Branchformer and Multi-task Learning
Figure 4 for A New Benchmark of Aphasia Speech Recognition and Detection Based on E-Branchformer and Multi-task Learning
Viaarxiv icon

ML-SUPERB: Multilingual Speech Universal PERformance Benchmark

Add code
May 18, 2023
Figure 1 for ML-SUPERB: Multilingual Speech Universal PERformance Benchmark
Figure 2 for ML-SUPERB: Multilingual Speech Universal PERformance Benchmark
Viaarxiv icon

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

Add code
Apr 25, 2023
Figure 1 for AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
Figure 2 for AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
Figure 3 for AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
Figure 4 for AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
Viaarxiv icon

Improving Perceptual Quality, Intelligibility, and Acoustics on VoIP Platforms

Add code
Mar 16, 2023
Figure 1 for Improving Perceptual Quality, Intelligibility, and Acoustics on VoIP Platforms
Figure 2 for Improving Perceptual Quality, Intelligibility, and Acoustics on VoIP Platforms
Figure 3 for Improving Perceptual Quality, Intelligibility, and Acoustics on VoIP Platforms
Figure 4 for Improving Perceptual Quality, Intelligibility, and Acoustics on VoIP Platforms
Viaarxiv icon

A Study on the Integration of Pre-trained SSL, ASR, LM and SLU Models for Spoken Language Understanding

Add code
Nov 10, 2022
Figure 1 for A Study on the Integration of Pre-trained SSL, ASR, LM and SLU Models for Spoken Language Understanding
Figure 2 for A Study on the Integration of Pre-trained SSL, ASR, LM and SLU Models for Spoken Language Understanding
Figure 3 for A Study on the Integration of Pre-trained SSL, ASR, LM and SLU Models for Spoken Language Understanding
Figure 4 for A Study on the Integration of Pre-trained SSL, ASR, LM and SLU Models for Spoken Language Understanding
Viaarxiv icon

End-to-End Integration of Speech Recognition, Dereverberation, Beamforming, and Self-Supervised Learning Representation

Add code
Oct 19, 2022
Figure 1 for End-to-End Integration of Speech Recognition, Dereverberation, Beamforming, and Self-Supervised Learning Representation
Figure 2 for End-to-End Integration of Speech Recognition, Dereverberation, Beamforming, and Self-Supervised Learning Representation
Figure 3 for End-to-End Integration of Speech Recognition, Dereverberation, Beamforming, and Self-Supervised Learning Representation
Figure 4 for End-to-End Integration of Speech Recognition, Dereverberation, Beamforming, and Self-Supervised Learning Representation
Viaarxiv icon