Ke Hu

Enhancing Visual Continual Learning with Language-Guided Supervision

Mar 24, 2024
Via arXiv

Multilingual and Fully Non-Autoregressive ASR with Large Language Model Fusion: A Comprehensive Study

Jan 23, 2024
Via arXiv

Feature Norm Regularized Federated Learning: Transforming Skewed Distributions into Global Insights

Dec 12, 2023
Via arXiv

Improving Joint Speech-Text Representations Without Alignment

Aug 11, 2023
Via arXiv

Mixture-of-Expert Conformer for Streaming Multilingual ASR

May 25, 2023
Via arXiv

A Deliberation-based Joint Acoustic and Text Decoder

Mar 23, 2023
Via arXiv

Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages

Mar 03, 2023
Via arXiv

Massively Multilingual Shallow Fusion with Large Language Models

Feb 17, 2023
Via arXiv

Scaling Up Deliberation for Multilingual ASR

Oct 11, 2022
Via arXiv

Improving Deliberation by Text-Only and Semi-Supervised Training

Jun 29, 2022
Via arXiv