Picture for Ke Hu

Ke Hu

Diffusion-based Reinforcement Learning via Q-weighted Variational Policy Optimization

Add code
May 25, 2024
Figure 1 for Diffusion-based Reinforcement Learning via Q-weighted Variational Policy Optimization
Figure 2 for Diffusion-based Reinforcement Learning via Q-weighted Variational Policy Optimization
Figure 3 for Diffusion-based Reinforcement Learning via Q-weighted Variational Policy Optimization
Figure 4 for Diffusion-based Reinforcement Learning via Q-weighted Variational Policy Optimization
Viaarxiv icon

Enhancing Visual Continual Learning with Language-Guided Supervision

Add code
Mar 24, 2024
Figure 1 for Enhancing Visual Continual Learning with Language-Guided Supervision
Figure 2 for Enhancing Visual Continual Learning with Language-Guided Supervision
Figure 3 for Enhancing Visual Continual Learning with Language-Guided Supervision
Figure 4 for Enhancing Visual Continual Learning with Language-Guided Supervision
Viaarxiv icon

Multilingual and Fully Non-Autoregressive ASR with Large Language Model Fusion: A Comprehensive Study

Add code
Jan 23, 2024
Viaarxiv icon

Feature Norm Regularized Federated Learning: Transforming Skewed Distributions into Global Insights

Add code
Dec 12, 2023
Viaarxiv icon

Improving Joint Speech-Text Representations Without Alignment

Add code
Aug 11, 2023
Figure 1 for Improving Joint Speech-Text Representations Without Alignment
Figure 2 for Improving Joint Speech-Text Representations Without Alignment
Figure 3 for Improving Joint Speech-Text Representations Without Alignment
Figure 4 for Improving Joint Speech-Text Representations Without Alignment
Viaarxiv icon

Mixture-of-Expert Conformer for Streaming Multilingual ASR

Add code
May 25, 2023
Figure 1 for Mixture-of-Expert Conformer for Streaming Multilingual ASR
Figure 2 for Mixture-of-Expert Conformer for Streaming Multilingual ASR
Figure 3 for Mixture-of-Expert Conformer for Streaming Multilingual ASR
Figure 4 for Mixture-of-Expert Conformer for Streaming Multilingual ASR
Viaarxiv icon

A Deliberation-based Joint Acoustic and Text Decoder

Add code
Mar 23, 2023
Figure 1 for A Deliberation-based Joint Acoustic and Text Decoder
Figure 2 for A Deliberation-based Joint Acoustic and Text Decoder
Figure 3 for A Deliberation-based Joint Acoustic and Text Decoder
Figure 4 for A Deliberation-based Joint Acoustic and Text Decoder
Viaarxiv icon

Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages

Add code
Mar 03, 2023
Figure 1 for Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages
Figure 2 for Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages
Figure 3 for Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages
Figure 4 for Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages
Viaarxiv icon

Massively Multilingual Shallow Fusion with Large Language Models

Add code
Feb 17, 2023
Figure 1 for Massively Multilingual Shallow Fusion with Large Language Models
Figure 2 for Massively Multilingual Shallow Fusion with Large Language Models
Figure 3 for Massively Multilingual Shallow Fusion with Large Language Models
Figure 4 for Massively Multilingual Shallow Fusion with Large Language Models
Viaarxiv icon

Scaling Up Deliberation for Multilingual ASR

Add code
Oct 11, 2022
Figure 1 for Scaling Up Deliberation for Multilingual ASR
Figure 2 for Scaling Up Deliberation for Multilingual ASR
Figure 3 for Scaling Up Deliberation for Multilingual ASR
Figure 4 for Scaling Up Deliberation for Multilingual ASR
Viaarxiv icon