Picture for Nanxin Chen

Nanxin Chen

Parameter-Efficient Transfer Learning under Federated Learning for Automatic Speech Recognition

Add code
Aug 19, 2024
Viaarxiv icon

Text Injection for Neural Contextual Biasing

Add code
Jun 05, 2024
Figure 1 for Text Injection for Neural Contextual Biasing
Figure 2 for Text Injection for Neural Contextual Biasing
Figure 3 for Text Injection for Neural Contextual Biasing
Figure 4 for Text Injection for Neural Contextual Biasing
Viaarxiv icon

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Add code
Mar 08, 2024
Viaarxiv icon

Gemini: A Family of Highly Capable Multimodal Models

Add code
Dec 19, 2023
Viaarxiv icon

E3 TTS: Easy End-to-End Diffusion-based Text to Speech

Add code
Nov 02, 2023
Viaarxiv icon

SLM: Bridge the thin gap between speech and text foundation models

Add code
Sep 30, 2023
Figure 1 for SLM: Bridge the thin gap between speech and text foundation models
Figure 2 for SLM: Bridge the thin gap between speech and text foundation models
Figure 3 for SLM: Bridge the thin gap between speech and text foundation models
Figure 4 for SLM: Bridge the thin gap between speech and text foundation models
Viaarxiv icon

Efficient Adapters for Giant Speech Models

Add code
Jun 13, 2023
Figure 1 for Efficient Adapters for Giant Speech Models
Figure 2 for Efficient Adapters for Giant Speech Models
Figure 3 for Efficient Adapters for Giant Speech Models
Figure 4 for Efficient Adapters for Giant Speech Models
Viaarxiv icon

How to Estimate Model Transferability of Pre-Trained Speech Models?

Add code
Jun 01, 2023
Viaarxiv icon

Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages

Add code
Mar 03, 2023
Viaarxiv icon

Noise2Music: Text-conditioned Music Generation with Diffusion Models

Add code
Feb 08, 2023
Figure 1 for Noise2Music: Text-conditioned Music Generation with Diffusion Models
Figure 2 for Noise2Music: Text-conditioned Music Generation with Diffusion Models
Figure 3 for Noise2Music: Text-conditioned Music Generation with Diffusion Models
Figure 4 for Noise2Music: Text-conditioned Music Generation with Diffusion Models
Viaarxiv icon