Yuzong Liu

Multi-modal Adversarial Training for Zero-Shot Voice Cloning

Aug 28, 2024
Via arXiv

On-Device Constrained Self-Supervised Speech Representation Learning for Keyword Spotting via Knowledge Distillation

Jul 06, 2023
Via arXiv

Small-footprint slimmable networks for keyword spotting

Apr 21, 2023
Via arXiv

Self-supervised speech representation learning for keyword-spotting with light-weight transformers

Mar 07, 2023
Via arXiv

Fixed-point quantization aware training for on-device keyword-spotting

Mar 04, 2023
Via arXiv

Sub 8-Bit Quantization of Streaming Keyword Spotting Models for Embedded Chipsets

Jul 13, 2022
Via arXiv

DeCoAR 2.0: Deep Contextualized Acoustic Representations with Vector Quantization

Dec 11, 2020
Via arXiv

Transformer-Transducers for Code-Switched Speech Recognition

Nov 30, 2020
Via arXiv

Streaming Language Identification using Combination of Acoustic Representations and ASR Hypotheses

Jun 01, 2020
Via arXiv

Deep Contextualized Acoustic Representations For Semi-Supervised Speech Recognition

Dec 03, 2019
Via arXiv