Xin Lei

Query-by-Example Keyword Spotting Using Spectral-Temporal Graph Attentive Pooling and Multi-Task Learning

Aug 27, 2024

Disentangled Training with Adversarial Examples For Robust Small-footprint Keyword Spotting

Aug 23, 2024

LLaMA based Punctuation Restoration With Forward Pass Only Decoding

Aug 09, 2024

FADI-AEC: Fast Score Based Diffusion Model Guided by Far-end Signal for Acoustic Echo Cancellation

Jan 08, 2024

Directional Source Separation for Robust Speech Recognition on Smart Glasses

Sep 20, 2023

TODM: Train Once Deploy Many Efficient Supernet-Based RNN-T Compression For On-device ASR Models

Sep 05, 2023

LiCo-Net: Linearized Convolution Network for Hardware-efficient Keyword Spotting

Nov 09, 2022

SCA: Streaming Cross-attention Alignment for Echo Cancellation

Nov 01, 2022

U2++: Unified Two-pass Bidirectional End-to-end Model for Speech Recognition

Jul 07, 2021

WeNet: Production First and Production Ready End-to-End Speech Recognition Toolkit

Feb 02, 2021