Picture for Ao Zhang

Ao Zhang

Physical formula enhanced multi-task learning for pharmacokinetics prediction

Add code
Apr 16, 2024
Viaarxiv icon

ICMC-ASR: The ICASSP 2024 In-Car Multi-Channel Automatic Speech Recognition Challenge

Add code
Jan 07, 2024
Viaarxiv icon

Knowledge Enhanced Conditional Imputation for Healthcare Time-series

Add code
Jan 04, 2024
Viaarxiv icon

U2-KWS: Unified Two-pass Open-vocabulary Keyword Spotting with Keyword Bias

Add code
Dec 15, 2023
Figure 1 for U2-KWS: Unified Two-pass Open-vocabulary Keyword Spotting with Keyword Bias
Figure 2 for U2-KWS: Unified Two-pass Open-vocabulary Keyword Spotting with Keyword Bias
Figure 3 for U2-KWS: Unified Two-pass Open-vocabulary Keyword Spotting with Keyword Bias
Figure 4 for U2-KWS: Unified Two-pass Open-vocabulary Keyword Spotting with Keyword Bias
Viaarxiv icon

NExT-Chat: An LMM for Chat, Detection and Segmentation

Add code
Nov 13, 2023
Figure 1 for NExT-Chat: An LMM for Chat, Detection and Segmentation
Figure 2 for NExT-Chat: An LMM for Chat, Detection and Segmentation
Figure 3 for NExT-Chat: An LMM for Chat, Detection and Segmentation
Figure 4 for NExT-Chat: An LMM for Chat, Detection and Segmentation
Viaarxiv icon

Spike-Triggered Contextual Biasing for End-to-End Mandarin Speech Recognition

Add code
Oct 07, 2023
Viaarxiv icon

Adaptive Contextual Biasing for Transducer Based Streaming Speech Recognition

Add code
Jun 01, 2023
Figure 1 for Adaptive Contextual Biasing for Transducer Based Streaming Speech Recognition
Figure 2 for Adaptive Contextual Biasing for Transducer Based Streaming Speech Recognition
Figure 3 for Adaptive Contextual Biasing for Transducer Based Streaming Speech Recognition
Figure 4 for Adaptive Contextual Biasing for Transducer Based Streaming Speech Recognition
Viaarxiv icon

Contextualized End-to-End Speech Recognition with Contextual Phrase Prediction Network

Add code
May 21, 2023
Figure 1 for Contextualized End-to-End Speech Recognition with Contextual Phrase Prediction Network
Figure 2 for Contextualized End-to-End Speech Recognition with Contextual Phrase Prediction Network
Figure 3 for Contextualized End-to-End Speech Recognition with Contextual Phrase Prediction Network
Figure 4 for Contextualized End-to-End Speech Recognition with Contextual Phrase Prediction Network
Viaarxiv icon

Transfer Visual Prompt Generator across LLMs

Add code
May 02, 2023
Figure 1 for Transfer Visual Prompt Generator across LLMs
Figure 2 for Transfer Visual Prompt Generator across LLMs
Figure 3 for Transfer Visual Prompt Generator across LLMs
Figure 4 for Transfer Visual Prompt Generator across LLMs
Viaarxiv icon

VE-KWS: Visual Modality Enhanced End-to-End Keyword Spotting

Add code
Mar 14, 2023
Figure 1 for VE-KWS: Visual Modality Enhanced End-to-End Keyword Spotting
Figure 2 for VE-KWS: Visual Modality Enhanced End-to-End Keyword Spotting
Figure 3 for VE-KWS: Visual Modality Enhanced End-to-End Keyword Spotting
Figure 4 for VE-KWS: Visual Modality Enhanced End-to-End Keyword Spotting
Viaarxiv icon