Picture for Nan Yan

Nan Yan

Perceiver-Prompt: Flexible Speaker Adaptation in Whisper for Chinese Disordered Speech Recognition

Add code
Jun 14, 2024
Figure 1 for Perceiver-Prompt: Flexible Speaker Adaptation in Whisper for Chinese Disordered Speech Recognition
Figure 2 for Perceiver-Prompt: Flexible Speaker Adaptation in Whisper for Chinese Disordered Speech Recognition
Figure 3 for Perceiver-Prompt: Flexible Speaker Adaptation in Whisper for Chinese Disordered Speech Recognition
Figure 4 for Perceiver-Prompt: Flexible Speaker Adaptation in Whisper for Chinese Disordered Speech Recognition
Viaarxiv icon

Automatic Assessment of Dysarthria Using Audio-visual Vowel Graph Attention Network

Add code
May 07, 2024
Viaarxiv icon

An Audio-textual Diffusion Model For Converting Speech Signals Into Ultrasound Tongue Imaging Data

Add code
Mar 12, 2024
Figure 1 for An Audio-textual Diffusion Model For Converting Speech Signals Into Ultrasound Tongue Imaging Data
Figure 2 for An Audio-textual Diffusion Model For Converting Speech Signals Into Ultrasound Tongue Imaging Data
Figure 3 for An Audio-textual Diffusion Model For Converting Speech Signals Into Ultrasound Tongue Imaging Data
Figure 4 for An Audio-textual Diffusion Model For Converting Speech Signals Into Ultrasound Tongue Imaging Data
Viaarxiv icon

Enhanced Memory Network: The novel network structure for Symbolic Music Generation

Add code
Oct 07, 2021
Figure 1 for Enhanced Memory Network: The novel network structure for Symbolic Music Generation
Figure 2 for Enhanced Memory Network: The novel network structure for Symbolic Music Generation
Figure 3 for Enhanced Memory Network: The novel network structure for Symbolic Music Generation
Figure 4 for Enhanced Memory Network: The novel network structure for Symbolic Music Generation
Viaarxiv icon

Unsupervised Cross-Lingual Speech Emotion Recognition Using Pseudo Multilabel

Add code
Aug 19, 2021
Figure 1 for Unsupervised Cross-Lingual Speech Emotion Recognition Using Pseudo Multilabel
Figure 2 for Unsupervised Cross-Lingual Speech Emotion Recognition Using Pseudo Multilabel
Figure 3 for Unsupervised Cross-Lingual Speech Emotion Recognition Using Pseudo Multilabel
Figure 4 for Unsupervised Cross-Lingual Speech Emotion Recognition Using Pseudo Multilabel
Viaarxiv icon

Two Streams and Two Resolution Spectrograms Model for End-to-end Automatic Speech Recognition

Add code
Aug 18, 2021
Figure 1 for Two Streams and Two Resolution Spectrograms Model for End-to-end Automatic Speech Recognition
Figure 2 for Two Streams and Two Resolution Spectrograms Model for End-to-end Automatic Speech Recognition
Figure 3 for Two Streams and Two Resolution Spectrograms Model for End-to-end Automatic Speech Recognition
Figure 4 for Two Streams and Two Resolution Spectrograms Model for End-to-end Automatic Speech Recognition
Viaarxiv icon

FDN: Finite Difference Network with Hierachical Convolutional Features for Text-independent Speaker verification

Add code
Aug 18, 2021
Figure 1 for FDN: Finite Difference Network with Hierachical Convolutional Features for Text-independent Speaker verification
Figure 2 for FDN: Finite Difference Network with Hierachical Convolutional Features for Text-independent Speaker verification
Figure 3 for FDN: Finite Difference Network with Hierachical Convolutional Features for Text-independent Speaker verification
Figure 4 for FDN: Finite Difference Network with Hierachical Convolutional Features for Text-independent Speaker verification
Viaarxiv icon

An Intelligent Control Strategy for buck DC-DC Converter via Deep Reinforcement Learning

Add code
Aug 11, 2020
Figure 1 for An Intelligent Control Strategy for buck DC-DC Converter via Deep Reinforcement Learning
Figure 2 for An Intelligent Control Strategy for buck DC-DC Converter via Deep Reinforcement Learning
Figure 3 for An Intelligent Control Strategy for buck DC-DC Converter via Deep Reinforcement Learning
Figure 4 for An Intelligent Control Strategy for buck DC-DC Converter via Deep Reinforcement Learning
Viaarxiv icon