Li-Rong Dai

Learning Contextually Fused Audio-visual Representations for Audio-visual Speech Recognition

Feb 15, 2022
Via arXiv

Supervised and Self-supervised Pretraining Based COVID-19 Detection Using Acoustic Breathing/Cough/Speech Signals

Jan 22, 2022
Via arXiv

A Noise-Robust Self-supervised Pre-training Model Based Speech Representation Learning for Automatic Speech Recognition

Jan 22, 2022
Via arXiv

XLST: Cross-lingual Self-training to Learn Multilingual Representation for Low Resource Speech Recognition

Mar 15, 2021
Via arXiv

Lip-reading with Hierarchical Pyramidal Convolution and Self-Attention

Dec 28, 2020
Via arXiv

Correlating Subword Articulation with Lip Shapes for Embedding Aware Audio-Visual Speech Enhancement

Sep 21, 2020
Via arXiv

Singing Voice Synthesis Using Deep Autoregressive Neural Networks for Acoustic Modeling

Jun 21, 2019
Via arXiv

Forward Attention in Sequence-to-sequence Acoustic Modelling for Speech Synthesis

Jul 18, 2018
Via arXiv