Alert button
Picture for Li-Rong Dai

Li-Rong Dai

Alert button

Adaptive Confidence Multi-View Hashing for Multimedia Retrieval

Add code
Bookmark button
Alert button
Dec 12, 2023
Jian Zhu, Yu Cui, Zhangmin Huang, Xingyu Li, Lei Liu, Lingfang Zeng, Li-Rong Dai

Viaarxiv icon

CASA-ASR: Context-Aware Speaker-Attributed ASR

Add code
Bookmark button
Alert button
May 21, 2023
Mohan Shi, Zhihao Du, Qian Chen, Fan Yu, Yangze Li, Shiliang Zhang, Jie Zhang, Li-Rong Dai

Figure 1 for CASA-ASR: Context-Aware Speaker-Attributed ASR
Figure 2 for CASA-ASR: Context-Aware Speaker-Attributed ASR
Figure 3 for CASA-ASR: Context-Aware Speaker-Attributed ASR
Figure 4 for CASA-ASR: Context-Aware Speaker-Attributed ASR
Viaarxiv icon

Semantic VAD: Low-Latency Voice Activity Detection for Speech Interaction

Add code
Bookmark button
Alert button
May 21, 2023
Mohan Shi, Yuchun Shu, Lingyun Zuo, Qian Chen, Shiliang Zhang, Jie Zhang, Li-Rong Dai

Figure 1 for Semantic VAD: Low-Latency Voice Activity Detection for Speech Interaction
Figure 2 for Semantic VAD: Low-Latency Voice Activity Detection for Speech Interaction
Figure 3 for Semantic VAD: Low-Latency Voice Activity Detection for Speech Interaction
Figure 4 for Semantic VAD: Low-Latency Voice Activity Detection for Speech Interaction
Viaarxiv icon

Joint Generative-Contrastive Representation Learning for Anomalous Sound Detection

Add code
Bookmark button
Alert button
May 20, 2023
Xiao-Min Zeng, Yan Song, Zhu Zhuo, Yu Zhou, Yu-Hong Li, Hui Xue, Li-Rong Dai, Ian McLoughlin

Figure 1 for Joint Generative-Contrastive Representation Learning for Anomalous Sound Detection
Figure 2 for Joint Generative-Contrastive Representation Learning for Anomalous Sound Detection
Figure 3 for Joint Generative-Contrastive Representation Learning for Anomalous Sound Detection
Viaarxiv icon

AST-SED: An Effective Sound Event Detection Method Based on Audio Spectrogram Transformer

Add code
Bookmark button
Alert button
Mar 07, 2023
Kang Li, Yan Song, Li-Rong Dai, Ian McLoughlin, Xin Fang, Lin Liu

Figure 1 for AST-SED: An Effective Sound Event Detection Method Based on Audio Spectrogram Transformer
Figure 2 for AST-SED: An Effective Sound Event Detection Method Based on Audio Spectrogram Transformer
Figure 3 for AST-SED: An Effective Sound Event Detection Method Based on Audio Spectrogram Transformer
Figure 4 for AST-SED: An Effective Sound Event Detection Method Based on Audio Spectrogram Transformer
Viaarxiv icon

A Comparative Study on multichannel Speaker-attributed automatic speech recognition in Multi-party Meetings

Add code
Bookmark button
Alert button
Nov 01, 2022
Mohan Shi, Jie Zhang, Zhihao Du, Fan Yu, Shiliang Zhang, Li-Rong Dai

Figure 1 for A Comparative Study on multichannel Speaker-attributed automatic speech recognition in Multi-party Meetings
Figure 2 for A Comparative Study on multichannel Speaker-attributed automatic speech recognition in Multi-party Meetings
Figure 3 for A Comparative Study on multichannel Speaker-attributed automatic speech recognition in Multi-party Meetings
Figure 4 for A Comparative Study on multichannel Speaker-attributed automatic speech recognition in Multi-party Meetings
Viaarxiv icon

Robust Data2vec: Noise-robust Speech Representation Learning for ASR by Combining Regression and Improved Contrastive Learning

Add code
Bookmark button
Alert button
Oct 27, 2022
Qiu-Shi Zhu, Long Zhou, Jie Zhang, Shu-Jie Liu, Yu-Chen Hu, Li-Rong Dai

Figure 1 for Robust Data2vec: Noise-robust Speech Representation Learning for ASR by Combining Regression and Improved Contrastive Learning
Figure 2 for Robust Data2vec: Noise-robust Speech Representation Learning for ASR by Combining Regression and Improved Contrastive Learning
Figure 3 for Robust Data2vec: Noise-robust Speech Representation Learning for ASR by Combining Regression and Improved Contrastive Learning
Figure 4 for Robust Data2vec: Noise-robust Speech Representation Learning for ASR by Combining Regression and Improved Contrastive Learning
Viaarxiv icon

Joint Training of Speech Enhancement and Self-supervised Model for Noise-robust ASR

Add code
Bookmark button
Alert button
May 26, 2022
Qiu-Shi Zhu, Jie Zhang, Zi-Qiang Zhang, Li-Rong Dai

Figure 1 for Joint Training of Speech Enhancement and Self-supervised Model for Noise-robust ASR
Figure 2 for Joint Training of Speech Enhancement and Self-supervised Model for Noise-robust ASR
Figure 3 for Joint Training of Speech Enhancement and Self-supervised Model for Noise-robust ASR
Figure 4 for Joint Training of Speech Enhancement and Self-supervised Model for Noise-robust ASR
Viaarxiv icon

A Complementary Joint Training Approach Using Unpaired Speech and Text for Low-Resource Automatic Speech Recognition

Add code
Bookmark button
Alert button
Apr 05, 2022
Ye-Qian Du, Jie Zhang, Qiu-Shi Zhu, Li-Rong Dai, Ming-Hui Wu, Xin Fang, Zhou-Wang Yang

Figure 1 for A Complementary Joint Training Approach Using Unpaired Speech and Text for Low-Resource Automatic Speech Recognition
Figure 2 for A Complementary Joint Training Approach Using Unpaired Speech and Text for Low-Resource Automatic Speech Recognition
Figure 3 for A Complementary Joint Training Approach Using Unpaired Speech and Text for Low-Resource Automatic Speech Recognition
Figure 4 for A Complementary Joint Training Approach Using Unpaired Speech and Text for Low-Resource Automatic Speech Recognition
Viaarxiv icon

Learning Contextually Fused Audio-visual Representations for Audio-visual Speech Recognition

Add code
Bookmark button
Alert button
Feb 15, 2022
Zi-Qiang Zhang, Jie Zhang, Jian-Shu Zhang, Ming-Hui Wu, Xin Fang, Li-Rong Dai

Figure 1 for Learning Contextually Fused Audio-visual Representations for Audio-visual Speech Recognition
Figure 2 for Learning Contextually Fused Audio-visual Representations for Audio-visual Speech Recognition
Figure 3 for Learning Contextually Fused Audio-visual Representations for Audio-visual Speech Recognition
Figure 4 for Learning Contextually Fused Audio-visual Representations for Audio-visual Speech Recognition
Viaarxiv icon