Alert button
Picture for Haizhou Li

Haizhou Li

Alert button

Investigating the Impact of Pre-trained Language Models on Dialog Evaluation

Add code
Bookmark button
Alert button
Oct 05, 2021
Chen Zhang, Luis Fernando D'Haro, Yiming Chen, Thomas Friedrichs, Haizhou Li

Figure 1 for Investigating the Impact of Pre-trained Language Models on Dialog Evaluation
Figure 2 for Investigating the Impact of Pre-trained Language Models on Dialog Evaluation
Figure 3 for Investigating the Impact of Pre-trained Language Models on Dialog Evaluation
Figure 4 for Investigating the Impact of Pre-trained Language Models on Dialog Evaluation
Viaarxiv icon

Revisiting Self-Training for Few-Shot Learning of Language Model

Add code
Bookmark button
Alert button
Oct 04, 2021
Yiming Chen, Yan Zhang, Chen Zhang, Grandee Lee, Ran Cheng, Haizhou Li

Figure 1 for Revisiting Self-Training for Few-Shot Learning of Language Model
Figure 2 for Revisiting Self-Training for Few-Shot Learning of Language Model
Figure 3 for Revisiting Self-Training for Few-Shot Learning of Language Model
Figure 4 for Revisiting Self-Training for Few-Shot Learning of Language Model
Viaarxiv icon

PL-EESR: Perceptual Loss Based END-TO-END Robust Speaker Representation Extraction

Add code
Bookmark button
Alert button
Oct 03, 2021
Yi Ma, Kong Aik Lee, Ville Hautamaki, Haizhou Li

Figure 1 for PL-EESR: Perceptual Loss Based END-TO-END Robust Speaker Representation Extraction
Figure 2 for PL-EESR: Perceptual Loss Based END-TO-END Robust Speaker Representation Extraction
Figure 3 for PL-EESR: Perceptual Loss Based END-TO-END Robust Speaker Representation Extraction
Figure 4 for PL-EESR: Perceptual Loss Based END-TO-END Robust Speaker Representation Extraction
Viaarxiv icon

USEV: Universal Speaker Extraction with Visual Cue

Add code
Bookmark button
Alert button
Sep 30, 2021
Zexu Pan, Meng Ge, Haizhou Li

Figure 1 for USEV: Universal Speaker Extraction with Visual Cue
Figure 2 for USEV: Universal Speaker Extraction with Visual Cue
Figure 3 for USEV: Universal Speaker Extraction with Visual Cue
Figure 4 for USEV: Universal Speaker Extraction with Visual Cue
Viaarxiv icon

Exploring Teacher-Student Learning Approach for Multi-lingual Speech-to-Intent Classification

Add code
Bookmark button
Alert button
Sep 28, 2021
Bidisha Sharma, Maulik Madhavi, Xuehao Zhou, Haizhou Li

Figure 1 for Exploring Teacher-Student Learning Approach for Multi-lingual Speech-to-Intent Classification
Figure 2 for Exploring Teacher-Student Learning Approach for Multi-lingual Speech-to-Intent Classification
Figure 3 for Exploring Teacher-Student Learning Approach for Multi-lingual Speech-to-Intent Classification
Figure 4 for Exploring Teacher-Student Learning Approach for Multi-lingual Speech-to-Intent Classification
Viaarxiv icon

Knowledge Distillation from BERT Transformer to Speech Transformer for Intent Classification

Add code
Bookmark button
Alert button
Aug 05, 2021
Yidi Jiang, Bidisha Sharma, Maulik Madhavi, Haizhou Li

Figure 1 for Knowledge Distillation from BERT Transformer to Speech Transformer for Intent Classification
Figure 2 for Knowledge Distillation from BERT Transformer to Speech Transformer for Intent Classification
Figure 3 for Knowledge Distillation from BERT Transformer to Speech Transformer for Intent Classification
Figure 4 for Knowledge Distillation from BERT Transformer to Speech Transformer for Intent Classification
Viaarxiv icon

SLoClas: A Database for Joint Sound Localization and Classification

Add code
Bookmark button
Alert button
Aug 05, 2021
Xinyuan Qian, Bidisha Sharma, Amine El Abridi, Haizhou Li

Figure 1 for SLoClas: A Database for Joint Sound Localization and Classification
Figure 2 for SLoClas: A Database for Joint Sound Localization and Classification
Figure 3 for SLoClas: A Database for Joint Sound Localization and Classification
Figure 4 for SLoClas: A Database for Joint Sound Localization and Classification
Viaarxiv icon

Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection

Add code
Bookmark button
Alert button
Jul 25, 2021
Ruijie Tao, Zexu Pan, Rohan Kumar Das, Xinyuan Qian, Mike Zheng Shou, Haizhou Li

Figure 1 for Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection
Figure 2 for Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection
Figure 3 for Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection
Figure 4 for Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection
Viaarxiv icon

Serialized Multi-Layer Multi-Head Attention for Neural Speaker Embedding

Add code
Bookmark button
Alert button
Jul 14, 2021
Hongning Zhu, Kong Aik Lee, Haizhou Li

Figure 1 for Serialized Multi-Layer Multi-Head Attention for Neural Speaker Embedding
Figure 2 for Serialized Multi-Layer Multi-Head Attention for Neural Speaker Embedding
Figure 3 for Serialized Multi-Layer Multi-Head Attention for Neural Speaker Embedding
Figure 4 for Serialized Multi-Layer Multi-Head Attention for Neural Speaker Embedding
Viaarxiv icon