Picture for Wei Zou

Wei Zou

ChatHome: Development and Evaluation of a Domain-Specific Language Model for Home Renovation

Add code
Jul 28, 2023
Figure 1 for ChatHome: Development and Evaluation of a Domain-Specific Language Model for Home Renovation
Figure 2 for ChatHome: Development and Evaluation of a Domain-Specific Language Model for Home Renovation
Figure 3 for ChatHome: Development and Evaluation of a Domain-Specific Language Model for Home Renovation
Figure 4 for ChatHome: Development and Evaluation of a Domain-Specific Language Model for Home Renovation
Viaarxiv icon

Analyzing Robustness of End-to-End Neural Models for Automatic Speech Recognition

Add code
Aug 17, 2022
Figure 1 for Analyzing Robustness of End-to-End Neural Models for Automatic Speech Recognition
Figure 2 for Analyzing Robustness of End-to-End Neural Models for Automatic Speech Recognition
Figure 3 for Analyzing Robustness of End-to-End Neural Models for Automatic Speech Recognition
Figure 4 for Analyzing Robustness of End-to-End Neural Models for Automatic Speech Recognition
Viaarxiv icon

DSLA: Dynamic smooth label assignment for efficient anchor-free object detection

Add code
Aug 01, 2022
Figure 1 for DSLA: Dynamic smooth label assignment for efficient anchor-free object detection
Figure 2 for DSLA: Dynamic smooth label assignment for efficient anchor-free object detection
Figure 3 for DSLA: Dynamic smooth label assignment for efficient anchor-free object detection
Figure 4 for DSLA: Dynamic smooth label assignment for efficient anchor-free object detection
Viaarxiv icon

Audio-Visual Wake Word Spotting System For MISP Challenge 2021

Add code
Apr 20, 2022
Figure 1 for Audio-Visual Wake Word Spotting System For MISP Challenge 2021
Figure 2 for Audio-Visual Wake Word Spotting System For MISP Challenge 2021
Figure 3 for Audio-Visual Wake Word Spotting System For MISP Challenge 2021
Figure 4 for Audio-Visual Wake Word Spotting System For MISP Challenge 2021
Viaarxiv icon

Time Domain Adversarial Voice Conversion for ADD 2022

Add code
Apr 20, 2022
Figure 1 for Time Domain Adversarial Voice Conversion for ADD 2022
Figure 2 for Time Domain Adversarial Voice Conversion for ADD 2022
Figure 3 for Time Domain Adversarial Voice Conversion for ADD 2022
Figure 4 for Time Domain Adversarial Voice Conversion for ADD 2022
Viaarxiv icon

Audio Deep Fake Detection System with Neural Stitching for ADD 2022

Add code
Apr 20, 2022
Figure 1 for Audio Deep Fake Detection System with Neural Stitching for ADD 2022
Figure 2 for Audio Deep Fake Detection System with Neural Stitching for ADD 2022
Figure 3 for Audio Deep Fake Detection System with Neural Stitching for ADD 2022
Figure 4 for Audio Deep Fake Detection System with Neural Stitching for ADD 2022
Viaarxiv icon

GigaSpeech: An Evolving, Multi-domain ASR Corpus with 10,000 Hours of Transcribed Audio

Add code
Jun 13, 2021
Figure 1 for GigaSpeech: An Evolving, Multi-domain ASR Corpus with 10,000 Hours of Transcribed Audio
Figure 2 for GigaSpeech: An Evolving, Multi-domain ASR Corpus with 10,000 Hours of Transcribed Audio
Figure 3 for GigaSpeech: An Evolving, Multi-domain ASR Corpus with 10,000 Hours of Transcribed Audio
Figure 4 for GigaSpeech: An Evolving, Multi-domain ASR Corpus with 10,000 Hours of Transcribed Audio
Viaarxiv icon

Semantic Data Augmentation for End-to-End Mandarin Speech Recognition

Add code
Apr 26, 2021
Figure 1 for Semantic Data Augmentation for End-to-End Mandarin Speech Recognition
Figure 2 for Semantic Data Augmentation for End-to-End Mandarin Speech Recognition
Figure 3 for Semantic Data Augmentation for End-to-End Mandarin Speech Recognition
Figure 4 for Semantic Data Augmentation for End-to-End Mandarin Speech Recognition
Viaarxiv icon

Speech SIMCLR: Combining Contrastive and Reconstruction Objective for Self-supervised Speech Representation Learning

Add code
Oct 27, 2020
Figure 1 for Speech SIMCLR: Combining Contrastive and Reconstruction Objective for Self-supervised Speech Representation Learning
Figure 2 for Speech SIMCLR: Combining Contrastive and Reconstruction Objective for Self-supervised Speech Representation Learning
Figure 3 for Speech SIMCLR: Combining Contrastive and Reconstruction Objective for Self-supervised Speech Representation Learning
Figure 4 for Speech SIMCLR: Combining Contrastive and Reconstruction Objective for Self-supervised Speech Representation Learning
Viaarxiv icon

TMT: A Transformer-based Modal Translator for Improving Multimodal Sequence Representations in Audio Visual Scene-aware Dialog

Add code
Oct 21, 2020
Figure 1 for TMT: A Transformer-based Modal Translator for Improving Multimodal Sequence Representations in Audio Visual Scene-aware Dialog
Figure 2 for TMT: A Transformer-based Modal Translator for Improving Multimodal Sequence Representations in Audio Visual Scene-aware Dialog
Figure 3 for TMT: A Transformer-based Modal Translator for Improving Multimodal Sequence Representations in Audio Visual Scene-aware Dialog
Figure 4 for TMT: A Transformer-based Modal Translator for Improving Multimodal Sequence Representations in Audio Visual Scene-aware Dialog
Viaarxiv icon