Tom Ko

LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT

Mar 29, 2022

SpeechT5: Unified-Modal Encoder-Decoder Pre-training for Spoken Language Processing

Oct 14, 2021

Multi-View Self-Attention Based Transformer for Speaker Recognition

Oct 11, 2021

An Encoder-Decoder Based Audio Captioning System With Transfer and Reinforcement Learning

Aug 05, 2021

CL4AC: A Contrastive Loss for Audio Captioning

Jul 21, 2021

Token-Level Supervised Contrastive Learning for Punctuation Restoration

Jul 19, 2021

Exploring Machine Speech Chain for Domain Adaptation and Few-Shot Speaker Adaptation

Apr 08, 2021

Auto-KWS 2021 Challenge: Task, Datasets, and Baselines

Mar 31, 2021

AutoSpeech 2020: The Second Automated Machine Learning Challenge for Speech Classification

Oct 25, 2020

MetaMix: Improved Meta-Learning with Interpolation-based Consistency Regularization

Oct 10, 2020