Chutong Meng

RepCodec: A Speech Representation Codec for Speech Tokenization

Aug 31, 2023
Zhichao Huang, Chutong Meng, Tom Ko

Via arXiv

WavCaps: A ChatGPT-Assisted Weakly-Labelled Audio Captioning Dataset for Audio-Language Multimodal Research

Mar 30, 2023
Xinhao Mei, Chutong Meng, Haohe Liu, Qiuqiang Kong, Tom Ko, Chengqi Zhao, Mark D. Plumbley, Yuexian Zou, Wenwu Wang

Via arXiv

CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning

Oct 08, 2022
Chutong Meng, Junyi Ao, Tom Ko, Mingxuan Wang, Haizhou Li

Via arXiv

GigaST: A 10,000-hour Pseudo Speech Translation Corpus

Apr 08, 2022
Rong Ye, Chengqi Zhao, Tom Ko, Chutong Meng, Tao Wang, Mingxuan Wang, Jun Cao

Via arXiv